Visual Computing Center
VLMs
Towards Scalable and Structured Understanding in Visual LLMs
Mohamed Elhoseiny, Associate Professor, Computer Science
Feb 23, 12:00 - 13:00
B9 L2 R2325
LLM
Visual Language Models
VLMs
visual computing
In this talk, we explore a suite of recent advances toward scalable, structured video comprehension using large vision-language models for video (Video LLMs).