BradyFU / Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
10.88k stars, 721 forks

Add LITA: Language Instructed Temporal-Localization Assistant #143

Open tongda opened 3 months ago

tongda commented 3 months ago

Nice work that focuses on temporal localization when generating captions or answering questions about a video.