Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang
(2025).
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning.
The Thirteenth International Conference on Learning Representations.