MobileViCLIP: an efficient video-text model for mobile devices

2025年8月12日·

Min Yang

,

Zihan Jia

,

Zhilin Dai

,

Sheng Guo

Limin Wang

Limin Wang

· 0 分钟阅读时长

类型

出版物

Proceedings of the IEEE/CVF International Conference on Computer Vision

最近更新于 2025年8月12日

Limin Wang

Authors

← Make your training flexible: towards deployment-efficient video models 2025年8月12日

p-MoD: building mixture-of-depths MLLMs via progressive ratio decay 2025年8月12日 →