南京大学大模型研究协同创新中心 Large Model Innovation Center
  • Home
  • News
  • Research
  • Publication
  • 中文 (简体)

Contextual AD narration with interleaved multimodal sequence

Jan 1, 2025·
Hanlin Wang
,
Zhan Tong
,
Kecheng Zheng
,
Yujun Shen
Limin Wang
Limin Wang
· 0 min read
Cite URL
Type
Conference paper
Publication
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Last updated on Jan 1, 2025
Limin Wang
Authors
Limin Wang
Nanjing University

← CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Jan 1, 2025
LeviTor: 3D trajectory oriented image-to-video synthesis Jan 1, 2025 →
Languages:
English
中文 (简体)

© 2025 Large Model Innovation Center, Nanjing University. This work is licensed under CC BY NC ND 4.0

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.