Contextual AD narration with interleaved multimodal sequenceApr 20, 2025·Hanlin Wang,Zhan Tong,Kecheng Zheng,Yujun ShenLimin Wang· 0 min read Cite URLTypeConference paperPublicationProceedings of the IEEE/CVF Conference on Computer Vision and Pattern RecognitionLast updated on Apr 20, 2025AuthorsLimin WangNanjing University← CATANet: efficient content-aware token aggregation for lightweight image super-resolution Apr 20, 2025LeviTor: 3D trajectory oriented image-to-video synthesis Apr 20, 2025 →