Task preference optimization: improving multimodal large language models with vision task alignment2025年1月1日·Ziang Yan,Zhilin Li,Yinan He,Chenting Wang,Kunchang Li,Xinhao Li,Xiangyu Zeng,Zilei Wang,Yali Wang,Yu Qiao· 0 分钟阅读时长 引用 URL类型会议文章出版物Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition最近更新于 2025年1月1日← Steady progress beats stagnation: mutual aid of foundation and conventional models in mixed domain semi-supervised medical image segmentation 2025年1月1日Taste more, taste better: diverse data and strong model boost semi-supervised crowd counting 2025年1月1日 →