Stochastic layer-wise shuffle for improving Vision Mamba training

Jul 18, 2025·

Zizheng Huang

,

Haoxing Chen

,

Jiaqi Li

,

Jun Lan

,

Huijia Zhu

,

Weiqiang Wang

Limin Wang

Limin Wang

· 0 min read

Type

Conference paper

Publication

Proceedings of the International Conference on Machine Learning

Last updated on Jul 18, 2025

Limin Wang

Authors

Nanjing University

← On the tension between Byzantine robustness and no-attack accuracy in distributed learning Jul 18, 2025

AutoLUT: LUT-based image super-resolution with automatic sampling and adaptive residual learning Apr 20, 2025 →