Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation Inclusion AI Bowen Ma Cheng Zou C. Yan Chunxiang Jin ...Zhiqiang Fang Zhihao Qiu Ziyuan Huang Zizheng Yang Zhengyu He |
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence J. Yang Wei Emma Zhang Shark Liu J. Wu Shawn Guo ...Zizheng Zhan Jiajun Zhang Jie Zhang Zhaoxiang Zhang Bo Zheng |
NOSA: Native and Offloadable Sparse Attention Yuxiang Huang Chaojun Xiao Xu Han Zhiyuan Liu Zhou Su ...Hengyu Zhao Yudong Wang Chaojun Xiao Xu Han Zhiyuan Liu |
DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context ParallelismSymposium on Operating Systems Principles (SOSP), 2025 |