ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.07900
  4. Cited By
MiniCPM4: Ultra-Efficient LLMs on End Devices
v1v2 (latest)

MiniCPM4: Ultra-Efficient LLMs on End Devices

9 June 2025
MiniCPM Team
Chaojun Xiao
Yuxuan Li
Xu Han
Yuzhuo Bai
Jie Cai
Wei Xu
Wentong Chen
Xin Cong
Ganqu Cui
Ning Ding
Shengdan Fan
Yewei Fang
Z. Fu
Wenyu Guan
Yitong Guan
Junshao Guo
Yufeng Han
Bingxiang He
Yuxiang Huang
Cunliang Kong
Cunliang Kong
Siyuan Li
Siyuan Li
Yanghao Li
Yishan Li
Zhen Li
Dan Liu
Zhen Li
Y. Lin
Xiang Long
Quanyu Lu
Yaxi Lu
Peiyan Luo
Hongya Lyu
Litu Ou
Yinxu Pan
Zekai Qu
Qundong Shi
Zijun Song
Jiayuan Su
Zhou Su
Ao Sun
Xianghui Sun
Peijun Tang
Fangzheng Wang
Feng Wang
Peijun Tang
Yudong Wang
Yesai Wu
S. Wang
Jie Xie
Zihao Xie
Y. Yan
Zhenyu Xiao
Kaihuo Zhang
Lei Zhang
L. Zhang
Xueren Zhang
Qixin Xu
H. Vicky Zhao
Weilin Zhao
Lei Zhang
Yuanqian Zhao
Zhi Zheng
Yudi Zhang
Jie Zhou
Wei Zhou
Weilun Zhao
Zixuan Zhou
Zhiyuan Liu
Chuyue Zhou
Ge Zhou
Jie Zhou
Wei Zhou
Yanghao Zhou
Zihan Zhou
Z. Zhou
Zhiyuan Liu
Guoyang Zeng
Chao Jia
Dahai Li
Maosong Sun
    MLLM
ArXiv (abs)PDFHTMLHuggingFace (83 upvotes)

Papers citing "MiniCPM4: Ultra-Efficient LLMs on End Devices"

14 / 14 papers shown
Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM
Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM
Yang Liu
Xiaolong Zhong
Ling Jiang
LLMAGMUMoELRM
382
0
0
23 Nov 2025
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Tianyu Fu
Yichen You
Z. Chen
Guohao Dai
Huazhong Yang
Yu Wang
LRM
206
1
0
11 Nov 2025
Kimi Linear: An Expressive, Efficient Attention Architecture
Kimi Linear: An Expressive, Efficient Attention Architecture
Kimi Team
Yu Zhang
Zongyu Lin
Xingcheng Yao
J. Hu
...
Guokun Lai
Yuxin Wu
Xinyu Zhou
Zhilin Yang
Yulun Du
143
20
0
30 Oct 2025
MoPHES:Leveraging on-device LLMs as Agent for Mobile Psychological Health Evaluation and Support
MoPHES:Leveraging on-device LLMs as Agent for Mobile Psychological Health Evaluation and Support
Xun Wei
Pukai Zhou
Zeyu Wang
AI4MH
207
0
0
17 Oct 2025
BitNet Distillation
BitNet Distillation
Xun Wu
Shaohan Huang
Wenhui Wang
Ting Song
Li Dong
Yan Xia
Furu Wei
MQ
185
0
0
15 Oct 2025
Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
Jiuan Zhou
Yu Cheng
Yuan Xie
Z. Yin
128
1
0
08 Oct 2025
Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation
Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation
Miao Rang
Zhenni Bi
Hang Zhou
Hanting Chen
An Xiao
Tianyu Guo
Kai Han
Xinghao Chen
Yunhe Wang
168
2
0
30 Sep 2025
ProxyAttn: Guided Sparse Attention via Representative Heads
ProxyAttn: Guided Sparse Attention via Representative Heads
Yixuan Wang
H. He
Siqi Bao
H. Wu
Haifeng Wang
Qingfu Zhu
Wanxiang Che
156
1
0
29 Sep 2025
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Yixuan Zhou
Guoyang Zeng
Xin Liu
Xiang Li
Renjie Yu
...
Weiyue Sun
Jiancheng Gui
Kehan Li
Z. Wu
Zhiyuan Liu
143
5
0
29 Sep 2025
Tequila: Trapping-free Ternary Quantization for Large Language Models
Tequila: Trapping-free Ternary Quantization for Large Language Models
Hong Huang
Decheng Wu
Rui Cen
Guanghua Yu
Z. Li
Kai Liu
Jianchen Zhu
Peng Chen
Xue Liu
Dapeng Wu
MQ
272
3
0
28 Sep 2025
Predicting LLM Reasoning Performance with Small Proxy Model
Predicting LLM Reasoning Performance with Small Proxy Model
Woosung Koh
Juyoung Suk
Sungjun Han
Se-Young Yun
Jay Shin
LRMAI4CE
280
0
0
25 Sep 2025
E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model
E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model
Ronghao Lin
Shuai Shen
Weipeng Hu
Qiaolin He
Aolin Xiong
Li Huang
Haifeng Hu
Y. Tan
118
0
0
18 Aug 2025
iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
iFairy: the First 2-bit Complex LLM with All Parameters in {±1,±i}\{\pm1, \pm i\}{±1,±i}
Feiyu Wang
Guoan Wang
Yihao Zhang
S. Wang
Weitao Li
Bokai Huang
Shimao Chen
Z. L. Jiang
Rui Xu
Tong Yang
MQ
245
6
0
07 Aug 2025
AGORA: Incentivizing Group Emergence Capability in LLMs via Group Distillation
AGORA: Incentivizing Group Emergence Capability in LLMs via Group Distillation
Ren Zhuang
Ben Wang
Shuifa Sun
LRM
130
0
0
25 Jul 2025
1
Page 1 of 1