2508.21113
Cited By
v1
v2 (latest)
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
28 August 2025
Qi Yang
Bolin Ni
Shiming Xiang
Han Hu
Houwen Peng
Jie Jiang
LRM
HuggingFace (100 upvotes)
Github (93★)
Papers citing "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning"
1 / 1 papers shown
Title
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition
Sina J. Semnani
Han Zhang
Xinyan He
Merve Tekgürler
Monica S. Lam
3DV
32
0
0
24 Sep 2025