MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

22 July 2025
Run-Ze Fan
Zengzhi Wang
Pengfei Liu
    LRM
ArXiv (abs) | PDF | HTML | HuggingFace (45 upvotes) | GitHub (714★)

Papers citing "MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning"

9 / 9 papers shown
SkyRL-Agent: Efficient RL Training for Multi-turn LLM Agent
Shiyi Cao
Dacheng Li
Fangzhou Zhao
Shuo Yuan
Sumanth R. Hegde
...
Richard Liaw
Philipp Moritz
Matei A. Zaharia
Joseph E. Gonzalez
Ion Stoica
136
2
0
20 Nov 2025
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Baolong Bi
Shenghua Liu
Yiwei Wang
Siqian Tong
Lingrui Mei
Yuyao Ge
Yilong Xu
Jiafeng Guo
Xueqi Cheng
OffRL, LRM
266
5
0
15 Nov 2025
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
MiroMind Team
Song Bai
Lidong Bing
Carson Chen
Guanzheng Chen
...
T. Zhao
Xizhou Zhu
Yanpeng Zhou
Y. Zhang
Zhi Zhu
LLMAG, LRM, VLM
302
5
0
14 Nov 2025
MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
Yu Zhang
Hui-Ling Zhen
Mingxuan Yuan
Bei Yu
MQ
325
0
0
08 Nov 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
Xuanzhong Chen
Zile Qiao
Guoxin Chen
L. Su
Zhen Zhang
Xinyu Wang
Pengjun Xie
Fei Huang
Jingren Zhou
Yong Jiang
LLMAG, ELM
168
3
0
28 Oct 2025
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
Wenkai Yang
Weijie Liu
Ruobing Xie
Yiju Guo
Lulu Wu
Saiyong Yang
Yankai Lin
LRM
127
1
0
16 Oct 2025
Demystifying Reinforcement Learning in Agentic Reasoning
Zhaochen Yu
Ling Yang
Jiaru Zou
Shuicheng Yan
Mengdi Wang
AI4TS, LRM
269
6
0
13 Oct 2025
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
Shuang Chen
Yue Guo
Yimeng Ye
Shijue Huang
Wenbo Hu
Haoxi Li
Manyuan Zhang
Jiayu Chen
Song Guo
Nanyun Peng
LRM
147
3
0
09 Oct 2025
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Leitian Tao
I. Kulikov
Swarnadeep Saha
Tianlu Wang
Jing Xu
Yixuan Li
Jason Weston
Ping Yu
OffRL, LRM
255
5
0
08 Oct 2025