Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2507.16812
Cited By

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

v1v2 (latest)

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

22 July 2025

ArXiv (abs)PDF HTML HuggingFace (45 upvotes)Github (714★)

Papers citing "MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning"

9 / 9 papers shown

SkyRL-Agent: Efficient RL Training for Multi-turn LLM Agent

Sumanth R. Hegde

...

Matei A. Zaharia

Joseph E. Gonzalez

136

2

0

20 Nov 2025

Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning

Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning

266

5

0

15 Nov 2025

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

...

302

5

0

14 Nov 2025

MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling

MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling

325

0

0

08 Nov 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

168

3

0

28 Oct 2025

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

127

1

0

16 Oct 2025

Demystifying Reinforcement Learning in Agentic Reasoning

Demystifying Reinforcement Learning in Agentic Reasoning

269

6

0

13 Oct 2025

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

147

3

0

09 Oct 2025

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Swarnadeep Saha

255

5

0

08 Oct 2025

Page 1 of 1