ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.15383
  4. Cited By

Qwen2.5-1M Technical Report

28 January 2025
A. Yang
Bowen Yu
Chong Li
Dayiheng Liu
Fei Huang
Haoyan Huang
Jiandong Jiang
Jianhong Tu
J. Zhang
Jingren Zhou
Junyang Lin
K. Dang
Kexin Yang
Le Yu
Mei-Jiu Li
Minmin Sun
Qin Zhu
Rui Men
Tao He
Weijia Xu
Wenbiao Yin
Wenyuan Yu
Xiafei Qiu
Xingzhang Ren
Xinlong Yang
Y. Li
Zhiying Xu
Z. Zhang
ArXivPDFHTML

Papers citing "Qwen2.5-1M Technical Report"

7 / 7 papers shown
Title
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue
Zhiqi Chen
Rui Lu
Andrew Zhao
Zhaokai Wang
Yang Yue
Shiji Song
Gao Huang
ReLM
LRM
58
12
0
18 Apr 2025
SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
Krishna C. Puvvada
Faisal Ladhak
Santiago Akle Serrano
Cheng-Ping Hsieh
Shantanu Acharya
...
Fei Jia
Samuel Kriman
Simeng Sun
Dima Rekesh
Boris Ginsburg
RALM
60
0
0
11 Apr 2025
From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models
From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models
C. Xu
Ming-Yu Liu
P. Xu
Z. Liu
Wei Ping
M. Shoeybi
Bo Li
Bryan Catanzaro
22
1
0
08 Apr 2025
The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
Yining Wang
Yixuan Wang
Xi Li
Mi Zhang
Geng Hong
Min Yang
AAML
HILM
67
0
0
01 Apr 2025
ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models
ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models
Chung-En Sun
Ge Yan
Tsui-Wei Weng
KELM
LRM
62
0
0
27 Mar 2025
Zero-Shot Multi-Label Classification of Bangla Documents: Large Decoders Vs. Classic Encoders
Souvika Sarkar
M. Hasan
S. Karmaker
41
0
0
04 Mar 2025
System Message Generation for User Preferences using Open-Source Models
System Message Generation for User Preferences using Open-Source Models
Minbyul Jeong
Jungho Cho
Minsoo Khang
Dawoon Jung
Teakgyu Hong
41
0
0
17 Feb 2025
1