ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.18454
  4. Cited By
Hybrid Latent Reasoning via Reinforcement Learning

Hybrid Latent Reasoning via Reinforcement Learning

24 May 2025
Zhenrui Yue
Bowen Jin
Huimin Zeng
Honglei Zhuang
Zhen Qin
Jinsung Yoon
Lanyu Shang
Jiawei Han
Dong Wang
    OffRLBDLLRM
ArXiv (abs)PDFHTML

Papers citing "Hybrid Latent Reasoning via Reinforcement Learning"

8 / 8 papers shown
Title
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Dong Wang
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALMReLMKELMOffRLAI4TSLRM
226
122
0
12 Mar 2025
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
Zhenyi Shen
Hanqi Yan
Linhai Zhang
Zhanghao Hu
Yali Du
Yulan He
LRM
170
27
0
28 Feb 2025
Vector-ICL: In-context Learning with Continuous Vector Representations
Vector-ICL: In-context Learning with Continuous Vector Representations
Yufan Zhuang
Chandan Singh
Liyuan Liu
Jingbo Shang
Jianfeng Gao
134
7
0
21 Feb 2025
LLM Pretraining with Continuous Concepts
LLM Pretraining with Continuous Concepts
Jihoon Tack
Jack Lanchantin
Jane Dwivedi-Yu
Andrew Cohen
Ilia Kulikov
Janice Lan
Shibo Hao
Yuandong Tian
Jason Weston
Xian Li
CLL
146
4
0
12 Feb 2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
DiJia Su
Hanlin Zhu
Yingchen Xu
Jiantao Jiao
Yuandong Tian
Qinqing Zheng
LRM
142
22
0
05 Feb 2025
Latent Thought Models with Variational Bayes Inference-Time Computation
Latent Thought Models with Variational Bayes Inference-Time Computation
Deqian Kong
Minglu Zhao
Dehong Xu
Bo Pang
Shu Wang
...
Zhangzhang Si
Chuan Li
Jianwen Xie
Sirui Xie
Ying Nian Wu
VLMLRMBDL
143
10
0
03 Feb 2025
Efficient Reasoning with Hidden Thinking
Efficient Reasoning with Hidden Thinking
Xuan Shen
Yizhou Wang
Xiangxi Shi
Yanzhi Wang
Pu Zhao
Jiuxiang Gu
LRM
106
16
0
31 Jan 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLMVLMOffRLAI4TSLRM
392
2,024
0
22 Jan 2025
1