ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.21074
  4. Cited By
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
v1v2v3 (latest)

CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation

28 February 2025
Zhenyi Shen
Hanqi Yan
Linhai Zhang
Zhanghao Hu
Yali Du
Yulan He
    LRM
ArXiv (abs)PDFHTMLGithub (25★)

Papers citing "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"

39 / 89 papers shown
Title
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization
Chengyu Huang
Zhengxin Zhang
Claire Cardie
LRM
320
6
0
16 May 2025
Dynamic Early Exit in Reasoning Models
Dynamic Early Exit in Reasoning Models
Chenxu Yang
Qingyi Si
Yongjie Duan
Zheliang Zhu
Chenyu Zhu
Zheng Lin
Zheng Lin
Li Cao
Weiping Wang
ReLMLRM
499
93
0
22 Apr 2025
Efficient Reasoning Models: A Survey
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLMLRM
862
40
0
15 Apr 2025
Reasoning Models Can Be Effective Without Thinking
Reasoning Models Can Be Effective Without Thinking
Wenjie Ma
Jingxuan He
Charlie Snell
Tyler Griggs
Sewon Min
Matei A. Zaharia
ReLMLRM
364
106
1
14 Apr 2025
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
Bairu Hou
Yang Zhang
Jiabao Ji
Yujian Liu
Kaizhi Qian
Jacob Andreas
Shiyu Chang
OffRLLRM
286
73
0
02 Apr 2025
Efficient Inference for Large Reasoning Models: A Survey
Efficient Inference for Large Reasoning Models: A Survey
Yi Liu
Jiaying Wu
Yufei He
Hongcheng Gao
Hongyu Chen
...
Xu Cheng
Zhiqi Huang
Bryan Hooi
Stan Z. Li
Keqin Li
LLMAGLRM
500
49
0
29 Mar 2025
Learning to Instruct for Visual Instruction Tuning
Learning to Instruct for Visual Instruction Tuning
Zhihan Zhou
Feng Hong
Jiaan Luo
Jiangchao Yao
Dongsheng Li
Bo Han
Yujiao Shi
Yanfeng Wang
VLM
379
3
0
28 Mar 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Andrew Wen
Shaochen
Zhong
Hanjie Chen
Helen Zhou
OffRLReLMLRM
680
255
0
20 Mar 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
Yi Shen
Jing Zhang
Jieyun Huang
Shuming Shi
Wenjing Zhang
Jiangze Yan
Rongjia Du
Ning Wang
Kai Wang
Shiguo Lian
LRM
469
112
0
06 Mar 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Reasoning with Latent Thoughts: On the Power of Looped TransformersInternational Conference on Learning Representations (ICLR), 2025
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRLLRMAI4CE
391
62
0
24 Feb 2025
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yige Xu
Xu Guo
Zhiwei Zeng
Chunyan Miao
LLMAGCLLLRM
374
55
0
17 Feb 2025
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Qifan Yu
Zhenyu He
Sijie Li
Xun Zhou
Jun Zhang
Jingjing Xu
Di He
OffRLLRM
298
13
0
12 Feb 2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
DiJia Su
Hanlin Zhu
Yingchen Xu
Jiantao Jiao
Yuandong Tian
Qinqing Zheng
LRM
325
25
0
05 Feb 2025
Deliberation in Latent Space via Differentiable Cache Augmentation
Deliberation in Latent Space via Differentiable Cache Augmentation
Luyang Liu
Jonas Pfeiffer
Jiaxing Wu
Jun Xie
Arthur Szlam
RALM
116
13
0
23 Dec 2024
Compressed Chain of Thought: Efficient Reasoning Through Dense
  Representations
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Jeffrey Cheng
Benjamin Van Durme
LRM
288
99
0
17 Dec 2024
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
LLMs Know More Than They Show: On the Intrinsic Representation of LLM HallucinationsInternational Conference on Learning Representations (ICLR), 2024
Hadas Orgad
Michael Toker
Zorik Gekhman
Roi Reichart
Idan Szpektor
Hadas Kotek
Yonatan Belinkov
HILMAIFin
655
109
0
03 Oct 2024
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by
  Step
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
Yuntian Deng
Yejin Choi
Stuart M. Shieber
ReLMLRM
231
119
0
23 May 2024
Let's Think Dot by Dot: Hidden Computation in Transformer Language
  Models
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
Jacob Pfau
William Merrill
Samuel R. Bowman
LRM
256
127
0
24 Apr 2024
In-Context Learning State Vector with Inner and Momentum Optimization
In-Context Learning State Vector with Inner and Momentum Optimization
Dongfang Li
Zhenyu Liu
Xinshuo Hu
Zetian Sun
Baotian Hu
Min Zhang
244
12
0
17 Apr 2024
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Zhaorui Yang
Tianyu Pang
Hao Feng
Han Wang
Wei Chen
Minfeng Zhu
Qian Liu
ALM
278
76
0
21 Feb 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial
  Problems
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Zhiyuan Li
Hong Liu
Denny Zhou
Tengyu Ma
LRMAI4CE
359
204
0
20 Feb 2024
In-context Vectors: Making In Context Learning More Effective and
  Controllable Through Latent Space Steering
In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space SteeringInternational Conference on Machine Learning (ICML), 2023
Sheng Liu
Haotian Ye
Lei Xing
James Y. Zou
234
198
0
11 Nov 2023
What Formal Languages Can Transformers Express? A Survey
What Formal Languages Can Transformers Express? A SurveyTransactions of the Association for Computational Linguistics (TACL), 2023
Lena Strobl
William Merrill
Gail Weiss
David Chiang
Dana Angluin
AI4CE
414
95
0
01 Nov 2023
The Expressive Power of Transformers with Chain of Thought
The Expressive Power of Transformers with Chain of Thought
William Merrill
Ashish Sabharwal
LRMAI4CEReLM
463
41
0
11 Oct 2023
Think before you speak: Training Language Models With Pause Tokens
Think before you speak: Training Language Models With Pause TokensInternational Conference on Learning Representations (ICLR), 2023
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
374
183
0
03 Oct 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less
  Training Data and Smaller Model Sizes
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model SizesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Lokesh Nagalapatti
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
720
715
0
03 May 2023
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Self-Instruct: Aligning Language Models with Self-Generated InstructionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yizhong Wang
Yeganeh Kordi
Swaroop Mishra
Alisa Liu
Noah A. Smith
Daniel Khashabi
Hannaneh Hajishirzi
ALMSyDaLRM
745
2,781
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Large Language Models Are Reasoning TeachersAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLMELMLRM
316
430
0
20 Dec 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
2.0K
17,148
0
04 Mar 2022
Training Verifiers to Solve Math Word Problems
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLMOffRLLRM
1.0K
6,657
0
27 Oct 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
The Power of Scale for Parameter-Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
1.3K
4,893
0
18 Apr 2021
Are NLP Models really able to Solve Simple Math Word Problems?
Are NLP Models really able to Solve Simple Math Word Problems?North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Arkil Patel
S. Bhattamishra
Navin Goyal
ReLMLRM
338
1,051
0
12 Mar 2021
Analyzing Curriculum Learning for Sentiment Analysis along Task
  Difficulty, Pacing and Visualization Axes
Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization AxesWorkshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2021
Anvesh Rao Vijjini
Kaveri Anuranjana
R. Mamidi
229
4
0
19 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Abigail Z. Jacobs
650
5,139
0
01 Jan 2021
Multi-Task Learning with Deep Neural Networks: A Survey
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
426
712
0
10 Sep 2020
Knowledge Distillation: A Survey
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
1.8K
3,645
0
09 Jun 2020
CommonsenseQA: A Question Answering Challenge Targeting Commonsense
  Knowledge
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
332
2,118
0
02 Nov 2018
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
R. Speer
Joshua Chin
Catherine Havasi
741
3,129
0
12 Dec 2016
Solving General Arithmetic Word Problems
Solving General Arithmetic Word Problems
Subhro Roy
Dan Roth
AIMat
265
560
0
04 Aug 2016
Previous
12