
Crystal: Introspective Reasoners Reinforced with Self-Feedback

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

7 October 2023

Yejin Choi


Papers citing "Crystal: Introspective Reasoners Reinforced with Self-Feedback"


ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering

Francesco Maria Molfese

196

10 Oct 2025

Scalable Complexity Control Facilitates Reasoning Ability of LLMs

...

238

29 May 2025

ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Francesco Maria Molfese

Simone Conia

Riccardo Orlando

Roberto Navigli

ReLM LRM RALM

230

07 Oct 2024

Rationale-Aware Answer Verification by Pairwise Self-Evaluation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Akira Kawabata

Saku Sugawara

LRM

393

07 Oct 2024

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Nathan Lambert

Yejin Choi

Hannaneh Hajishirzi

351

100

13 Jun 2024

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

441

23 May 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Peng Wang

Fei Huang

Huajun Chen

Ningyu Zhang

RALM

204

23 May 2024

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

496

219

01 May 2024

SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses

AAAI Conference on Artificial Intelligence (AAAI), 2024

Dongwei Jiang

Jingyu Zhang

Orion Weller

Nathaniel Weir

Benjamin Van Durme

Daniel Khashabi

242

04 Apr 2024

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

798

254

14 Mar 2024

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning

Kang Liu

Jun Zhao

LRM

329

28 Feb 2024

Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?

Xianpei Han

Yaojie Lu

320

22 Feb 2024

Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

Boi Faltings

488

21 Feb 2024

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Maarten de Rijke

282

17 Feb 2024

Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

559

240

27 Sep 2023