Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models. International Conference on Machine Learning (ICML), 2024.
Direct Preference Optimization: Your Language Model is Secretly a Reward Model. Neural Information Processing Systems (NeurIPS), 2023.
Tab-CoT: Zero-shot Tabular Chain of Thought. Ziqi Jin, Wei Lu. Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
Large Language Models Can Self-Improve. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Language Models are Multilingual Chain-of-Thought Reasoners. International Conference on Learning Representations (ICLR), 2022.
Large Language Models are Zero-Shot Reasoners. Neural Information Processing Systems (NeurIPS), 2022.
Self-Consistency Improves Chain of Thought Reasoning in Language Models. International Conference on Learning Representations (ICLR), 2022.
Training language models to follow instructions with human feedback. Neural Information Processing Systems (NeurIPS), 2022.
GLM: General Language Model Pretraining with Autoregressive Blank Infilling. Annual Meeting of the Association for Computational Linguistics (ACL), 2021.
Are NLP Models really able to Solve Simple Math Word Problems? North American Chapter of the Association for Computational Linguistics (NAACL), 2021.
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies. Transactions of the Association for Computational Linguistics (TACL), 2021.
Language Models are Few-Shot Learners. Neural Information Processing Systems (NeurIPS), 2020.