Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.20050
Cited By
Let's Verify Step by Step
International Conference on Learning Representations (ICLR), 2023
31 May 2023
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (10 upvotes)
Papers citing
"Let's Verify Step by Step"
39 / 1,389 papers shown
Title
Chain-of-Thought Reasoning is a Policy Improvement Operator
Hugh Zhang
David C. Parkes
ReLM
LM&Ro
LRM
183
15
0
15 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
International Conference on Machine Learning (ICML), 2023
Eran Malach
LRM
200
53
0
13 Sep 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
397
27
0
07 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Computational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
Shuming Shi
LRM
RALM
HILM
646
790
0
03 Sep 2023
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
International Conference on Learning Representations (ICLR), 2023
Hritik Bansal
John Dang
Aditya Grover
ALM
206
25
0
30 Aug 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Ran Bi
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MA
SyDa
162
14
0
27 Aug 2023
Large Language Models Streamline Automated Machine Learning for Clinical Studies
Nature Communications (Nat. Commun.), 2023
Soroosh Tayebi Arasteh
T. Han
Mahshad Lotfinia
Christiane Kuhl
Jakob Nikolas Kather
Daniel Truhn
S. Nebelung
ELM
LM&MA
AI4MH
267
87
0
27 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
555
31
0
23 Aug 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
International Conference on Learning Representations (ICLR), 2023
Haipeng Luo
Qingfeng Sun
Can Xu
Lu Wang
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRM
OSLM
760
611
0
18 Aug 2023
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
International Conference on Learning Representations (ICLR), 2023
Aojun Zhou
Ke Wang
Zimu Lu
Weikang Shi
Sichun Luo
...
Shaoqing Lu
Anya Jia
Linqi Song
Mingjie Zhan
Jiaming Song
ReLM
LRM
153
193
0
15 Aug 2023
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Weisen Jiang
Han Shi
L. Yu
Zheng Liu
Yu Zhang
Zhenguo Li
James T. Kwok
LRM
379
43
0
15 Aug 2023
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
Ariel N. Lee
Cole J. Hunter
Nataniel Ruiz
ALM
ObjD
203
171
0
14 Aug 2023
Diagnostic Reasoning Prompts Reveal the Potential for Large Language Model Interpretability in Medicine
Thomas Savage
Ashwin Nayak
Roberta Gallo
E. Rangan
Jonathan Chen
LM&MA
ELM
LRM
AI4CE
173
211
0
13 Aug 2023
Detecting and Preventing Hallucinations in Large Vision Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Anisha Gunjal
Jihan Yin
Erhan Bas
MLLM
VLM
253
244
0
11 Aug 2023
Cumulative Reasoning with Large Language Models
Yifan Zhang
Jingqin Yang
Yang Yuan
Andrew Chi-Chih Yao
ReLM
ELM
LRM
AI4CE
502
87
0
08 Aug 2023
Gentopia: A Collaborative Platform for Tool-Augmented LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Binfeng Xu
Xukun Liu
Hua Shen
Zeyu Han
Yuhan Li
Murong Yue
Zhi-Ping Peng
Yuchen Liu
Ziyu Yao
Dongkuan Xu
LLMAG
199
24
0
08 Aug 2023
NEOLAF, an LLM-powered neural-symbolic cognitive architecture
Richard Tong
Cassie Chen Cao
Timothy Xueqian Lee
Guodong Zhao
Ray Wan
...
Xiangen Hu
Robin Schmucker
Jinsheng Pan
Julian Quevedo
Yu Lu
105
1
0
08 Aug 2023
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies
Liangming Pan
Michael Stephen Saxon
Wenda Xu
Deepak Nathani
Xinyi Wang
William Yang Wang
KELM
LRM
359
262
0
06 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM
ALM
251
279
0
03 Aug 2023
VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception
Jiyoung Lee
Seung Wook Kim
Seunghyun Won
Joonseok Lee
Marzyeh Ghassemi
James Thorne
Jaeseok Choi
O.-Kil Kwon
Edward Choi
280
2
0
03 Aug 2023
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
International Conference on Learning Representations (ICLR), 2023
Ning Miao
Yee Whye Teh
Tom Rainforth
ReLM
LRM
345
173
0
01 Aug 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
International Journal of Computer Vision (IJCV), 2023
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
247
11
0
31 Jul 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
341
697
0
27 Jul 2023
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
Ethan Chern
Steffi Chern
Shiqi Chen
Weizhe Yuan
Kehua Feng
Chunting Zhou
Junxian He
Graham Neubig
Pengfei Liu
HILM
259
260
0
25 Jul 2023
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla
Tom Lieberum
Matthew Rahtz
János Kramár
Neel Nanda
G. Irving
Rohin Shah
Vladimir Mikulik
289
138
0
18 Jul 2023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Ansh Radhakrishnan
Karina Nguyen
Anna Chen
Carol Chen
Carson E. Denison
...
Zac Hatfield-Dodds
Jared Kaplan
J. Brauner
Sam Bowman
Ethan Perez
ReLM
LRM
HILM
177
103
0
17 Jul 2023
MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning
International Conference on Language Resources and Evaluation (LREC), 2023
Zhenwen Liang
Dian Yu
Xiaoman Pan
Wenlin Yao
Qingkai Zeng
Xiangliang Zhang
Dong Yu
ALM
LRM
169
18
0
16 Jul 2023
Large Language Models
Communications of the ACM (CACM), 2023
Michael R Douglas
LLMAG
LM&MA
563
915
0
11 Jul 2023
Teaching Arithmetic to Small Transformers
Nayoung Lee
Kartik K. Sreenivasan
Jason D. Lee
Kangwook Lee
Dimitris Papailiopoulos
LRM
247
113
0
07 Jul 2023
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Beatriz Borges
Niket Tandon
Tanja Käser
Antoine Bosselut
408
8
0
01 Jul 2023
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles
Paul Tarau
LRM
108
1
0
24 Jun 2023
Temporal Data Meets LLM -- Explainable Financial Time Series Forecasting
Xinli Yu
Zheng Chen
Yuan Ling
Shujing Dong
Zongying Liu
Yanbin Lu
AIFin
AI4TS
311
106
0
19 Jun 2023
Domain-specific ChatBots for Science using Embeddings
Digital Discovery (DD), 2023
Kevin G. Yager
156
15
0
15 Jun 2023
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Xuekai Zhu
Biqing Qi
Kaiyan Zhang
Xingwei Long
Zhouhan Lin
Bowen Zhou
ALM
LRM
253
26
0
23 May 2023
Towards Legally Enforceable Hate Speech Detection for Public Forums
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chunyan Luo
R. Bhambhoria
Xiao-Dan Zhu
Samuel Dahan
AILaw
206
9
0
23 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
Artificial Intelligence and Applications (AIA), 2023
René Peinl
Johannes Wirth
ELM
160
7
0
19 May 2023
Manifestations of Xenophobia in AI Systems
Ai & Society (AS), 2022
Nenad Tomašev
J. L. Maynard
Iason Gabriel
337
11
0
15 Dec 2022
Large Language Models Can Self-Improve
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jiaxin Huang
S. Gu
Le Hou
Yuexin Wu
Xuezhi Wang
Hongkun Yu
Jiawei Han
ReLM
AI4MH
LRM
551
743
0
20 Oct 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
478
245
0
14 Jan 2022
Previous
1
2
3
...
26
27
28