Let's Verify Step by Step

International Conference on Learning Representations (ICLR), 2023
31 May 2023
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM, OffRL, LRM

Papers citing "Let's Verify Step by Step"

39 / 1,389 papers shown
Title
Chain-of-Thought Reasoning is a Policy Improvement Operator
Hugh Zhang
David C. Parkes
ReLM, LM&Ro, LRM
183
15
0
15 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
International Conference on Machine Learning (ICML), 2023
Eran Malach
LRM
200
53
0
13 Sep 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
397
27
0
07 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Computational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
LRM, RALM, HILM
646
790
0
03 Sep 2023
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
International Conference on Learning Representations (ICLR), 2023
Hritik Bansal
John Dang
Aditya Grover
ALM
206
25
0
30 Aug 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Ran Bi
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MA, SyDa
162
14
0
27 Aug 2023
Large Language Models Streamline Automated Machine Learning for Clinical Studies
Nature Communications (Nat. Commun.), 2023
Soroosh Tayebi Arasteh
T. Han
Mahshad Lotfinia
Christiane Kuhl
Jakob Nikolas Kather
Daniel Truhn
S. Nebelung
ELM, LM&MA, AI4MH
267
87
0
27 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
555
31
0
23 Aug 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
International Conference on Learning Representations (ICLR), 2023
Haipeng Luo
Qingfeng Sun
Can Xu
Lu Wang
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRM, OSLM
760
611
0
18 Aug 2023
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
International Conference on Learning Representations (ICLR), 2023
Aojun Zhou
Ke Wang
Zimu Lu
Weikang Shi
Sichun Luo
...
Shaoqing Lu
Anya Jia
Linqi Song
Mingjie Zhan
Jiaming Song
ReLM, LRM
153
193
0
15 Aug 2023
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Weisen Jiang
Han Shi
L. Yu
Zheng Liu
Yu Zhang
Zhenguo Li
James T. Kwok
LRM
379
43
0
15 Aug 2023
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
Ariel N. Lee
Cole J. Hunter
Nataniel Ruiz
ALM, ObjD
203
171
0
14 Aug 2023
Diagnostic Reasoning Prompts Reveal the Potential for Large Language Model Interpretability in Medicine
Thomas Savage
Ashwin Nayak
Roberta Gallo
E. Rangan
Jonathan Chen
LM&MA, ELM, LRM, AI4CE
173
211
0
13 Aug 2023
Detecting and Preventing Hallucinations in Large Vision Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Anisha Gunjal
Jihan Yin
Erhan Bas
MLLM, VLM
253
244
0
11 Aug 2023
Cumulative Reasoning with Large Language Models
Yifan Zhang
Jingqin Yang
Yang Yuan
Andrew Chi-Chih Yao
ReLM, ELM, LRM, AI4CE
502
87
0
08 Aug 2023
Gentopia: A Collaborative Platform for Tool-Augmented LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Binfeng Xu
Xukun Liu
Hua Shen
Zeyu Han
Yuhan Li
Murong Yue
Zhi-Ping Peng
Yuchen Liu
Ziyu Yao
Dongkuan Xu
LLMAG
199
24
0
08 Aug 2023
NEOLAF, an LLM-powered neural-symbolic cognitive architecture
Richard Tong
Cassie Chen Cao
Timothy Xueqian Lee
Guodong Zhao
Ray Wan
...
Xiangen Hu
Robin Schmucker
Jinsheng Pan
Julian Quevedo
Yu Lu
105
1
0
08 Aug 2023
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies
Liangming Pan
Michael Stephen Saxon
Wenda Xu
Deepak Nathani
Xinyi Wang
William Yang Wang
KELM, LRM
359
262
0
06 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM, ALM
251
279
0
03 Aug 2023
VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception
Jiyoung Lee
Seung Wook Kim
Seunghyun Won
Joonseok Lee
Marzyeh Ghassemi
James Thorne
Jaeseok Choi
O.-Kil Kwon
Edward Choi
280
2
0
03 Aug 2023
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
International Conference on Learning Representations (ICLR), 2023
Ning Miao
Yee Whye Teh
Tom Rainforth
ReLM, LRM
345
173
0
01 Aug 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
International Journal of Computer Vision (IJCV), 2023
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
247
11
0
31 Jul 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM, OffRL
341
697
0
27 Jul 2023
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
Ethan Chern
Steffi Chern
Shiqi Chen
Weizhe Yuan
Kehua Feng
Chunting Zhou
Junxian He
Graham Neubig
Pengfei Liu
HILM
259
260
0
25 Jul 2023
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla
Tom Lieberum
Matthew Rahtz
János Kramár
Neel Nanda
G. Irving
Rohin Shah
Vladimir Mikulik
289
138
0
18 Jul 2023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Ansh Radhakrishnan
Karina Nguyen
Anna Chen
Carol Chen
Carson E. Denison
...
Zac Hatfield-Dodds
Jared Kaplan
J. Brauner
Sam Bowman
Ethan Perez
ReLM, LRM, HILM
177
103
0
17 Jul 2023
MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning
International Conference on Language Resources and Evaluation (LREC), 2023
Zhenwen Liang
Dian Yu
Xiaoman Pan
Wenlin Yao
Qingkai Zeng
Xiangliang Zhang
Dong Yu
ALM, LRM
169
18
0
16 Jul 2023
Large Language Models
Communications of the ACM (CACM), 2023
Michael R Douglas
LLMAG, LM&MA
563
915
0
11 Jul 2023
Teaching Arithmetic to Small Transformers
Nayoung Lee
Kartik K. Sreenivasan
Jason D. Lee
Kangwook Lee
Dimitris Papailiopoulos
LRM
247
113
0
07 Jul 2023
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Beatriz Borges
Niket Tandon
Tanja Käser
Antoine Bosselut
408
8
0
01 Jul 2023
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles
Paul Tarau
LRM
108
1
0
24 Jun 2023
Temporal Data Meets LLM -- Explainable Financial Time Series Forecasting
Xinli Yu
Zheng Chen
Yuan Ling
Shujing Dong
Zongying Liu
Yanbin Lu
AIFin, AI4TS
311
106
0
19 Jun 2023
Domain-specific ChatBots for Science using Embeddings
Digital Discovery (DD), 2023
Kevin G. Yager
156
15
0
15 Jun 2023
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Xuekai Zhu
Biqing Qi
Kaiyan Zhang
Xingwei Long
Zhouhan Lin
Bowen Zhou
ALM, LRM
253
26
0
23 May 2023
Towards Legally Enforceable Hate Speech Detection for Public Forums
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chunyan Luo
R. Bhambhoria
Xiao-Dan Zhu
Samuel Dahan
AILaw
206
9
0
23 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
Artificial Intelligence and Applications (AIA), 2023
René Peinl
Johannes Wirth
ELM
160
7
0
19 May 2023
Manifestations of Xenophobia in AI Systems
AI & Society (AS), 2022
Nenad Tomašev
J. L. Maynard
Iason Gabriel
337
11
0
15 Dec 2022
Large Language Models Can Self-Improve
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jiaxin Huang
S. Gu
Le Hou
Yuexin Wu
Xuezhi Wang
Hongkun Yu
Jiawei Han
ReLM, AI4MH, LRM
551
743
0
20 Oct 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
478
245
0
14 Jan 2022