ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.17311
  4. Cited By
Universal Self-Consistency for Large Language Model Generation

Universal Self-Consistency for Large Language Model Generation

29 November 2023
Xinyun Chen
Renat Aksitov
Uri Alon
Jie Jessie Ren
Kefan Xiao
Pengcheng Yin
Sushant Prakash
Charles Sutton
Xuezhi Wang
Denny Zhou
    LRM
ArXivPDFHTML

Papers citing "Universal Self-Consistency for Large Language Model Generation"

50 / 59 papers shown
Title
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Toghrul Abbasli
Kentaroh Toyoda
Yuan Wang
Leon Witt
Muhammad Asif Ali
Yukai Miao
Dan Li
Qingsong Wei
UQCV
79
0
0
25 Apr 2025
AI Awareness
AI Awareness
X. Li
Haoyuan Shi
Rongwu Xu
Wei Xu
54
0
0
25 Apr 2025
TTRL: Test-Time Reinforcement Learning
TTRL: Test-Time Reinforcement Learning
Yuxin Zuo
Kaiyan Zhang
Shang Qu
Li Sheng
Xuekai Zhu
Biqing Qi
Youbang Sun
Ganqu Cui
Ning Ding
Bowen Zhou
OffRL
35
1
0
22 Apr 2025
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
MingShan Liu
Shi Bo
Jialing Fang
LRM
22
0
0
13 Apr 2025
Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute
Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute
Jianhao Chen
Zishuo Xun
Bocheng Zhou
Han Qi
Qiaosheng Zhang
...
Wei Hu
Yuzhong Qu
W. Ouyang
Wanli Ouyang
Shuyue Hu
74
0
0
01 Apr 2025
Video-T1: Test-Time Scaling for Video Generation
Video-T1: Test-Time Scaling for Video Generation
F. Liu
Hanyang Wang
Yimo Cai
Kaiyan Zhang
Xiaohang Zhan
Yueqi Duan
DiffM
VGen
76
1
0
24 Mar 2025
When Debate Fails: Bias Reinforcement in Large Language Models
When Debate Fails: Bias Reinforcement in Large Language Models
Jihwan Oh
Minchan Jeong
Jongwoo Ko
Se-Young Yun
LLMAG
AI4CE
49
0
0
21 Mar 2025
$ϕ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
ϕϕϕ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
Fangzhi Xu
Hang Yan
Chang Ma
Haiteng Zhao
Jun Liu
Qika Lin
Zhiyong Wu
36
2
0
17 Mar 2025
"Well, Keep Thinking": Enhancing LLM Reasoning with Adaptive Injection Decoding
"Well, Keep Thinking": Enhancing LLM Reasoning with Adaptive Injection Decoding
Hyunbin Jin
Je Won Yeom
Seunghyun Bae
Taesup Kim
LRM
ReLM
37
1
0
13 Mar 2025
Enhancing Reasoning with Collaboration and Memory
Julie Michelman
Nasrin Baratalipour
Matthew Abueg
LLMAG
FedML
59
1
0
07 Mar 2025
Speculative Decoding for Multi-Sample Inference
Yiwei Li
Jiayi Shi
Shaoxiong Feng
Peiwen Yuan
X. Wang
...
Ji Zhang
Chuyi Tan
Boyuan Pan
Yao Hu
Kan Li
LRM
38
0
0
07 Mar 2025
From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment
Yutian Pang
Andrew Paul Kendall
Alex Porcayo
Mariah Barsotti
Anahita Jain
John-Paul Clarke
41
0
0
06 Mar 2025
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
Zichong Li
Xinyu Feng
Yuheng Cai
Zixuan Zhang
Tianyi Liu
Chen Liang
Weizhu Chen
Haoyu Wang
T. Zhao
LRM
48
1
0
06 Mar 2025
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Yiwei Li
Ji Zhang
Shaoxiong Feng
Peiwen Yuan
X. Wang
...
Y. Zhang
Chuyi Tan
Boyuan Pan
Yao Hu
Kan Li
HILM
36
1
0
27 Feb 2025
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Zhewei Kang
Xuandong Zhao
Dawn Song
LRM
57
2
0
25 Feb 2025
Disproving Program Equivalence with LLMs
Disproving Program Equivalence with LLMs
Miltiadis Allamanis
Pengcheng Yin
40
0
0
05 Feb 2025
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Wenzhe Li
Yong Lin
Mengzhou Xia
Chi Jin
MoE
71
2
0
02 Feb 2025
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of
  Large Language Models
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models
Do Xuan Long
Duong Ngoc Yen
Anh Tuan Luu
Kenji Kawaguchi
Min-Yen Kan
Nancy F. Chen
KELM
ELM
LRM
21
3
0
01 Nov 2024
Autoformalize Mathematical Statements by Symbolic Equivalence and
  Semantic Consistency
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency
Zenan Li
Yifan Wu
Zhaoyu Li
Xinming Wei
Xian Zhang
Fan Yang
Xiaoxing Ma
32
2
0
28 Oct 2024
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses
Zonglin Yang
Wanhao Liu
Ben Gao
Tong Xie
Yuqiang Li
Wanli Ouyang
Soujanya Poria
Erik Cambria
Dongzhan Zhou
LRM
21
12
0
09 Oct 2024
MM-R$^3$: On (In-)Consistency of Multi-modal Large Language Models
  (MLLMs)
MM-R3^33: On (In-)Consistency of Multi-modal Large Language Models (MLLMs)
Shih-Han Chou
Shivam Chandhok
James J. Little
Leonid Sigal
32
0
0
07 Oct 2024
Better Instruction-Following Through Minimum Bayes Risk
Better Instruction-Following Through Minimum Bayes Risk
Ian Wu
Patrick Fernandes
Amanda Bertsch
Seungone Kim
Sina Pakazad
Graham Neubig
48
9
0
03 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
45
1
0
02 Oct 2024
From Code to Correctness: Closing the Last Mile of Code Generation with
  Hierarchical Debugging
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
Yuling Shi
Songsong Wang
Chengcheng Wan
Xiaodong Gu
ELM
19
6
0
02 Oct 2024
Multimodal Auto Validation For Self-Refinement in Web Agents
Multimodal Auto Validation For Self-Refinement in Web Agents
Ruhana Azam
Tamer Abuelsaad
Aditya Vempaty
Ashish Jagmohan
21
1
0
01 Oct 2024
A Survey on the Honesty of Large Language Models
A Survey on the Honesty of Large Language Models
Siheng Li
Cheng Yang
Taiqiang Wu
Chufan Shi
Yuji Zhang
...
Jie Zhou
Yujiu Yang
Ngai Wong
Xixin Wu
Wai Lam
HILM
22
4
0
27 Sep 2024
Watermarking Techniques for Large Language Models: A Survey
Watermarking Techniques for Large Language Models: A Survey
Yuqing Liang
Jiancheng Xiao
Wensheng Gan
Philip S. Yu
OffRL
19
3
0
26 Aug 2024
PEDAL: Enhancing Greedy Decoding with Large Language Models using
  Diverse Exemplars
PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars
Sumanth Prabhu
28
1
0
16 Aug 2024
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long
Hai Nguyen Ngoc
Tiviatis Sim
Hieu Dao
Shafiq R. Joty
Kenji Kawaguchi
Nancy F. Chen
Min-Yen Kan
21
7
0
16 Aug 2024
In-Context Example Selection via Similarity Search Improves Low-Resource
  Machine Translation
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation
Joel Witzke
Benoît Sagot
Rachel Bawden
31
0
0
01 Aug 2024
SimCT: A Simple Consistency Test Protocol in LLMs Development Lifecycle
SimCT: A Simple Consistency Test Protocol in LLMs Development Lifecycle
Fufangchen Zhao
Guoqiang Jin
Rui Zhao
Jiangheng Huang
Fei Tan
27
1
0
24 Jul 2024
Internal Consistency and Self-Feedback in Large Language Models: A
  Survey
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Shichao Song
Zifan Zheng
Hanyu Wang
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Feiyu Xiong
Zhiyu Li
HILM
LRM
56
23
0
19 Jul 2024
Localizing and Mitigating Errors in Long-form Question Answering
Localizing and Mitigating Errors in Long-form Question Answering
Rachneet Sachdeva
Yixiao Song
Mohit Iyyer
Iryna Gurevych
HILM
28
0
0
16 Jul 2024
Distilling System 2 into System 1
Distilling System 2 into System 1
Ping Yu
Jing Xu
Jason Weston
Ilia Kulikov
OffRL
LRM
38
55
0
08 Jul 2024
Integrate the Essence and Eliminate the Dross: Fine-Grained
  Self-Consistency for Free-Form Language Generation
Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation
Xinglin Wang
Yiwei Li
Shaoxiong Feng
Peiwen Yuan
Boyuan Pan
Heda Wang
Yao Hu
Kan Li
23
10
0
02 Jul 2024
Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in
  Self-Improving Generation
Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation
Jihyun Janice Ahn
Ryo Kamoi
Lu Cheng
Rui Zhang
Wenpeng Yin
25
1
0
27 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large
  Language Models
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
33
45
0
24 Jun 2024
Current state of LLM Risks and AI Guardrails
Current state of LLM Risks and AI Guardrails
Suriya Ganesh Ayyamperumal
Limin Ge
45
20
0
16 Jun 2024
ContraSolver: Self-Alignment of Language Models by Resolving Internal
  Preference Contradictions
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions
Xu Zhang
Xunjian Yin
Xiaojun Wan
40
3
0
13 Jun 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded
  Generation
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
I-Hung Hsu
Zifeng Wang
Long T. Le
Lesly Miculicich
Nanyun Peng
Chen-Yu Lee
Tomas Pfister
HILM
21
4
0
08 Jun 2024
To Believe or Not to Believe Your LLM
To Believe or Not to Believe Your LLM
Yasin Abbasi-Yadkori
Ilja Kuzborskij
András György
Csaba Szepesvári
UQCV
53
39
0
04 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of
  Self-Correction of LLMs
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
40
19
0
03 Jun 2024
Atomic Self-Consistency for Better Long Form Generations
Atomic Self-Consistency for Better Long Form Generations
Raghuveer Thirukovalluru
Yukun Huang
Bhuwan Dhingra
30
4
0
21 May 2024
Evaluating Consistency and Reasoning Capabilities of Large Language
  Models
Evaluating Consistency and Reasoning Capabilities of Large Language Models
Yash Saxena
Sarthak Chopra
Arunendra Mani Tripathi
ELM
LRM
28
5
0
25 Apr 2024
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in
  Large Language Models
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li
Chenghao Yang
Allyson Ettinger
ReLM
LRM
LLMAG
18
6
0
14 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent
  Deliberation
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
27
3
0
14 Apr 2024
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive
  Data Analysis Agents
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Jinyang Li
Nan Huo
Yan Gao
Jiayi Shi
Yingxiu Zhao
Ge Qu
Yurong Wu
Chenhao Ma
Jian-Guang Lou
Reynold Cheng
LLMAG
19
3
0
08 Mar 2024
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Ziniu Hu
Ahmet Iscen
Aashi Jain
Thomas Kipf
Yisong Yue
David A. Ross
Cordelia Schmid
Alireza Fathi
LLMAG
21
23
0
02 Mar 2024
Approaching Human-Level Forecasting with Language Models
Approaching Human-Level Forecasting with Language Models
Danny Halawi
Fred Zhang
Chen Yueh-Han
Jacob Steinhardt
34
14
0
28 Feb 2024
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Ante Wang
Linfeng Song
Baolin Peng
Ye Tian
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
HILM
LRM
18
6
0
23 Feb 2024
12
Next