ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14325
  4. Cited By
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate

Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM
ArXivPDFHTML

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 452 papers shown
Title
Internet of Agents: Fundamentals, Applications, and Challenges
Internet of Agents: Fundamentals, Applications, and Challenges
Yuntao Wang
Shaolong Guo
Yanghe Pan
Zhou Su
Fahao Chen
Tom H. Luan
Peng Li
Jiawen Kang
Dusit Niyato
LLMAG
LM&Ro
AI4CE
36
0
0
12 May 2025
Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study
Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study
Baixuan Xu
Chunyang Li
Weiqi Wang
Wei Fan
Tianshi Zheng
H. Shi
Tao Fan
Yangqiu Song
Qiang Yang
14
0
0
12 May 2025
Adaptive Stress Testing Black-Box LLM Planners
Adaptive Stress Testing Black-Box LLM Planners
Neeloy Chakraborty
John Pohovey
Melkior Ornik
Katherine Driggs-Campbell
18
0
0
08 May 2025
Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design
Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design
Elena Musi
Nadin Kokciyan
Khalid Al Khatib
Davide Ceolin
Emmanuelle Dietz
...
Jodi Schneider
Jonas Scholz
Cor Steging
Jacky Visser
Henning Wachsmuth
LRM
39
0
0
08 May 2025
G-FOCUS: Towards a Robust Method for Assessing UI Design Persuasiveness
G-FOCUS: Towards a Robust Method for Assessing UI Design Persuasiveness
Jaehyun Jeon
Janghan Yoon
Minsoo Kim
Sumin Shim
Yejin Choi
Hanbin Kim
Youngjae Yu
AAML
31
0
0
08 May 2025
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
60
0
0
05 May 2025
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Liaoyaqi Wang
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
52
0
0
02 May 2025
Multi-agents based User Values Mining for Recommendation
Multi-agents based User Values Mining for Recommendation
L. Chen
Wei Yuan
Tong Chen
Xiangyu Zhao
Nguyen Quoc Viet Hung
Hongzhi Yin
OffRL
37
0
0
02 May 2025
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness
Junsheng Huang
Zhitao He
Sandeep Polisetty
Q. Wang
May Fung
KELM
43
0
0
30 Apr 2025
Evolution of AI in Education: Agentic Workflows
Evolution of AI in Education: Agentic Workflows
Firuz Kamalov
David Santandreu Calonge
Linda Smail
Dilshod Azizov
Dimple R. Thadani
Theresa Kwong
Amara Atif
40
0
0
25 Apr 2025
Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society
Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society
Feifei Zhao
Y. Wang
Enmeng Lu
Dongcheng Zhao
Bing Han
...
Chao Liu
Yaodong Yang
Yi Zeng
Boyuan Chen
Jinyu Fan
80
0
0
24 Apr 2025
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
Z. Wang
K. Wang
Q. Wang
Pingyue Zhang
Linjie Li
...
Jiajun Wu
L. Fei-Fei
Lijuan Wang
Yejin Choi
Manling Li
73
1
0
24 Apr 2025
FlowReasoner: Reinforcing Query-Level Meta-Agents
FlowReasoner: Reinforcing Query-Level Meta-Agents
Hongcheng Gao
Yue Liu
Yufei He
Longxu Dou
C. Du
Zhijie Deng
Bryan Hooi
Min Lin
Tianyu Pang
AIFin
LRM
24
1
0
21 Apr 2025
A Self-Improving Coding Agent
A Self-Improving Coding Agent
Maxime Robeyns
Martin Szummer
Laurence Aitchison
LLMAG
31
0
0
21 Apr 2025
Planet as a Brain: Towards Internet of AgentSites based on AIOS Server
Planet as a Brain: Towards Internet of AgentSites based on AIOS Server
Xiang Zhang
Yongfeng Zhang
33
0
0
19 Apr 2025
Teaching Large Language Models to Reason through Learning and Forgetting
Teaching Large Language Models to Reason through Learning and Forgetting
Tianwei Ni
Allen Nie
Sapana Chaudhary
Yao Liu
Huzefa Rangwala
Rasool Fakoor
ReLM
CLL
LRM
41
0
0
15 Apr 2025
EMAFusion: A Self-Optimizing System for Seamless LLM Selection and Integration
EMAFusion: A Self-Optimizing System for Seamless LLM Selection and Integration
Soham Shah
Kumar Shridhar
Surojit Chatterjee
Souvik Sen
32
0
0
14 Apr 2025
Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning
Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning
Jingtian Wu
Claire Cardie
LRM
19
0
0
14 Apr 2025
Can Competition Enhance the Proficiency of Agents Powered by Large Language Models in the Realm of News-driven Time Series Forecasting?
Can Competition Enhance the Proficiency of Agents Powered by Large Language Models in the Realm of News-driven Time Series Forecasting?
Yuxuan Zhang
Yangyang Feng
Daifeng Li
Kexin Zhang
Junlan Chen
Bowen Deng
LLMAG
AI4TS
26
0
0
14 Apr 2025
A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions
A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions
Chengyu Wang
Taolin Zhang
Richang Hong
Jun Huang
ReLM
LRM
34
1
0
12 Apr 2025
Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents
Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents
Manh Hung Nguyen
Victor-Alexandru Pădurean
Alkis Gotovos
Sebastian Tschiatschek
Adish Singla
24
0
0
10 Apr 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Gleb Rodionov
Roman Garipov
Alina Shutova
George Yakushev
Vage Egiazarian
Anton Sinitsin
Denis Kuznedelev
Dan Alistarh
LRM
27
1
0
08 Apr 2025
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
Fan Nie
Lan Feng
Haotian Ye
Weixin Liang
Pan Lu
Huaxiu Yao
Alexandre Alahi
James Zou
72
0
0
07 Apr 2025
Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning
Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning
Sugyeong Eo
Hyeonseok Moon
Evelyn Hayoon Zi
Chanjun Park
Heuiseok Lim
LLMAG
42
1
0
07 Apr 2025
Cognitive Debiasing Large Language Models for Decision-Making
Cognitive Debiasing Large Language Models for Decision-Making
Yougang Lyu
Shijie Ren
Yue Feng
Zihan Wang
Z. Chen
Z. Z. Ren
Maarten de Rijke
31
0
0
05 Apr 2025
Cultural Learning-Based Culture Adaptation of Language Models
Cultural Learning-Based Culture Adaptation of Language Models
Chen Cecilia Liu
Anna Korhonen
Iryna Gurevych
39
0
0
03 Apr 2025
CoLa -- Learning to Interactively Collaborate with Large LMs
CoLa -- Learning to Interactively Collaborate with Large LMs
Abhishek Sharma
Dan Goldwasser
LLMAG
SyDa
58
0
0
03 Apr 2025
Achieving Unanimous Consensus in Decision Making Using Multi-Agents
Achieving Unanimous Consensus in Decision Making Using Multi-Agents
Apurba Pokharel
Ram Dantu
Shakila Zaman
Sirisha Talapuru
Vinh Quach
38
1
0
02 Apr 2025
A Survey of Scaling in Large Language Model Reasoning
A Survey of Scaling in Large Language Model Reasoning
Zihan Chen
Song Wang
Zhen Tan
Xingbo Fu
Zhenyu Lei
Peng Wang
Huan Liu
Cong Shen
Jundong Li
LRM
84
0
0
02 Apr 2025
Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute
Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute
Jianhao Chen
Zishuo Xun
Bocheng Zhou
Han Qi
Qiaosheng Zhang
...
Wei Hu
Yuzhong Qu
W. Ouyang
Wanli Ouyang
Shuyue Hu
74
0
0
01 Apr 2025
The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
Yining Wang
Y. Wang
Xi Li
Mi Zhang
Geng Hong
Min Yang
AAML
HILM
60
0
0
01 Apr 2025
$\textit{Agents Under Siege}$: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks
Agents Under Siege\textit{Agents Under Siege}Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks
Rana Muhammad Shahroz Khan
Zhen Tan
Sukwon Yun
Charles Flemming
Tianlong Chen
AAML
LLMAG
Presented at ResearchTrend Connect | LLMAG on 23 Apr 2025
86
2
0
31 Mar 2025
DebFlow: Automating Agent Creation via Agent Debate
DebFlow: Automating Agent Creation via Agent Debate
Jinwei Su
Yinghui Xia
Ronghua Shi
Jianhui Wang
Jianuo Huang
Y. Wang
Tianyu Shi
Yang Jingsong
Lewei He
30
0
0
31 Mar 2025
Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
Sanjoy Chowdhury
Hanan Gani
Nishit Anand
Sayan Nag
Ruohan Gao
Mohamed Elhoseiny
Salman Khan
Dinesh Manocha
LRM
31
0
0
29 Mar 2025
Debate-Driven Multi-Agent LLMs for Phishing Email Detection
Debate-Driven Multi-Agent LLMs for Phishing Email Detection
Ngoc Tuong Vy Nguyen
Felix D Childress
Yunting Yin
LLMAG
31
1
0
27 Mar 2025
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Xiaoye Qu
Yafu Li
Zhaochen Su
Weigao Sun
Jianhao Yan
...
Chaochao Lu
Yue Zhang
Xian-Sheng Hua
Bowen Zhou
Yu Cheng
ReLM
OffRL
LRM
76
11
0
27 Mar 2025
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
Zhuoshi Pan
Yu-Hu Li
Honglin Lin
Qizhi Pei
Zinan Tang
Wei Yu Wu
Chenlin Ming
H. V. Zhao
Conghui He
Lijun Wu
LRM
59
0
0
21 Mar 2025
When Debate Fails: Bias Reinforcement in Large Language Models
When Debate Fails: Bias Reinforcement in Large Language Models
Jihwan Oh
Minchan Jeong
Jongwoo Ko
Se-Young Yun
LLMAG
AI4CE
49
0
0
21 Mar 2025
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
David Wan
Justin Chih-Yao Chen
Elias Stengel-Eskin
Mohit Bansal
LLMAG
LRM
60
1
0
19 Mar 2025
Don't lie to your friends: Learning what you know from collaborative self-play
Don't lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein
Reza Aghajani
Adam Fisch
Dheeru Dua
Fantine Huot
Mirella Lapata
Vicky Zayats
Jonathan Berant
66
0
0
18 Mar 2025
MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation
MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation
Kai-xiang Chen
X. Li
Tianpei Yang
Hewei Wang
Wei Dong
Yang Gao
LLMAG
LM&MA
67
2
0
18 Mar 2025
Temporal Consistency for LLM Reasoning Process Error Identification
Temporal Consistency for LLM Reasoning Process Error Identification
Jiacheng Guo
Yue Wu
Jiahao Qiu
Kaixuan Huang
Xinzhe Juan
L. Yang
Mengdi Wang
LRM
53
0
0
18 Mar 2025
Why Do Multi-Agent LLM Systems Fail?
Why Do Multi-Agent LLM Systems Fail?
Mert Cemri
Melissa Z. Pan
Shuyi Yang
Lakshya A Agrawal
Bhavya Chopra
...
Dan Klein
Kannan Ramchandran
Matei A. Zaharia
Joseph E. Gonzalez
Ion Stoica
LLMAG
Presented at ResearchTrend Connect | LLMAG on 23 Apr 2025
112
5
0
17 Mar 2025
Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition
Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition
Jiahang Cao
Qiang Zhang
Hanzhong Guo
Jiaxu Wang
Hao-Ran Cheng
Renjing Xu
DiffM
56
0
0
16 Mar 2025
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning
Edward Y. Chang
Longling Geng
LLMAG
LRM
31
0
0
15 Mar 2025
Thinking Machines: A Survey of LLM based Reasoning Strategies
Dibyanayan Bandyopadhyay
Soham Bhattacharjee
Asif Ekbal
LRM
ELM
41
4
0
13 Mar 2025
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen
Zhengrong Yue
Siran Chen
Z. Wang
Yang Liu
Peng Li
Y. Wang
VLM
63
0
0
13 Mar 2025
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
Ziyu Wan
Yunxiang Li
Y. Song
Hanjing Wang
Linyi Yang
Mark W. Schmidt
J. Wang
Weinan Zhang
Shuyue Hu
Ying Wen
LLMAG
KELM
LRM
AI4CE
81
5
0
12 Mar 2025
Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study
Veronica Bot
Zheyuan Xu
LRM
LLMAG
LM&Ro
62
0
0
11 Mar 2025
Research on Superalignment Should Advance Now with Parallel Optimization of Competence and Conformity
HyunJin Kim
Xiaoyuan Yi
Jing Yao
Muhua Huang
Jinyeong Bak
James Evans
Xing Xie
34
0
0
08 Mar 2025
1234...8910
Next