ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14325
  4. Cited By
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate

Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM
ArXivPDFHTML

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 453 papers shown
Title
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to
  Complement Historical Analysis
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Shuhang Lin
Wenyue Hua
Lingyao Li
Che-Jui Chang
Lizhou Fan
Jianchao Ji
Hang Hua
Mingyu Jin
Jiebo Luo
Yongfeng Zhang
LM&Ro
LLMAG
46
8
0
23 Apr 2024
Adaptive Collaboration Strategy for LLMs in Medical Decision Making
Adaptive Collaboration Strategy for LLMs in Medical Decision Making
Y. Kim
Chanwoo Park
Hyewon Jeong
Yik Siu Chan
X. Xu
Daniel J. McDuff
C. Breazeal
Hae Won Park
32
4
0
22 Apr 2024
ISQA: Informative Factuality Feedback for Scientific Summarization
ISQA: Informative Factuality Feedback for Scientific Summarization
Zekai Li
Yanxia Qin
Qian Liu
Min-Yen Kan
HILM
24
1
0
20 Apr 2024
AgentCoord: Visually Exploring Coordination Strategy for LLM-based
  Multi-Agent Collaboration
AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration
Bo Pan
Jiaying Lu
Ke Wang
Li Zheng
Zhen Wen
Yingchaojie Feng
Minfeng Zhu
Wei Chen
LLMAG
32
10
0
18 Apr 2024
Unveiling Imitation Learning: Exploring the Impact of Data Falsity to
  Large Language Model
Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model
Hyunsoo Cho
ALM
16
0
0
15 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent
  Deliberation
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
30
3
0
14 Apr 2024
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
Jinheon Baek
S. Jauhar
Silviu Cucerzan
Sung Ju Hwang
AI4CE
LLMAG
LM&Ro
34
36
0
11 Apr 2024
A Survey on the Integration of Generative AI for Critical Thinking in
  Mobile Networks
A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks
Athanasios Karapantelakis
Alexandros Nikou
Ajay Kattepur
Jean Martins
Leonid Mokrushin
S. Mohalik
Marin Orlic
Aneta Vulgarakis Feljan
19
1
0
10 Apr 2024
Can Feedback Enhance Semantic Grounding in Large Vision-Language Models?
Can Feedback Enhance Semantic Grounding in Large Vision-Language Models?
Yuan-Hong Liao
Rafid Mahmood
Sanja Fidler
David Acuna
VLM
44
7
0
09 Apr 2024
RoT: Enhancing Large Language Models with Reflection on Search Trees
RoT: Enhancing Large Language Models with Reflection on Search Trees
Wenyang Hui
Kewei Tu
LRM
27
6
0
08 Apr 2024
Social Skill Training with Large Language Models
Social Skill Training with Large Language Models
Diyi Yang
Caleb Ziems
William B. Held
Omar Shaikh
Michael S. Bernstein
John C. Mitchell
LLMAG
26
6
0
05 Apr 2024
MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain
  Expertise
MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain Expertise
Chunyuan Deng
Xiangru Tang
Yilun Zhao
Hanming Wang
Haoran Wang
Wangchunshu Zhou
Arman Cohan
Mark B. Gerstein
LLMAG
MLLM
20
1
0
03 Apr 2024
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as
  Generalist via Expert Token Routing
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Ziwei Chai
Guoyin Wang
Jing Su
Tianjie Zhang
Xuanwen Huang
...
Jingjing Xu
Jianbo Yuan
Hongxia Yang
Fei Wu
Yang Yang
26
6
0
25 Mar 2024
AIOS: LLM Agent Operating System
AIOS: LLM Agent Operating System
Kai Mei
Zelong Li
Wujiang Xu
Wenyue Hua
Mingyu Jin
Yongfeng Zhang
Shuyuan Xu
Ruosong Ye
Yingqiang Ge
Yongfeng Zhang
LLMAG
26
17
0
25 Mar 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
54
9
0
25 Mar 2024
A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal
  Reasoning
A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning
Changmeng Zheng
Dayong Liang
Wengyu Zhang
Xiao Wei
Tat-Seng Chua
Qing Li
27
1
0
22 Mar 2024
Content Knowledge Identification with Multi-Agent Large Language Models
  (LLMs)
Content Knowledge Identification with Multi-Agent Large Language Models (LLMs)
Kaiqi Yang
Yucheng Chu
Taylor Darwin
Ahreum Han
Hang Li
Hongzhi Wen
Yasemin Copur-Gencturk
Jiliang Tang
Hui Liu
27
12
0
22 Mar 2024
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought
  Prompting
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
LRM
AI4CE
ReLM
47
6
0
21 Mar 2024
ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion
  Classification
ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification
Sehee Lim
Yejin Kim
Chi-Hyun Choi
Jy-yong Sohn
Byung-Hoon Kim
23
1
0
21 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible
  Practices
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
PILM
58
22
0
19 Mar 2024
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of
  MLLM
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
YiXuan Wu
Yizhou Wang
Shixiang Tang
Wenhao Wu
Tong He
Wanli Ouyang
Jian Wu
Philip H. S. Torr
ObjD
VLM
22
18
0
19 Mar 2024
Can LLM-Augmented autonomous agents cooperate?, An evaluation of their
  cooperative capabilities through Melting Pot
Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot
Manuel Mosquera
Juan Sebastian Pinzon
Manuel Rios
Yesid Fonseca
Luis Felipe Giraldo
Nicanor Quijano
Ruben Manrique
19
2
0
18 Mar 2024
A Survey on Game Playing Agents and Large Models: Methods, Applications,
  and Challenges
A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges
Xinrun Xu
Yuxin Wang
Chaoyi Xu
Ziluo Ding
Jiechuan Jiang
Zhiming Ding
Börje F. Karlsson
LM&Ro
LLMAG
70
13
0
15 Mar 2024
Hierarchical Auto-Organizing System for Open-Ended Multi-Agent
  Navigation
Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation
Zhonghan Zhao
Kewei Chen
Dongxu Guo
Wenhao Chai
Tianbo Ye
Yanting Zhang
Gaoang Wang
53
18
0
13 Mar 2024
ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware
  Repair of Access Control Vulnerabilities in Smart Contracts
ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts
Lyuye Zhang
Kaixuan Li
Kairan Sun
Daoyuan Wu
Ye Liu
Haoye Tian
Yang Liu
46
20
0
11 Mar 2024
$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception
  Models under Perturbations
R2\text{R}^2R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li
Kai Qiu
Jinglu Wang
Xiaohao Xu
Rita Singh
Kashu Yamazaki
Hao Chen
Xiaonan Huang
Bhiksha Raj
VOS
32
1
0
07 Mar 2024
Learning to Use Tools via Cooperative and Interactive Agents
Learning to Use Tools via Cooperative and Interactive Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Zhumin Chen
Lingyong Yan
Haibo Shi
Dawei Yin
Pengjie Ren
Suzan Verberne
Zhaochun Ren
LLMAG
21
16
0
05 Mar 2024
Are More LLM Calls All You Need? Towards Scaling Laws of Compound
  Inference Systems
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Ion Stoica
Matei A. Zaharia
James Y. Zou
LRM
24
0
0
04 Mar 2024
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
Yifan Zeng
Yiran Wu
Xiao Zhang
Huazheng Wang
Qingyun Wu
LLMAG
AAML
35
57
0
02 Mar 2024
Controllable Preference Optimization: Toward Controllable
  Multi-Objective Alignment
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment
Yiju Guo
Ganqu Cui
Lifan Yuan
Ning Ding
Jiexin Wang
...
Ruobing Xie
Jie Zhou
Yankai Lin
Zhiyuan Liu
Maosong Sun
28
17
0
29 Feb 2024
Beyond Natural Language: LLMs Leveraging Alternative Formats for
  Enhanced Reasoning and Communication
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication
Weize Chen
Chenfei Yuan
Jiarui Yuan
Yusheng Su
Cheng Qian
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
20
9
0
28 Feb 2024
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the
  Key?
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
Qineng Wang
Zihao W. Wang
Ying Su
Hanghang Tong
Yangqiu Song
LLMAG
LRM
23
57
0
28 Feb 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and
  Optimization
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Y. Zhuang
Weiming Lu
LLMAG
23
33
0
27 Feb 2024
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical
  Reasoning
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Debrup Das
Debopriyo Banerjee
Somak Aditya
Ashish Kulkarni
ReLM
LRM
16
10
0
27 Feb 2024
Large Language Model for Participatory Urban Planning
Large Language Model for Participatory Urban Planning
Zhilun Zhou
Yuming Lin
Depeng Jin
Yong Li
LLMAG
31
24
0
27 Feb 2024
Language Agents as Optimizable Graphs
Language Agents as Optimizable Graphs
Mingchen Zhuge
Wenyi Wang
Louis Kirsch
Francesco Faccio
Dmitrii Khizbullin
Jürgen Schmidhuber
LLMAG
21
19
0
26 Feb 2024
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent
  LLMs
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent LLMs
Sumedh Rasal
E. Hauer
17
0
0
26 Feb 2024
GenAINet: Enabling Wireless Collective Intelligence via Knowledge Transfer and Reasoning
GenAINet: Enabling Wireless Collective Intelligence via Knowledge Transfer and Reasoning
Han Zou
Qiyang Zhao
Lina Bariah
Yu Tian
M. Bennis
S. Lasaulce
91
12
0
26 Feb 2024
Reward Design for Justifiable Sequential Decision-Making
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
19
0
0
24 Feb 2024
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
Xiaolong Wang
Yile Wang
Sijie Cheng
Peng Li
Yang Janet Liu
18
4
0
23 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
32
43
0
22 Feb 2024
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large
  Vision-Language Models
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao
Xinting Huang
Tingchen Fu
Qintong Li
Shansan Gong
Lemao Liu
Wei Bi
Lingpeng Kong
LRM
31
1
0
21 Feb 2024
AgentScope: A Flexible yet Robust Multi-Agent Platform
AgentScope: A Flexible yet Robust Multi-Agent Platform
Dawei Gao
Zitao Li
Xuchen Pan
Weirui Kuang
Zhijian Ma
...
Chen Cheng
Hongzhu Shi
Yaliang Li
Bolin Ding
Jingren Zhou
LLMAG
22
25
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Jundong Li
Lu Cheng
Huan Liu
SyDa
42
44
0
21 Feb 2024
Soft Self-Consistency Improves Language Model Agents
Soft Self-Consistency Improves Language Model Agents
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
LLMAG
24
7
0
20 Feb 2024
Defending Jailbreak Prompts via In-Context Adversarial Game
Defending Jailbreak Prompts via In-Context Adversarial Game
Yujun Zhou
Yufei Han
Haomin Zhuang
Kehan Guo
Zhenwen Liang
Hongyan Bao
Xiangliang Zhang
LLMAG
AAML
19
11
0
20 Feb 2024
Evolving AI Collectives to Enhance Human Diversity and Enable
  Self-Regulation
Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation
Shiyang Lai
Yujin Potter
Junsol Kim
Richard Zhuang
Dawn Song
James Evans
48
3
0
19 Feb 2024
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of
  Large Language Models
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models
Loka Li
Zhenhao Chen
Guan-Hong Chen
Yixuan Zhang
Yusheng Su
Eric P. Xing
Kun Zhang
LRM
36
15
0
19 Feb 2024
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via
  Game-Theoretic Evaluations
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
Jinhao Duan
Renming Zhang
James Diffenderfer
B. Kailkhura
Lichao Sun
Elias Stengel-Eskin
Mohit Bansal
Tianlong Chen
Kaidi Xu
ELM
LRM
29
55
0
19 Feb 2024
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM
  Agents
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM Agents
Zengqing Wu
Run Peng
Shuyuan Zheng
Qianying Liu
Xu Han
Brian Inhyuk Kwon
Makoto Onizuka
Shaojie Tang
Chuan Xiao
28
10
0
19 Feb 2024
Previous
123...1056789
Next