Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.14325
Cited By
Improving Factuality and Reasoning in Language Models through Multiagent Debate
23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Factuality and Reasoning in Language Models through Multiagent Debate"
50 / 453 papers shown
Title
Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Tianlin Li
Xiaoyu Zhang
Chao Du
Tianyu Pang
Qian Liu
Qing-Wu Guo
Chao Shen
Yang Liu
ALM
31
9
0
19 Feb 2024
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
Jun Zhao
Can Zu
Haotian Xu
Yi Lu
Wei He
Yiwen Ding
Tao Gui
Qi Zhang
Xuanjing Huang
RALM
LLMAG
31
20
0
18 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Tianyi Zhou
66
17
0
16 Feb 2024
Retrieve Only When It Needs: Adaptive Retrieval Augmentation for Hallucination Mitigation in Large Language Models
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
HILM
RALM
67
15
0
16 Feb 2024
Language Models with Conformal Factuality Guarantees
Christopher Mohri
Tatsunori Hashimoto
HILM
34
33
0
15 Feb 2024
Not Just Novelty: A Longitudinal Study on Utility and Customization of an AI Workflow
Tao Long
Katy Ilonka Gero
Lydia B. Chilton
23
13
0
15 Feb 2024
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data
Haoyang Liu
Yijiang Li
Jinglin Jian
Yuxuan Cheng
Jianrong Lu
Shuyi Guo
Jinglei Zhu
Mianchen Zhang
Miantong Zhang
Haohan Wang
13
4
0
15 Feb 2024
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu
Xiaosen Zheng
Tianyu Pang
Chao Du
Qian Liu
Ye Wang
Jing Jiang
Min-Bin Lin
LLMAG
LM&Ro
35
47
0
13 Feb 2024
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
ReLM
LRM
20
48
0
12 Feb 2024
Large Language Models as Agents in Two-Player Games
Yang Liu
Peng Sun
Hang Li
LLMAG
32
4
0
12 Feb 2024
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
Kyungha Kim
Sangyun Lee
Kung-Hsiang Huang
Hou Pong Chan
Manling Li
Heng Ji
LRM
49
37
0
12 Feb 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei
Zi Wang
Yifan Lu
Chenxin Xu
Chang-rui Liu
Hao Zhao
Siheng Chen
Yanfeng Wang
VGen
57
54
0
08 Feb 2024
Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation
Xianghe Pang
Shuo Tang
Rui Ye
Yuxin Xiong
Bolun Zhang
Yanfeng Wang
Siheng Chen
111
27
0
08 Feb 2024
LLM Multi-Agent Systems: Challenges and Open Problems
Shanshan Han
Qifan Zhang
Yuhang Yao
Weizhao Jin
Zhaozhuo Xu
LLMAG
35
10
0
05 Feb 2024
Factuality of Large Language Models in the Year 2024
Yuxia Wang
Minghan Wang
Muhammad Arslan Manzoor
Fei Liu
Georgi Georgiev
Rocktim Jyoti Das
Preslav Nakov
LRM
HILM
30
7
0
04 Feb 2024
More Agents Is All You Need
Junyou Li
Qin Zhang
Yangbin Yu
Qiang Fu
Deheng Ye
LLMAG
136
57
0
03 Feb 2024
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
Justin Chih-Yao Chen
Swarnadeep Saha
Elias Stengel-Eskin
Mohit Bansal
LRM
LLMAG
30
13
0
02 Feb 2024
Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning
D. Bhattacharjya
Junkyu Lee
Don Joven Agravante
Balaji Ganesan
Radu Marinescu
LLMAG
30
1
0
02 Feb 2024
Compositional Generative Modeling: A Single Model is Not All You Need
Yilun Du
L. Kaelbling
PINN
GAN
46
19
0
02 Feb 2024
Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning
Jitao Sang
Yuhang Wang
Jing Zhang
Yanxu Zhu
Chao Kong
Junhong Ye
Shuyu Wei
Jinlin Xiao
23
1
0
01 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
18
22
0
01 Feb 2024
LLM Voting: Human Choices and AI Collective Decision Making
Joshua C. Yang
Damian Dailisan
Marcin Korecki
C. I. Hausladen
Dirk Helbing
24
18
0
31 Jan 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
16
6
0
31 Jan 2024
Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks
Wenyue Hua
Jiang Guo
Mingwen Dong
He Zhu
Patrick K. L. Ng
Zhiguo Wang
KELM
56
17
0
31 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
17
27
0
28 Jan 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELM
LRM
LLMAG
ReLM
38
63
0
23 Jan 2024
Automated Fact-Checking of Climate Change Claims with Large Language Models
Markus Leippold
S. Vaghefi
Dominik Stammbach
V. Muccione
J. Bingler
...
Tobias Schimanski
Glen Gostlow
Ting Yu
Juerg Luterbacher
C. Huggel
23
9
0
23 Jan 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Taicheng Guo
Xiuying Chen
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh V. Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
29
243
0
21 Jan 2024
Emergent Dominance Hierarchies in Reinforcement Learning Agents
Ram Rachum
Yonatan Nakar
Bill Tomlinson
Nitay Alon
Reuth Mirsky
14
0
0
21 Jan 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
Songyang Gao
Qiming Ge
Wei Shen
Shihan Dou
Junjie Ye
...
Yicheng Zou
Zhi Chen
Hang Yan
Qi Zhang
Dahua Lin
39
10
0
21 Jan 2024
Generative AI in EU Law: Liability, Privacy, Intellectual Property, and Cybersecurity
Claudio Novelli
F. Casolari
Philipp Hacker
Giorgio Spedicato
Luciano Floridi
AILaw
SILM
42
41
0
14 Jan 2024
Evolving Code with A Large Language Model
Erik Hemberg
Stephen Moskal
Una-May O’Reilly
20
24
0
13 Jan 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Junchen Wan
Fuzheng Zhang
Di Zhang
Ji-Rong Wen
KELM
31
32
0
11 Jan 2024
Combating Adversarial Attacks with Multi-Agent Debate
Steffi Chern
Zhen Fan
Andy Liu
AAML
34
5
0
11 Jan 2024
Designing Heterogeneous LLM Agents for Financial Sentiment Analysis
Frank Xing
AIFin
17
49
0
11 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
52
56
0
11 Jan 2024
Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet
Weizhe Chen
Sven Koenig
B. Dilkina
LM&Ro
LLMAG
AI4CE
59
16
0
08 Jan 2024
XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model
Zhitao Wang
Wei Wang
Zirao Li
Long Wang
Can Yi
Xinjie Xu
Luyang Cao
Hanjing Su
Shouzhi Chen
Jun Zhou
ALM
LLMAG
21
7
0
05 Jan 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Qiuying Peng
Jun Wang
Y. Zhuang
Weiming Lu
LRM
LLMAG
22
37
0
04 Jan 2024
LLM Harmony: Multi-Agent Communication for Problem Solving
Sumedh Rasal
LLMAG
17
20
0
02 Jan 2024
LLM-SAP: Large Language Models Situational Awareness Based Planning
Liman Wang
Hanyang Zhong
LLMAG
23
2
0
26 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
22
16
0
24 Dec 2023
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Kun Shao
Haitham Bou-Ammar
Jun Wang
22
17
0
22 Dec 2023
Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows
Madeleine Grunde-McLaughlin
Michelle S. Lam
Ranjay Krishna
Daniel S. Weld
Jeffrey Heer
AI4CE
43
20
0
18 Dec 2023
Social Learning: Towards Collaborative Learning with Large Language Models
Amirkeivan Mohtashami
Florian Hartmann
Sian Gooding
Lukás Zilka
Matt Sharifi
Blaise Agüera y Arcas
6
10
0
18 Dec 2023
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
Rongwu Xu
Brian S. Lin
Shujian Yang
Tianqi Zhang
Weiyan Shi
Tianwei Zhang
Zhixuan Fang
Wei Xu
Han Qiu
31
47
0
14 Dec 2023
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Haoran Liao
Qinyi Du
Shaohua Hu
Hao He
Yanyan Xu
Jidong Tian
Yaohui Jin
LRM
AI4CE
25
1
0
14 Dec 2023
Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System
Haotian Wang
Xiyuan Du
Weijiang Yu
Qianglong Chen
Kun Zhu
Zheng Chu
Lian Yan
Yi Guan
20
10
0
08 Dec 2023
Playing Large Games with Oracles and AI Debate
Xinyi Chen
Angelica Chen
Dean Foster
Elad Hazan
25
3
0
08 Dec 2023
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem
Yingqiang Ge
Yujie Ren
Wenyue Hua
Shuyuan Xu
Juntao Tan
Yongfeng Zhang
LLMAG
12
27
0
06 Dec 2023
Previous
1
2
3
...
10
6
7
8
9
Next