ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14325
  4. Cited By
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate

Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM
ArXivPDFHTML

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 453 papers shown
Title
Autonomous Prompt Engineering in Large Language Models
Autonomous Prompt Engineering in Large Language Models
Daan Kepel
Konstantina Valogianni
LLMAG
29
4
0
25 Jun 2024
On the Transformations across Reward Model, Parameter Update, and
  In-Context Prompt
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
Deng Cai
Huayang Li
Tingchen Fu
Siheng Li
Weiwen Xu
...
Leyang Cui
Yan Wang
Lemao Liu
Taro Watanabe
Shuming Shi
KELM
26
2
0
24 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
62
4
0
22 Jun 2024
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in
  Large Language Model Collaborations via Debate
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate
Alfonso Amayuelas
Xianjun Yang
Antonis Antoniades
Wenyue Hua
Liangming Pan
William Wang
AAML
LLMAG
30
9
0
20 Jun 2024
Adversaries Can Misuse Combinations of Safe Models
Adversaries Can Misuse Combinations of Safe Models
Erik Jones
Anca Dragan
Jacob Steinhardt
40
6
0
20 Jun 2024
From Single Agent to Multi-Agent: Improving Traffic Signal Control
From Single Agent to Multi-Agent: Improving Traffic Signal Control
Maksim Tislenko
Dmitrii Kisilev
26
0
0
19 Jun 2024
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
Tianyuan Zou
Yang Janet Liu
Peng Li
Jianqing Zhang
Jingjing Liu
Ya-Qin Zhang
26
3
0
18 Jun 2024
Problem-Solving in Language Model Networks
Problem-Solving in Language Model Networks
Ciaran Regan
Alexandre Gournail
Mizuki Oka
LRM
LLMAG
KELM
24
1
0
18 Jun 2024
Improving Multi-Agent Debate with Sparse Communication Topology
Improving Multi-Agent Debate with Sparse Communication Topology
Yunxuan Li
Yibing Du
Jiageng Zhang
Le Hou
Peter Grabowski
Yeqing Li
Eugene Ie
LLMAG
26
18
0
17 Jun 2024
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang
Moxin Li
Wenjie Wang
Hui Lin
Fuli Feng
LRM
49
5
0
17 Jun 2024
KAOS: Large Model Multi-Agent Operating System
KAOS: Large Model Multi-Agent Operating System
Zhao Zhuo
Rongzhen Li
Kai Liu
Huhai Zou
KaiMao Li
Jie Yu
Tianhao Sun
Qingbo Wu
VLM
LLMAG
27
0
0
17 Jun 2024
SLEGO: A Collaborative Data Analytics System with LLM Recommender for
  Diverse Users
SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users
Siu Lung Ng
Hirad Rezaei
F. Rabhi
18
0
0
17 Jun 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based
  on Agile Methodology
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
26
11
0
16 Jun 2024
From Text to Life: On the Reciprocal Relationship between Artificial
  Life and Large Language Models
From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models
Eleni Nisioti
Claire Glanois
Elias Najarro
Andrew Dai
Elliot Meyerson
J. Pedersen
Laetitia Teodorescu
Conor F. Hayes
Shyam Sudhakaran
Sebastian Risi
AI4CE
LM&Ro
35
2
0
14 Jun 2024
Multi-Agent Software Development through Cross-Team Collaboration
Multi-Agent Software Development through Cross-Team Collaboration
Zhuoyun Du
Chen Qian
Wei Liu
Zihao Xie
Yifei Wang
Yufan Dang
Weize Chen
Cheng Yang
LLMAG
36
16
0
13 Jun 2024
StreamBench: Towards Benchmarking Continuous Improvement of Language
  Agents
StreamBench: Towards Benchmarking Continuous Improvement of Language Agents
Cheng-Kuang Wu
Zhi Rui Tam
Chieh-Yen Lin
Yun-Nung Chen
Hung-yi Lee
LLMAG
19
6
0
13 Jun 2024
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo
  Tree Self-refine with LLaMa-3 8B
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Di Zhang
Xiaoshui Huang
Dongzhan Zhou
Yuqiang Li
Wanli Ouyang
LRM
33
51
0
11 Jun 2024
CoEvol: Constructing Better Responses for Instruction Finetuning through
  Multi-Agent Cooperation
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation
Renhao Li
Minghuan Tan
Derek F. Wong
Min Yang
LLMAG
19
1
0
11 Jun 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CE
LLMAG
LM&Ro
54
42
0
11 Jun 2024
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
Junlin Wang
Tianyi Yang
Roy Xie
Bhuwan Dhingra
SILM
AAML
29
3
0
10 Jun 2024
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning
  Strategies
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Junlin Wang
Siddhartha Jain
Dejiao Zhang
Baishakhi Ray
Varun Kumar
Ben Athiwaratkun
27
19
0
10 Jun 2024
Mixture-of-Agents Enhances Large Language Model Capabilities
Mixture-of-Agents Enhances Large Language Model Capabilities
Junlin Wang
Jue Wang
Ben Athiwaratkun
Ce Zhang
James Zou
LLMAG
AIFin
36
94
0
07 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
32
18
0
06 Jun 2024
A Survey of Language-Based Communication in Robotics
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
47
11
0
06 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent
  Debate Framework
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Xiaoxi Sun
Jinpeng Li
Yan Zhong
Dongyan Zhao
Rui Yan
LLMAG
HILM
24
5
0
05 Jun 2024
Chain of Agents: Large Language Models Collaborating on Long-Context
  Tasks
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
Yusen Zhang
Ruoxi Sun
Yanfei Chen
Tomas Pfister
Rui Zhang
Sercan Ö. Arik
RALM
AI4CE
LLMAG
42
28
0
04 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future
  Pathways
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
42
19
0
04 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of
  Self-Correction of LLMs
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
40
57
0
03 Jun 2024
Brainstorming Brings Power to Large Language Models of Knowledge
  Reasoning
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning
Zining Qin
Chenhao Wang
Huiling Qin
Weijia Jia
LRM
24
1
0
02 Jun 2024
Harnessing Business and Media Insights with Large Language Models
Harnessing Business and Media Insights with Large Language Models
Yujia Bao
Ankit Parag Shah
Neeru Narang
Jonathan Rivers
Rajeev Maksey
...
Gyuhak Kim
Dengpan Yin
Don Hejna
Mo Nomeli
Wei Wei
AIFin
38
2
0
02 Jun 2024
ANAH: Analytical Annotation of Hallucinations in Large Language Models
ANAH: Analytical Annotation of Hallucinations in Large Language Models
Ziwei Ji
Yuzhe Gu
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai-xiang Chen
HILM
41
2
0
30 May 2024
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles
  and Committee Discussions
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions
Ruochen Zhao
Wenxuan Zhang
Yew Ken Chia
Deli Zhao
Lidong Bing
30
9
0
30 May 2024
Cracking the Code of Juxtaposition: Can AI Models Understand the
  Humorous Contradictions
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
36
4
0
29 May 2024
Adaptive In-conversation Team Building for Language Model Agents
Adaptive In-conversation Team Building for Language Model Agents
Linxin Song
Jiale Liu
Jieyu Zhang
Shaokun Zhang
Ao Luo
Shijian Wang
Qingyun Wu
Chi Wang
LLMAG
52
9
0
29 May 2024
Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents
Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Yue Feng
Lingyong Yan
Haibo Shi
Dawei Yin
Zhumin Chen
Suzan Verberne
LLMAG
47
6
0
26 May 2024
Confidence Under the Hood: An Investigation into the
  Confidence-Probability Alignment in Large Language Models
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
Abhishek Kumar
Robert D Morabito
Sanzhar Umbet
Jad Kabbara
Ali Emami
46
5
0
25 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
114
22
0
20 May 2024
Medical Dialogue: A Survey of Categories, Methods, Evaluation and
  Challenges
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi
Zeming Liu
Li Du
Yuxuan Wang
Hongru Wang
Yuhang Guo
Tong Ruan
Jie Xu
Shaoting Zhang
LM&MA
ELM
32
1
0
17 May 2024
Agent Design Pattern Catalogue: A Collection of Architectural Patterns
  for Foundation Model based Agents
Agent Design Pattern Catalogue: A Collection of Architectural Patterns for Foundation Model based Agents
Yue Liu
Sin Kit Lo
Qinghua Lu
Liming Zhu
Dehai Zhao
Xiwei Xu
Stefan Harrer
Jon Whittle
LLMAG
AI4CE
20
10
0
16 May 2024
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation
Alex G. Kim
Keonwoo Kim
Sangwon Yoon
ELM
16
1
0
16 May 2024
DuetRAG: Collaborative Retrieval-Augmented Generation
DuetRAG: Collaborative Retrieval-Augmented Generation
Dian Jiao
Li Cai
Jingsheng Huang
Wenqiao Zhang
Siliang Tang
Yueting Zhuang
22
0
0
12 May 2024
Event GDR: Event-Centric Generative Document Retrieval
Event GDR: Event-Centric Generative Document Retrieval
Yong Guan
Dingxiao Liu
Jinchen Ma
Hao Peng
Xiaozhi Wang
Lei Hou
Ru Li
16
1
0
11 May 2024
LLM Discussion: Enhancing the Creativity of Large Language Models via
  Discussion Framework and Role-Play
LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Li-Chun Lu
Shou-Jen Chen
Tsung-Min Pai
Chan-Hung Yu
Hung-yi Lee
Shao-Hua Sun
LLMAG
43
38
0
10 May 2024
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency
  for Tool Planning
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning
Junzhi Chen
Juhao Liang
Benyou Wang
LLMAG
21
2
0
09 May 2024
Chain of Thoughtlessness? An Analysis of CoT in Planning
Chain of Thoughtlessness? An Analysis of CoT in Planning
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
LRM
LM&Ro
54
35
0
08 May 2024
Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in
  Structured Finance: The Application of Multi-agent Framework
Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework
Xiangpeng Wan
Haicheng Deng
Kai Zou
Shiqi Xu
LLMAG
21
2
0
07 May 2024
Conformity, Confabulation, and Impersonation: Persona Inconstancy in
  Multi-Agent LLM Collaboration
Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration
Razan Baltaji
Babak Hemmatian
L. Varshney
LLMAG
24
1
0
06 May 2024
Argumentative Large Language Models for Explainable and Contestable Claim Verification
Argumentative Large Language Models for Explainable and Contestable Claim Verification
Gabriel Freedman
Adam Dejl
Deniz Gorur
Xiang Yin
Antonio Rago
Francesca Toni
23
7
0
03 May 2024
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for
  Complex Problem Solving
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
Pei Chen
Boran Han
Shuai Zhang
LRM
LLMAG
27
4
0
26 Apr 2024
When to Trust LLMs: Aligning Confidence with Response Quality
When to Trust LLMs: Aligning Confidence with Response Quality
Shuchang Tao
Liuyi Yao
Hanxing Ding
Yuexiang Xie
Qi Cao
Fei Sun
Jinyang Gao
Huawei Shen
Bolin Ding
24
15
0
26 Apr 2024
Previous
123456...8910
Next