2305.14325
Cited By
Improving Factuality and Reasoning in Language Models through Multiagent Debate
23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
Papers citing
"Improving Factuality and Reasoning in Language Models through Multiagent Debate"
50 / 453 papers shown
Title
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
Weize Chen
Jiarui Yuan
Chen Qian
Cheng Yang
Zhiyuan Liu
Maosong Sun
LLMAG
26
4
0
10 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
55
6
0
10 Oct 2024
ReIFE: Re-evaluating Instruction-Following Evaluation
Yixin Liu
Kejian Shi
Alexander R. Fabbri
Yilun Zhao
Peifeng Wang
Chien-Sheng Wu
Shafiq Joty
Arman Cohan
14
6
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
43
7
0
09 Oct 2024
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Mehul Damani
Idan Shenfeld
Andi Peng
Andreea Bobu
Jacob Andreas
34
14
0
07 Oct 2024
Leveraging Large Language Models for Suicide Detection on Social Media with Limited Labels
Vy Nguyen
Chau Pham
ALM
AI4MH
29
2
0
06 Oct 2024
MindScope: Exploring cognitive biases in large language models through Multi-Agent Systems
Zhentao Xie
Jiabao Zhao
Yilei Wang
Jinxin Shi
Yanhong Bai
Xingjiao Wu
Liang He
LLMAG
26
0
0
06 Oct 2024
Persona Knowledge-Aligned Prompt Tuning Method for Online Debate
Chunkit Chan
Cheng Jiayang
Xin Liu
Yauwai Yim
Yuxin Jiang
Zheye Deng
Haoran Li
Yangqiu Song
Ginny Y. Wong
Simon See
26
0
0
05 Oct 2024
Are Expert-Level Language Models Expert-Level Annotators?
Yu-Min Tseng
Wei-Lin Chen
Chung-Chi Chen
Hsin-Hsi Chen
ALM
29
0
0
04 Oct 2024
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Guibin Zhang
Yanwei Yue
Zhixun Li
Sukwon Yun
Guancheng Wan
Kun Wang
Dawei Cheng
Jeffrey Xu Yu
Tianlong Chen
32
3
0
03 Oct 2024
Collective Critics for Creative Story Generation
Minwook Bae
Hyounghun Kim
16
2
0
03 Oct 2024
Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics
Yuan Zhou
Peng Zhang
Mengya Song
Alice Zheng
Yiwen Lu
Zhiheng Liu
Yong Chen
Zhaohan Xi
LM&MA
27
1
0
02 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
50
1
0
02 Oct 2024
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
Danqing Wang
Jianxin Ma
Fei Fang
Lei Li
LLMAG
LRM
50
0
0
02 Oct 2024
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang
Chengqi Zang
Bernhard Kainz
16
0
0
01 Oct 2024
From Facts to Insights: A Study on the Generation and Evaluation of Analytical Reports for Deciphering Earnings Calls
Tomas Goldsack
Yang Wang
Chenghua Lin
Chung-Chi Chen
13
2
0
01 Oct 2024
Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface
Wenyue Hua
Mengting Wan
Shashank Vadrevu
Ryan Nadel
Yongfeng Zhang
Chi Wang
LLMAG
24
1
0
30 Sep 2024
Data Analysis in the Era of Generative AI
J. Inala
Chenglong Wang
Steven Drucker
Gonzalo Ramos
Victor C. Dibia
N. Riche
Dave Brown
Dan Marshall
Jianfeng Gao
20
6
0
27 Sep 2024
Attention Prompting on Image for Large Vision-Language Models
Runpeng Yu
Weihao Yu
Xinchao Wang
VLM
28
5
0
25 Sep 2024
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
Samuel Arnesen
David Rein
Julian Michael
ELM
20
3
0
25 Sep 2024
Evaluating and Enhancing Large Language Models for Novelty Assessment in Scholarly Publications
Ethan Lin
Zhiyuan Peng
Yi Fang
31
4
0
25 Sep 2024
COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models
Kehui Liu
Zixin Tang
Dong Wang
Z. Wang
Bin Zhao
24
9
0
23 Sep 2024
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion
Tongxuan Liu
Xingyu Wang
Weizhe Huang
Wenjiang Xu
Yuting Zeng
Lei Jiang
Hailong Yang
Jing Li
LLMAG
16
8
0
21 Sep 2024
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
Justin Chih-Yao Chen
Archiki Prasad
Swarnadeep Saha
Elias Stengel-Eskin
Mohit Bansal
LRM
21
0
0
18 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
93
79
0
18 Sep 2024
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent
Fatemeh Haji
Mazal Bethany
Maryam Tabar
Jason Chiang
Anthony Rios
Peyman Najafirad
LLMAG
LRM
AI4CE
29
4
0
17 Sep 2024
Towards Agentic AI on Particle Accelerators
Antonin Sulc
Thorsten Hellert
Raimund Kammering
Hayden Houscher
Jason St. John
28
1
0
10 Sep 2024
LLM-based multi-agent poetry generation in non-cooperative environments
Ran Zhang
Steffen Eger
LLMAG
29
5
0
05 Sep 2024
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
20
0
0
29 Aug 2024
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations
Yucheng Jiang
Yijia Shao
Dekun Ma
Sina J. Semnani
Monica S. Lam
LLMAG
29
14
0
27 Aug 2024
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation
Samee Arif
Sualeha Farid
Abdul Hameed Azeemi
Awais Athar
Agha Ali Raza
LLMAG
16
7
0
16 Aug 2024
Automated Design of Agentic Systems
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
34
36
0
15 Aug 2024
AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems
Victor C. Dibia
Jingya Chen
Gagan Bansal
Suff Syed
Adam Fourney
Erkang Zhu
Chi Wang
Saleema Amershi
LLMAG
25
5
0
09 Aug 2024
Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate
Yiqun Zhang
Xiaocui Yang
Shi Feng
Daling Wang
Yifei Zhang
Kaisong Song
LLMAG
16
4
0
08 Aug 2024
Jailbreaking Text-to-Image Models with LLM-Based Agents
Yingkai Dong
Zheng Li
Xiangtao Meng
Ning Yu
Shanqing Guo
LLMAG
36
13
0
01 Aug 2024
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
Taiji Li
Zhi Li
Yin Zhang
HILM
17
5
0
31 Jul 2024
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
Danfeng Guo
Sumitaka Honji
LRM
53
0
0
31 Jul 2024
CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
Shougao Zhang
Mengqi Zhou
Yuxi Wang
Chuanchen Luo
Rongyu Wang
Yiwei Li
Xucheng Yin
Zhaoxiang Zhang
Junran Peng
34
7
0
24 Jul 2024
Building Machines that Learn and Think with People
Katherine M. Collins
Ilia Sucholutsky
Umang Bhatt
Kartik Chandra
Lionel Wong
...
Mark K. Ho
Vikash K. Mansinghka
Adrian Weller
Joshua B. Tenenbaum
Thomas L. Griffiths
40
27
0
22 Jul 2024
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma
Satyapriya Krishna
Sebastian Gehrmann
Madhavan Seshadri
Anu Pradhan
Tom Ault
Leslie Barrett
David Rabinowitz
John Doucette
Nhathai Phan
47
6
0
20 Jul 2024
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Shichao Song
Zifan Zheng
Hanyu Wang
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Feiyu Xiong
Zhiyu Li
HILM
LRM
56
24
0
19 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
22
32
0
09 Jul 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Yangyang Yu
Zhiyuan Yao
Haohang Li
Zhiyang Deng
Yupeng Cao
...
Guojun Xiong
Yueru He
Jimin Huang
Dong Li
Qianqian Xie
AIFin
LLMAG
34
13
0
09 Jul 2024
Automated Justification Production for Claim Veracity in Fact Checking: A Survey on Architectures and Approaches
Islam Eldifrawi
Shengrui Wang
Amine Trabelsi
23
8
0
09 Jul 2024
Collective Innovation in Groups of Large Language Models
Eleni Nisioti
Sebastian Risi
Ida Momennejad
Pierre-Yves Oudeyer
Clément Moulin-Frier
LLMAG
18
3
0
07 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
34
5
0
05 Jul 2024
On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton
Noah Y. Siegel
János Kramár
Jonah Brown-Cohen
Samuel Albanie
...
Rishabh Agarwal
David Lindner
Yunhao Tang
Noah D. Goodman
Rohin Shah
ELM
29
28
0
05 Jul 2024
VDMA: Video Question Answering with Dynamically Generated Multi-Agents
Noriyuki Kugo
Tatsuya Ishibashi
Kosuke Ono
Yuji Sato
25
1
0
04 Jul 2024
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control
Yeonji Lee
Sangjun Park
Kyunghyun Cho
Jinyeong Bak
24
1
0
03 Jul 2024
Debate-to-Write: A Persona-Driven Multi-Agent Framework for Diverse Argument Generation
Zhe Hu
Hou Pong Chan
Jing Li
Yu Yin
LLMAG
36
0
0
28 Jun 2024