Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 453 papers shown
STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification
Siyi Du
Xinzhe Luo
D. O’Regan
Chen Qin
62
0
0
08 Mar 2025
Research on Superalignment Should Advance Now with Parallel Optimization of Competence and Conformity
HyunJin Kim
Xiaoyuan Yi
Jing Yao
Muhua Huang
Jinyeong Bak
James Evans
Xing Xie
34
0
0
08 Mar 2025
Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models
Panatchakorn Anantaprayoon
Masahiro Kaneko
Naoaki Okazaki
LRM
KELM
50
0
0
08 Mar 2025
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation
Bang Nguyen
Tingting Du
Mengxia Yu
Lawrence Angrave
Meng-Long Jiang
AI4Ed
64
0
0
07 Mar 2025
Extracting and Emulsifying Cultural Explanation to Improve Multilingual Capability of LLMs
Hamin Koo
Jaehyung Kim
36
0
0
07 Mar 2025
Enhancing Reasoning with Collaboration and Memory
Julie Michelman
Nasrin Baratalipour
Matthew Abueg
LLMAG
FedML
61
1
0
07 Mar 2025
Evaluating open-source Large Language Models for automated fact-checking
Nicoló Fontana
Francesco Corso
Enrico Zuccolotto
Francesco Pierri
HILM
54
0
0
07 Mar 2025
Efficient Algorithms for Verifying Kruskal Rank in Sparse Linear Regression and Related Applications
Fengqin Zhou
41
3
0
06 Mar 2025
AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management
Junyuan Mao
Fanci Meng
Yifan Duan
Miao Yu
X. Jia
Junfeng Fang
Yuxuan Liang
K. Wang
Qingsong Wen
LLMAG
AAML
37
1
0
06 Mar 2025
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
Zichong Li
Xinyu Feng
Yuheng Cai
Zixuan Zhang
Tianyi Liu
Chen Liang
Weizhu Chen
Haoyu Wang
T. Zhao
LRM
48
1
0
06 Mar 2025
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Rui Ye
Shuo Tang
Rui Ge
Yaxin Du
Zhenfei Yin
S. Chen
Jing Shao
LLMAG
85
1
0
05 Mar 2025
Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Zicong He
Boxuan Zhang
Lu Cheng
47
0
0
04 Mar 2025
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
Liping Liu
Chunhong Zhang
Likang Wu
Chuang Zhao
Zheng Hu
Ming He
Jianping Fan
LLMAG
LRM
36
0
0
02 Mar 2025
Rehearse With User: Personalized Opinion Summarization via Role-Playing based on Large Language Models
Yanyue Zhang
Yulan He
Deyu Zhou
31
0
0
01 Mar 2025
PodAgent: A Comprehensive Framework for Podcast Generation
Yujia Xiao
Lei He
Haohan Guo
Fenglong Xie
Tan Lee
34
0
0
01 Mar 2025
The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents
Yifan Duan
Yihong Tang
Xuefeng Bai
Kehai Chen
J. Li
Min Zhang
LLMAG
84
0
0
28 Feb 2025
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
Shalev Lifshitz
Sheila A. McIlraith
Yilun Du
LRM
44
4
0
27 Feb 2025
LangProBe: a Language Programs Benchmark
Shangyin Tan
Lakshya A Agrawal
Arnav Singhvi
Liheng Lai
Michael J Ryan
Dan Klein
Omar Khattab
Koushik Sen
Matei A. Zaharia
64
0
0
27 Feb 2025
Stay Focused: Problem Drift in Multi-Agent Debate
Jonas Becker
Lars Benedikt Kaesberg
Andreas Stephan
Jan Philip Wahle
Terry Ruas
Bela Gipp
47
1
0
26 Feb 2025
Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning
Yanan Chen
Ali Pesaranghader
Tanmana Sadhu
LRM
54
0
0
26 Feb 2025
Multi-LLM Collaborative Search for Complex Problem Solving
Sen Yang
Yafu Li
Wai Lam
Yu Cheng
LLMAG
LRM
66
1
0
26 Feb 2025
Voting or Consensus? Decision-Making in Multi-Agent Debate
Lars Benedikt Kaesberg
Jonas Becker
Jan Philip Wahle
Terry Ruas
Bela Gipp
58
0
0
26 Feb 2025
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble
Zhijun Chen
Jingzheng Li
Pengpeng Chen
Zhuoran Li
Kai Sun
Yuankai Luo
Qianren Mao
Dingqi Yang
Hailong Sun
Philip S. Yu
ELM
50
2
0
25 Feb 2025
Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT
Hediyeh Baban
Sai A Pidapar
Aashutosh Nema
Sichen Lu
LLMAG
67
0
0
25 Feb 2025
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?
Zhenheng Tang
Xiang Liu
Qian Wang
Peijie Dong
Bingsheng He
Xiaowen Chu
Bo Li
LRM
50
1
0
24 Feb 2025
METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling
Bingxuan Li
Yiwei Wang
Jiuxiang Gu
Kai-Wei Chang
Nanyun Peng
AI4CE
57
3
0
24 Feb 2025
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models
Shunchang Liu
Zhuan Shi
Lingjuan Lyu
Yaochu Jin
Boi Faltings
55
2
0
24 Feb 2025
MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions
Yuxuan Liu
Hongda Sun
Wei Liu
Jian Luan
Bo Du
Rui Yan
48
1
0
24 Feb 2025
RewardDS: Privacy-Preserving Fine-Tuning for Large Language Models via Reward Driven Data Synthesis
Jianwei Wang
Junyao Yang
Haoran Li
Huiping Zhuang
Cen Chen
Ziqian Zeng
SyDa
35
0
0
23 Feb 2025
M-MAD: Multidimensional Multi-Agent Debate for Advanced Machine Translation Evaluation
Zhaopeng Feng
Jiayuan Su
Jiamei Zheng
Jiahan Ren
Yan Zhang
Jian Wu
Hongwei Wang
Zuozhu Liu
ELM
198
0
0
21 Feb 2025
S^3cMath: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Yuchen Yan
Jin Jiang
Yang Liu
Yixin Cao
Xin Xu
M. Zhang
Xunliang Cai
Jian Shao
ReLM
LRM
KELM
110
7
0
21 Feb 2025
Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations
Lihu Chen
Shuojie Fu
Gabriel Freedman
Cemre Zor
Guy Martin
James Kinross
Uddhav Vaghela
Ovidiu Serban
Francesca Toni
DeLMO
63
0
0
21 Feb 2025
Autellix: An Efficient Serving Engine for LLM Agents as General Programs
Michael Luo
Xiaoxiang Shi
Colin Cai
Tianjun Zhang
Justin Wong
...
Chi Wang
Yanping Huang
Zhifeng Chen
Joseph E. Gonzalez
Ion Stoica
47
2
0
20 Feb 2025
Optimizing Model Selection for Compound AI Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Matei A. Zaharia
James Y. Zou
Ion Stoica
48
0
0
20 Feb 2025
Counterfactual-Consistency Prompting for Relative Temporal Understanding in Large Language Models
Jongho Kim
Seung-won Hwang
LRM
AI4CE
51
0
0
17 Feb 2025
Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
Zeqing Wang
Wentao Wan
Qiqing Lao
Runmeng Chen
Minjie Lang
Keze Wang
Liang Lin
LRM
92
3
0
17 Feb 2025
Divergent Thoughts toward One Goal: LLM-based Multi-Agent Collaboration System for Electronic Design Automation
Haoyuan Wu
Haisheng Zheng
Zhuolun He
Bei Yu
33
0
0
15 Feb 2025
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology
Fatemeh Ghezloo
M. S. Seyfioglu
Rustin Soraki
Wisdom O. Ikezogwo
Beibin Li
Tejoram Vivekanandan
J. Elmore
Ranjay Krishna
Linda G. Shapiro
86
2
0
13 Feb 2025
LLMs can implicitly learn from mistakes in-context
Lisa Alazraki
Maximilian Mozes
Jon Ander Campos
Yi Chern Tan
Marek Rei
Max Bartolo
ReLM
LRM
88
0
0
12 Feb 2025
EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Guibin Zhang
Kaijie Chen
Guancheng Wan
Heng Chang
Hong Cheng
K. Wang
Shuyue Hu
Lei Bai
69
2
0
11 Feb 2025
Don't Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification
Peipei Wei
Dimitris Dimitriadis
Yan Xu
Mingwei Shen
55
0
0
11 Feb 2025
ConMeC: A Dataset for Metonymy Resolution with Common Nouns
Saptarshi Ghosh
Tianyu Jiang
80
0
0
10 Feb 2025
Preventing Rogue Agents Improves Multi-Agent Collaboration
Ohav Barbi
Ori Yoran
Mor Geva
48
1
0
09 Feb 2025
Multi-agent Architecture Search via Agentic Supernet
Guibin Zhang
Luyang Niu
Junfeng Fang
K. Wang
Lei Bai
X. Wang
88
3
0
06 Feb 2025
Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement
Soheil Abbasloo
LRM
39
0
0
04 Feb 2025
PSSD: Making Large Language Models Self-denial via Human Psyche Structure
Jinzhi Liao
Zenghua Liao
Xiang Zhao
LRM
LLMAG
43
0
0
03 Feb 2025
RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Can Jin
Hongwu Peng
Anxiang Zhang
Nuo Chen
Jiahui Zhao
...
K. Li
Shuya Feng
Kai Zhong
Caiwen Ding
Dimitris N. Metaxas
99
2
0
02 Feb 2025
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Wenzhe Li
Yong Lin
Mengzhou Xia
Chi Jin
MoE
74
2
0
02 Feb 2025
MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing
Yuxuan Chen
Xu Zhu
Hua Zhou
Zhuyin Ren
AI4CE
39
3
0
01 Feb 2025
GuardReasoner: Towards Reasoning-based LLM Safeguards
Yue Liu
Hongcheng Gao
Shengfang Zhai
Jun-Xiong Xia
Tianyi Wu
Zhiwei Xue
Y. Chen
Kenji Kawaguchi
Jiaheng Zhang
Bryan Hooi
AI4TS
LRM
120
13
0
30 Jan 2025