2501.01849
A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses
3 January 2025
Xiangxiang Dai
Yuejin Xie
Maoli Liu
Xuchuang Wang
Zhuohua Li
Huanyu Wang
J. C. Lui
LLMAG
Papers citing
"A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses"
6 / 6 papers shown
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2025
Wei Yang
Jiacheng Pang
Shixuan Li
P. Bogdan
Stephen Tu
Jesse Thomason
LLMAG
451
9
0
08 Nov 2025
Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning
Wei Yang
Jesse Thomason
255
9
0
04 Sep 2025
ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Cheng Qian
Hongyi Du
Hongru Wang
Xiusi Chen
Yuji Zhang
Avirup Sil
Chengxiang Zhai
Kathleen McKeown
Heng Ji
LLMAG
353
6
0
21 May 2025
Survey: Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf
Raphael Feraud
398
4
0
19 May 2025
SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection
Chenyang Li
Wenxuan Liu
Guoqiang Gong
Xiaobo Ding
Zhuo Zhou
261
0
0
31 Mar 2025
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning
Yupei Li
M. Milling
Björn Schuller
AI4CE
698
5
0
27 Mar 2025