ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.01849
  4. Cited By
A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses
v1v2 (latest)

A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses

3 January 2025
Xiangxiang Dai
Yuejin Xie
Maoli Liu
Xuchuang Wang
Zhuohua Li
Huanyu Wang
J. C. Lui
    LLMAG
ArXiv (abs)PDFHTMLGithub (1★)

Papers citing "A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses"

6 / 6 papers shown
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2025
Wei Yang
Jiacheng Pang
Shixuan Li
P. Bogdan
Stephen Tu
Jesse Thomason
LLMAG
451
9
0
08 Nov 2025
Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning
Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning
Wei Yang
Jesse Thomason
255
9
0
04 Sep 2025
ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Cheng Qian
Hongyi Du
Hongru Wang
Xiusi Chen
Yuji Zhang
Avirup Sil
Chengxiang Zhai
Kathleen McKeown
Heng Ji
LLMAG
353
6
0
21 May 2025
Survey: Multi-Armed Bandits Meet Large Language Models
Survey: Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf
Raphael Feraud
398
4
0
19 May 2025
SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection
SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection
Chenyang Li
Wenxuan Liu
Guoqiang Gong
Xiaobo Ding
Zhuo Zhou
261
0
0
31 Mar 2025
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning
Yupei Li
M. Milling
Björn Schuller
AI4CE
698
5
0
27 Mar 2025
1
Page 1 of 1