ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.13447
  4. Cited By
AQA: Adaptive Question Answering in a Society of LLMs via Contextual
  Multi-Armed Bandit

AQA: Adaptive Question Answering in a Society of LLMs via Contextual Multi-Armed Bandit

20 September 2024
Mohanna Hoveyda
A. D. Vries
Maarten de Rijke
Harrie Oosterhuis
Faegheh Hasibi
ArXivPDFHTML

Papers citing "AQA: Adaptive Question Answering in a Society of LLMs via Contextual Multi-Armed Bandit"

1 / 1 papers shown
Title
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
55
6
0
10 Oct 2024
1