ResearchTrend.AI
arXiv: 2410.10093

How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective

14 October 2024
Teng Xiao
Mingxiao Li
Yige Yuan
Huaisheng Zhu
Chao Cui
V. Honavar
    ALM

Papers citing "How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective"

3 / 3 papers shown
Title
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
47
1
0
05 Mar 2025
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Teng Xiao
Yige Yuan
Z. Chen
Mingxiao Li
Shangsong Liang
Z. Ren
V. Honavar
90
5
0
21 Feb 2025
MITA: Bridging the Gap between Model and Data for Test-time Adaptation
Yige Yuan
Bingbing Xu
Teng Xiao
Liang Hou
Fei Sun
Huawei Shen
Xueqi Cheng
TTA
33
0
0
12 Oct 2024