arXiv: 2410.10093
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
14 October 2024
Teng Xiao
Mingxiao Li
Yige Yuan
Huaisheng Zhu
Chao Cui
V. Honavar
ALM
Papers citing
"How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective"
3 / 3 papers shown
Title
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
47
1
0
05 Mar 2025
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Teng Xiao
Yige Yuan
Z. Chen
Mingxiao Li
Shangsong Liang
Z. Ren
V. Honavar
90
5
0
21 Feb 2025
MITA: Bridging the Gap between Model and Data for Test-time Adaptation
Yige Yuan
Bingbing Xu
Teng Xiao
Liang Hou
Fei Sun
Huawei Shen
Xueqi Cheng
TTA
33
0
0
12 Oct 2024