2310.16944
Cited By
Zephyr: Direct Distillation of LM Alignment
25 October 2023
Lewis Tunstall
E. Beeching
Nathan Lambert
Nazneen Rajani
Kashif Rasul
Younes Belkada
Shengyi Huang
Leandro von Werra
Clémentine Fourrier
Nathan Habib
Nathan Sarrazin
Omar Sanseviero
Alexander M. Rush
Thomas Wolf
ALM
Papers citing
"Zephyr: Direct Distillation of LM Alignment"
50 / 257 papers shown
Title
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang
Zhihan Liu
Boyi Liu
Y. Zhang
Yingxiang Yang
Y. Liu
Liyu Chen
Tao Sun
Z. Wang
87
2
0
10 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
48
7
0
09 Oct 2024
One2set + Large Language Model: Best Partners for Keyphrase Generation
Liangying Shao
Liang Zhang
Minlong Peng
Guoqi Ma
Hao Yue
Mingming Sun
Jinsong Su
39
0
0
04 Oct 2024
Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback
Kyuyoung Kim
Ah Jeong Seo
Hao Liu
Jinwoo Shin
Kimin Lee
16
2
0
04 Oct 2024
Erasing Conceptual Knowledge from Language Models
Rohit Gandikota
Sheridan Feucht
Samuel Marks
David Bau
KELM
ELM
MU
40
5
0
03 Oct 2024
Strong Preferences Affect the Robustness of Preference Models and Value Alignment
Ziwei Xu
Mohan Kankanhalli
AAML
19
0
0
03 Oct 2024
Generate then Refine: Data Augmentation for Zero-shot Intent Detection
I-Fan Lin
Faegheh Hasibi
Suzan Verberne
VLM
15
2
0
02 Oct 2024
Conversational Exploratory Search of Scholarly Publications Using Knowledge Graphs
Phillip Schneider
Florian Matthes
18
0
0
01 Oct 2024
Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done!
Divya Patel
Pathik Patel
Ankush Chander
Sourish Dasgupta
Tanmoy Chakraborty
16
1
0
30 Sep 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
31
9
0
30 Sep 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALM
MQ
40
5
0
30 Sep 2024
Few-shot Prompting for Pairwise Ranking: An Effective Non-Parametric Retrieval Model
Nilanjan Sinhababu
Andrew Parry
Debasis Ganguly
D. Samanta
Pabitra Mitra
29
3
0
26 Sep 2024
Open-World Evaluation for Retrieving Diverse Perspectives
Hung-Ting Chen
Eunsol Choi
30
0
0
26 Sep 2024
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Jinchuan Zhang
Yan Zhou
Yaxin Liu
Ziming Li
Songlin Hu
AAML
20
3
0
25 Sep 2024
Archon: An Architecture Search Framework for Inference-Time Techniques
Jon Saad-Falcon
Adrian Gamarra Lafuente
Shlok Natarajan
Nahum Maru
Hristo Todorov
...
E. Kelly Buchanan
Mayee Chen
Neel Guha
Christopher Ré
Azalia Mirhoseini
AI4CE
31
19
0
23 Sep 2024
RMCBench: Benchmarking Large Language Models' Resistance to Malicious Code
Jiachi Chen
Qingyuan Zhong
Yanlin Wang
Kaiwen Ning
Yongkun Liu
Zenan Xu
Zhe Zhao
Ting Chen
Zibin Zheng
AAML
15
7
0
23 Sep 2024
Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization
Aseem Srivastava
Smriti Joshi
Tanmoy Chakraborty
Md. Shad Akhtar
19
3
0
23 Sep 2024
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
Daniele Rege Cambrin
Giuseppe Gallipoli
Irene Benedetto
Luca Cagliero
Paolo Garza
23
0
0
20 Sep 2024
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang
Dading Chong
Feng Jiang
Chengguang Tang
Anningzhe Gao
Guohua Tang
Haizhou Li
ALM
29
2
0
20 Sep 2024
Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments
M. Rigaki
C. Catania
Sebastian Garcia
LLMAG
24
3
0
17 Sep 2024
Self-Evolutionary Large Language Models through Uncertainty-Enhanced Preference Optimization
Jianing Wang
Yang Zhou
Xiaocheng Zhang
Mengjiao Bao
Peng Yan
25
1
0
17 Sep 2024
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan-Chieh Jackson Wang
Alexander Bukharin
Haoming Jiang
Qingyu Yin
Zhengyang Wang
...
Chao Zhang
Bing Yin
Xian Li
Jianshu Chen
Shiyang Li
ALM
21
1
0
10 Sep 2024
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Yong Lin
Skyler Seto
Maartje ter Hoeve
Katherine Metcalf
B. Theobald
Xuan Wang
Yizhe Zhang
Chen Huang
Tong Zhang
31
12
0
05 Sep 2024
LLM Detectors Still Fall Short of Real World: Case of LLM-Generated Short News-Like Posts
Henrique Da Silva Gameiro
Andrei Kucharavy
Ljiljana Dolamic
21
2
0
05 Sep 2024
A Comparative Study on Large Language Models for Log Parsing
Merve Astekin
Max Hort
Leon Moonen
43
2
0
04 Sep 2024
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
Lifan Jiang
Zhihui Wang
Siqi Yin
Guangxiao Ma
Peng Zhang
Boxi Wu
DiffM
51
0
0
28 Aug 2024
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Wenxuan Zhang
Philip H. S. Torr
Mohamed Elhoseiny
Adel Bibi
48
9
0
27 Aug 2024
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Kai Xiong
Xiao Ding
Li Du
Jiahao Ying
Ting Liu
Bing Qin
Yixin Cao
34
1
0
21 Aug 2024
Value Alignment from Unstructured Text
Inkit Padhi
K. Ramamurthy
P. Sattigeri
Manish Nagireddy
Pierre L. Dognin
Kush R. Varshney
24
0
0
19 Aug 2024
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Yuxin Jiang
Bo Huang
Yufei Wang
Xingshan Zeng
Liangyou Li
Yasheng Wang
Xin Jiang
Lifeng Shang
Ruiming Tang
Wei Wang
42
5
0
14 Aug 2024
Hybrid Student-Teacher Large Language Model Refinement for Cancer Toxicity Symptom Extraction
Reza Khanmohammadi
A. Ghanem
Kyle Verdecchia
Ryan Hall
Mohamed Elshaikh
...
Bing Luo
I. Chetty
Tuka Alhanai
Kundan Thind
Mohammad M. Ghassemi
40
0
0
08 Aug 2024
Better Alignment with Instruction Back-and-Forth Translation
Thao Nguyen
Jeffrey Li
Sewoong Oh
Ludwig Schmidt
Jason Weston
Luke Zettlemoyer
Xian Li
SyDa
27
6
0
08 Aug 2024
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models
Bowen Wang
Jiuyang Chang
Yiming Qian
Guoxin Chen
Junhao Chen
Zhouqiang Jiang
Jiahao Zhang
Yuta Nakashima
Hajime Nagahara
LM&MA
ELM
LRM
38
3
0
04 Aug 2024
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu
Xuan Qi
Zehan Qi
Wei Xu
Zhijiang Guo
ELM
36
5
0
02 Aug 2024
Automated Software Vulnerability Static Code Analysis Using Generative Pre-Trained Transformer Models
Elijah Pelofske
Vincent Urias
L. Liebrock
33
1
0
31 Jul 2024
Do LLMs Really Adapt to Domains? An Ontology Learning Perspective
Huu Tan Mai
Cuong Xuan Chu
Heiko Paulheim
23
6
0
29 Jul 2024
Self-Directed Synthetic Dialogues and Revisions Technical Report
Nathan Lambert
Hailey Schoelkopf
Aaron Gokaslan
Luca Soldaini
Valentina Pyatkin
Louis Castricato
SyDa
43
3
0
25 Jul 2024
Exploring Description-Augmented Dataless Intent Classification
Ruoyu Hu
Foaad Khosmood
Abbas Edalat
AI4TS
16
0
0
25 Jul 2024
I Could've Asked That: Reformulating Unanswerable Questions
Wenting Zhao
Ge Gao
Claire Cardie
Alexander M. Rush
ELM
17
1
0
24 Jul 2024
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
34
11
0
22 Jul 2024
Adversarial Databases Improve Success in Retrieval-based Large Language Models
Sean Wu
Michael Koo
Li Yo Kao
Andy Black
L. Blum
Fabien Scalzo
Ira Kurtz
RALM
27
0
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm
AI4CE
31
0
0
19 Jul 2024
States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly
Junhao Chen
Shengding Hu
Zhiyuan Liu
Maosong Sun
LRM
30
5
0
16 Jul 2024
Empowering Few-Shot Relation Extraction with The Integration of Traditional RE Methods and Large Language Models
Ye Liu
Kai Zhang
Aoran Gan
Linan Yue
Feng Hu
Qi Liu
Enhong Chen
22
0
0
12 Jul 2024
DALL-M: Context-Aware Clinical Data Augmentation with LLMs
Chihcheng Hsieh
Catarina Moreira
Isabel Blanco Nobre
Sandra Costa Sousa
Chun Ouyang
M. Brereton
Joaquim A. Jorge
Jacinto C. Nascimento
41
0
0
11 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
58
6
1
10 Jul 2024
Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
Shaltiel Shmidman
Avi Shmidman
Amir DN Cohen
Moshe Koppel
22
2
0
09 Jul 2024
LIONs: An Empirically Optimized Approach to Align Language Models
Xiao Yu
Qingyang Wu
Yu Li
Zhou Yu
ALM
27
3
0
09 Jul 2024
Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments
Roland Daynauth
Jason Mars
ALM
20
0
0
05 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
44
20
0
02 Jul 2024