2402.00530
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
1 February 2024
Ming Li
Yong Zhang
Shwai He
Zhitao Li
Hongyu Zhao
Jianzong Wang
Ning Cheng
Wanrong Zhu
Papers citing
"Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning"
50 / 56 papers shown
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Cehao Yang
Xueyuan Lin
Chengjin Xu
Xuhui Jiang
Xiaojun Wu
Honghao Liu
Hui Xiong
Jian Guo
LRM
361
4
0
24 Dec 2025
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
Sultan AlRashed
Chadi Helwe
Francesco Orabona
MoE
135
0
0
23 Nov 2025
Revisiting the Data Sampling in Multimodal Post-training from a Difficulty-Distinguish View
Jianyu Qi
Ding Zou
Wenrui Yan
Rui Ma
Jiaxu Li
Zhijie Zheng
Zhiguo Yang
Rongchang Zhao
LRM
310
0
0
10 Nov 2025
Importance-Aware Data Selection for Efficient LLM Instruction Tuning
Tingyu Jiang
Shen Li
Yiyao Song
Lan Zhang
Hualei Zhu
Yuan Zhao
Xiaohang Xu
Kenjiro Taura
Hao Henry Wang
428
5
0
10 Nov 2025
Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains
P. Wang
Hongcheng Liu
Yusheng Liao
Ziqing Fan
Yaxin Du
Shuo Tang
Y. Wang
Y Samuel Wang
169
2
0
10 Nov 2025
BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning
Qianli Shen
Daoyuan Chen
Yilun Huang
Zhenqing Ling
Yaliang Li
Bolin Ding
Jingren Zhou
OffRL
228
4
0
30 Oct 2025
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Cancheng Zhang
Xiangdong Zhang
Mingquan Feng
Jingzhi Wang
Junchi Yan
178
2
0
21 Oct 2025
Alibaba International E-commerce Product Search Competition DILAB Team Technical Report
Hyewon Lee
Junghyun Oh
Minkyung Song
SoYoung Park
Seunghoon Han
126
0
0
21 Oct 2025
On the Role of Preference Variance in Preference Optimization
Jiacheng Guo
Zihao Li
Jiahao Qiu
Yue Wu
Mengdi Wang
207
3
0
14 Oct 2025
Does Weak-to-strong Generalization Happen under Spurious Correlations?
Chenruo Liu
Yijun Dong
Qi Lei
195
0
0
28 Sep 2025
Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory
Ming Li
Nan Zhang
Chenrui Fan
Hong Jiao
Yanbin Fu
Sydney Peters
Qingshu Xu
Robert Lissitz
Tianyi Zhou
LRM
192
7
0
18 Sep 2025
GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning
Chenglong Wang
Yongyu Mu
Hang Zhou
Yifu Huo
Ziming Zhu
...
Tong Xiao
Xiaoyang Hao
Chunliang Zhang
Fandong Meng
Jingbo Zhu
OffRL
LRM
VLM
389
1
0
02 Sep 2025
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang
Xin Gao
Qizhi Pei
Zhuoshi Pan
Mengzhang Cai
Jiang Wu
Conghui He
Lijun Wu
SyDa
392
3
0
29 Aug 2025
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Yanbo Dai
Zhenlan Ji
Zongjie Li
Kuan Li
Shuai Wang
SILM
AAML
KELM
220
2
0
27 Aug 2025
Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation
Tianjun Wei
Huizhong Guo
Yingpeng Du
Zhu Sun
Chen Huang
Dongxia Wang
Jie Zhang
ALM
378
2
0
25 Aug 2025
Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations
Peng Lai
Jianjie Zheng
Sijie Cheng
Yun-Nung Chen
Peng Li
Yang Liu
Guanhua Chen
287
3
0
05 Aug 2025
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Runpeng Yu
Qi Li
Xinchao Wang
DiffM
AI4CE
621
39
0
16 Jun 2025
Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models
Jijie Li
Li Du
Hanyu Zhao
Bo Zhang
Liangdong Wang
Boyan Gao
Guang Liu
Yonghua Lin
ALM
SyDa
228
37
0
09 Jun 2025
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
Yanjun Fu
Faisal Hamman
Sanghamitra Dutta
ALM
458
8
0
02 Jun 2025
GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yi Jiang
Sendong Zhao
Jianbo Li
Haochun Wang
Bing Qin
RALM
460
9
0
24 May 2025
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
Yanggan Gu
Zhaoyi Yan
Yuanyi Wang
Yiming Zhang
Qi Zhou
Leilei Gan
Hongxia Yang
386
3
0
20 May 2025
ProDS: Preference-oriented Data Selection for Instruction Tuning
Wenya Guo
Zhengkun Zhang
Xumeng Liu
Ying Zhang
Ziyu Lu
Haoze Zhu
Xubo Liu
Ruxue Yan
330
1
0
19 May 2025
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
608
2
0
18 May 2025
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
Ming Li
Yongqian Li
Ziyue Li
Tianyi Zhou
LRM
375
8
0
14 Apr 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
Xinze Wang
Zhiyong Yang
Chao Feng
Hongjin Lu
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
Furong Huang
Lijuan Wang
OODD
ReLM
LRM
VLM
680
107
0
10 Apr 2025
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning
Peiyi Lin
Fukai Zhang
Kai Niu
Hao Fu
CLL
372
0
0
20 Mar 2025
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Jia Zhang
Chen-Xi Zhang
Wenshu Fan
Yi-Xuan Jin
Xiao-Wen Yang
Bo Zheng
Yi Liu
Lan-Zhe Guo
437
15
0
14 Mar 2025
ATLaS: Agent Tuning via Learning Critical Steps
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhixun Chen
Ming Li
Yuanmin Huang
Yali Du
Meng Fang
Wanrong Zhu
669
17
0
04 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Hui Yuan
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
449
7
0
04 Mar 2025
CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom
Yisen Li
Lingfeng Yang
Wenxuan Shen
Pan Zhou
Yao Wan
Weiwei Lin
Benlin Liu
334
5
0
03 Mar 2025
Large-Scale Data Selection for Instruction Tuning
Michal Guerquin
Muru Zhang
Faeze Brahman
Pang Wei Koh
Pradeep Dasigi
ALM
426
23
0
03 Mar 2025
Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning
Hongyi Cai
Jie Li
Mohammad Mahdinur Rahman
Wenzhen Dong
460
1
0
26 Feb 2025
MergeIT: From Selection to Merging for Efficient Instruction Tuning
Hongyi Cai
Yuqian Fu
Hongming Fu
Bo Zhao
MoMe
427
1
0
25 Feb 2025
From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ruxiao Chen
Chenguang Wang
Yuran Sun
Xilei Zhao
Susu Xu
536
12
0
24 Feb 2025
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
Hantao Lou
Changye Li
Yalan Qin
Yaodong Yang
445
12
0
22 Feb 2025
Unhackable Temporal Rewarding for Scalable Video MLLMs
En Yu
Kangheng Lin
Liang Zhao
Yana Wei
Zining Zhu
...
Jianjian Sun
Zheng Ge
Xinsong Zhang
Jingyu Wang
Wenbing Tao
311
26
0
17 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Transactions of the Association for Computational Linguistics (TACL), 2024
Jing Yang
Max Glockner
Anderson de Rezende Rocha
Iryna Gurevych
LRM
485
1
0
07 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
641
32
0
06 Feb 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
940
27
0
31 Dec 2024
ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation
International Conference on Computational Linguistics (COLING), 2024
Weilong Dong
Xinwei Wu
Renren Jin
Shaoyang Xu
Deyi Xiong
361
11
0
31 Dec 2024
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yuchen Fan
Yuzhong Hong
Qiushi Wang
Junwei Bao
Hongfei Jiang
Yang Song
340
7
0
17 Dec 2024
Stronger Models are NOT Stronger Teachers for Instruction Tuning
Zhangchen Xu
Fengqing Jiang
Luyao Niu
Bill Yuchen Lin
Radha Poovendran
ALM
468
15
0
11 Nov 2024
Weak-to-Strong Generalization beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal Reasoning
Ruimeng Ye
Yang Xiao
Bo Hui
ALM
ELM
OffRL
369
6
0
16 Oct 2024
Mastering the Craft of Data Synthesis for CodeLLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Meng Chen
Philip Arthur
Qianyu Feng
Cong Duy Vu Hoang
Yu-Heng Hong
...
Mark Johnson
Kemal Kurniawan
Don Dharmasiri
Long Duong
Yuan-Fang Li
SyDa
727
4
0
16 Oct 2024
Federated Data-Efficient Instruction Tuning for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhen Qin
Zhaomin Wu
Bingsheng He
Shuiguang Deng
FedML
424
8
0
14 Oct 2024
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Tingchen Fu
Mrinank Sharma
Juil Sock
Shay B. Cohen
David M. Krueger
Fazl Barez
AAML
589
29
0
11 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
International Conference on Learning Representations (ICLR), 2024
Yougang Lyu
Lingyong Yan
Zihan Wang
D. Yin
Sudipta Singha Roy
Maarten de Rijke
Zhaochun Ren
677
21
0
10 Oct 2024
Your Weak LLM is Secretly a Strong Teacher for Alignment
International Conference on Learning Representations (ICLR), 2024
Leitian Tao
Yixuan Li
631
16
0
13 Sep 2024
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm
Yiming Liang
Ge Zhang
Xingwei Qu
Tianyu Zheng
Jiawei Guo
...
Jiaheng Liu
Chenghua Lin
Lei Ma
Wenhao Huang
Jiajun Zhang
ALM
328
23
0
15 Aug 2024
RuleR: Improving LLM Controllability by Rule-based Data Recycling
Ming Li
Han Chen
Chenguang Wang
Dang Nguyen
Dianqi Li
Wanrong Zhu
678
14
0
22 Jun 2024