Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2204.08499
Cited By
v1
v2
v3 (latest)
DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning
International Conference on Database and Expert Systems Applications (DEXA), 2022
18 April 2022
Chengcheng Guo
B. Zhao
Yanbing Bai
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning"
50 / 63 papers shown
Title
VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction
Shaobo Wang
Tianle Niu
Runkang Yang
Deshan Liu
Xu He
Zichen Wen
Conghui He
Xuming Hu
Linfeng Zhang
VGen
154
0
0
24 Nov 2025
BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation
Rachit Saluja
Asli Cihangir
Ruining Deng
Johannes C. Paetzold
Fengbei Liu
M. Sabuncu
MedIm
172
0
0
24 Nov 2025
UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective
Furui Xu
Shaobo Wang
J. Zhang
Chenghao Sun
Haixiang Tang
Linfeng Zhang
80
0
0
17 Nov 2025
A Feedback-Control Framework for Efficient Dataset Collection from In-Vehicle Data Streams
Philipp Reis
Philipp Rigoll
Christian Steinhauser
Jacob Langner
Eric Sax
109
0
0
05 Nov 2025
Adaptive Data Selection for Multi-Layer Perceptron Training: A Sub-linear Value-Driven Method
Xiyang Zhang
Chen Liang
Haoxuan Qiu
Hongzhi Wang
96
0
0
24 Oct 2025
Performance-Efficiency Trade-off for Fashion Image Retrieval
Julio Hurtado
Haoran Ni
Duygu Sap
Connor Mattinson
Martin Lotz
80
0
0
29 Sep 2025
HyperCore: Coreset Selection under Noise via Hypersphere Models
Brian B. Moser
Arundhati S. Shanbhag
Tobias Christian Nauen
Stanislav Frolov
Federico Raue
Joachim Folz
Andreas Dengel
79
0
0
26 Sep 2025
SubZeroCore: A Submodular Approach with Zero Training for Coreset Selection
Brian B. Moser
Tobias Christian Nauen
Arundhati S. Shanbhag
Federico Raue
Stanislav Frolov
Joachim Folz
Andreas Dengel
116
0
0
26 Sep 2025
Coreset selection based on Intra-class diversity
Imran Ashraf
Mukhtar Ullah
Muhammad Faisal Nadeem
Muhammad Nouman Noor
97
0
0
23 Sep 2025
\emph{FoQuS}: A Forgetting-Quality Coreset Selection Framework for Automatic Modulation Recognition
Yao Lu
Chunfeng Sun
Dongwei Xu
Yun Lin
Qi Xuan
Guan Gui
75
0
0
10 Sep 2025
Coresets from Trajectories: Selecting Data via Correlation of Loss Differences
M. Nagaraj
Deepak Ravikumar
Kaushik Roy
147
2
0
27 Aug 2025
Two-Stage Framework for Efficient UAV-Based Wildfire Video Analysis with Adaptive Compression and Fire Source Detection
Yanbing Bai
Rui-Yang Ju
Lemeng Zhao
Junjie Hu
Jianchao Bi
Erick Mas
Shunichi Koshimura
78
0
0
22 Aug 2025
Dataset Condensation with Color Compensation
Huyu Wu
Duo Su
Junjie Hou
Guang Li
DD
322
0
0
02 Aug 2025
Decouple before Align: Visual Disentanglement Enhances Prompt Tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Fei Zhang
Tianfei Zhou
Jiangchao Yao
Ya Zhang
Ivor W. Tsang
Yanfeng Wang
176
5
0
01 Aug 2025
PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training
Anirudh Satheesh
Anant Khandelwal
Mucong Ding
Radu Balan
AI4CE
96
0
0
23 Jul 2025
The Impact of Coreset Selection on Spurious Correlations and Group Robustness
Amaya Dharmasiri
William Yang
Polina Kirichenko
Lydia Liu
Olga Russakovsky
84
1
0
15 Jul 2025
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection
Zhijing Wan
Zhixiang Wang
Zheng Wang
Xin Xu
Shiníchi Satoh
225
1
0
17 Jun 2025
Effective Data Pruning through Score Extrapolation
Sebastian Schmidt
Prasanga Dhungel
Christoffer Löffler
Bjorn Nieth
Stephan Günnemann
Leo Schwinn
SyDa
250
1
0
10 Jun 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
Jiarui Jin
Yuwei Wu
Haoxuan Li
Xiaoting He
Weinan Zhang
Y. Yang
Yong Yu
Jun Wang
Mengyue Yang
239
2
0
06 Jun 2025
GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems
Tiehua Mei
Hengrui Chen
Peng Yu
Jiaqing Liang
Deqing Yang
278
0
0
04 Jun 2025
A Coreset Selection of Coreset Selection Literature: Introduction and Recent Advances
Brian B. Moser
Arundhati S. Shanbhag
Stanislav Frolov
Federico Raue
Joachim Folz
Andreas Dengel
400
7
0
23 May 2025
DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Zekai Li
Xinhao Zhong
Samir Khaki
Zhiyuan Liang
Yuhao Zhou
...
Konstantinos N Plataniotis
Zinan Lin
Bo Zhao
Yang You
Kai Wang
DD
564
9
0
19 May 2025
Evolution Meets Diffusion: Efficient Neural Architecture Generation
Bingye Zhou
Caiyang Yu
Chenwei Tang
DiffM
523
0
0
24 Apr 2025
PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity
Mustafa Burak Gurbuz
Xingyu Zheng
C. Dovrolis
OOD
516
0
0
07 Apr 2025
A Large-Scale Study on Video Action Dataset Condensation
Yang Chen
Sheng Guo
Bo Zheng
Limin Wang
DD
380
6
0
13 Mar 2025
PersonaX: A Recommendation Agent Oriented User Modeling Framework for Long Behavior Sequence
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yunxiao Shi
Wujiang Xu
Zeqi Zhang
Xing Zi
Qiang Wu
Min Xu
311
9
0
04 Mar 2025
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Lakshmi Nair
Ian Trase
Mark Kim
AIFin
LRM
AI4CE
285
2
0
18 Feb 2025
Does Training with Synthetic Data Truly Protect Privacy?
International Conference on Learning Representations (ICLR), 2025
Yunpeng Zhao
Jie Zhang
279
4
0
18 Feb 2025
InsBank: Evolving Instruction Subset for Ongoing Alignment
Jiayi Shi
Yiwei Li
Shaoxiong Feng
Peiwen Yuan
Xiaobei Wang
...
Chuyi Tan
Boyuan Pan
Huan Ren
Yao Hu
Kan Li
ALM
286
0
0
17 Feb 2025
On Learning Representations for Tabular Data Distillation
Inwon Kang
Parikshit Ram
Yi Zhou
Horst Samulowitz
Oshani Seneviratne
DD
224
0
0
23 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
687
21
0
31 Dec 2024
TAROT: Targeted Data Selection via Optimal Transport
Lan Feng
Fan Nie
Yuejiang Liu
Alexandre Alahi
OT
453
2
0
30 Nov 2024
Efficient Alignment of Large Language Models via Data Sampling
Amrit Khera
Rajat Ghosh
Debojyoti Dutta
410
1
0
15 Nov 2024
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kiran Purohit
Venktesh V
Raghuram Devalla
Krishna Mohan Yerragorla
Sourangshu Bhattacharya
Avishek Anand
LRM
212
4
0
06 Nov 2024
Efficient Biological Data Acquisition through Inference Set Design
International Conference on Learning Representations (ICLR), 2024
Ihor Neporozhnii
Julien Roy
Emmanuel Bengio
Jason Hartford
263
2
0
25 Oct 2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Jinghan Jia
Jiancheng Liu
Yihua Zhang
Parikshit Ram
Nathalie Baracaldo
Sijia Liu
MU
337
15
0
23 Oct 2024
Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
International Conference on Learning Representations (ICLR), 2024
Tianchi Xie
Jiangning Zhu
Guozu Ma
Minzhi Lin
Wei Chen
Weikai Yang
Shixia Liu
336
2
0
03 Oct 2024
Efficient Data Subset Selection to Generalize Training Across Models: Transductive and Inductive Networks
Neural Information Processing Systems (NeurIPS), 2024
Eeshaan Jain
Tushar Nandy
Gaurav Aggarwal
Ashish Tendulkar
Rishabh K. Iyer
A. De
180
20
0
18 Sep 2024
Diverse Subset Selection via Norm-Based Sampling and Orthogonality
Noga Bar
Raja Giryes
CVBM
310
1
0
03 Jun 2024
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Zachary Ankner
Cody Blakeney
Kartik K. Sreenivasan
Max Marion
Matthew L. Leavitt
Mansheej Paul
271
61
0
30 May 2024
Spectral Greedy Coresets for Graph Neural Networks
Mucong Ding
Yinhan He
Jundong Li
Furong Huang
178
3
0
27 May 2024
Distilling the Knowledge in Data Pruning
Emanuel Ben-Baruch
Adam Botach
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
171
2
0
12 Mar 2024
Unlocking Dataset Distillation with Diffusion Models
Brian B. Moser
Federico Raue
Sebastián M. Palacio
Stanislav Frolov
Andreas Dengel
DD
494
23
0
06 Mar 2024
How to Train Data-Efficient LLMs
Noveen Sachdeva
Benjamin Coleman
Wang-Cheng Kang
Jianmo Ni
Lichan Hong
Ed H. Chi
James Caverlee
Julian McAuley
D. Cheng
215
89
0
15 Feb 2024
Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap
International Conference on Learning Representations (ICLR), 2024
Christopher Liao
Christian So
Theodoros Tsiligkaridis
Brian Kulis
285
1
0
06 Feb 2024
Sketch and shift: a robust decoder for compressive clustering
Ayoub Belhadji
Rémi Gribonval
200
2
0
15 Dec 2023
D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
A. Maharana
Prateek Yadav
Mohit Bansal
248
46
0
11 Oct 2023
Exploring Data Redundancy in Real-world Image Classification through Data Selection
Zhenyu Tang
Shaoting Zhang
Xiaosong Wang
145
3
0
25 Jun 2023
Large-scale Dataset Pruning with Dynamic Uncertainty
Muyang He
Shuo Yang
Tiejun Huang
Bo Zhao
295
50
0
08 Jun 2023
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning
International Conference on Learning Representations (ICLR), 2023
Patrik Okanovic
R. Waleffe
Vasilis Mageirakos
Konstantinos E. Nikolakakis
Amin Karbasi
Dionysis Kalogerias
Nezihe Merve Gürel
Theodoros Rekatsinas
DD
209
23
0
28 May 2023
1
2
Next