2006.15412
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Submodular Combinatorial Information Measures with Applications in Machine Learning
27 June 2020
Rishabh K. Iyer
Ninad Khargonkar
J. Bilmes
Himanshu Asnani
Papers citing
"Submodular Combinatorial Information Measures with Applications in Machine Learning"
50 / 66 papers shown
SubZeroCore: A Submodular Approach with Zero Training for Coreset Selection
Brian B. Moser
Tobias Christian Nauen
Arundhati S. Shanbhag
Federico Raue
Stanislav Frolov
Joachim Folz
Andreas Dengel
251
0
0
26 Sep 2025
DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression
Wei Huang
Huang Wei
Yinggui Wang
276
0
0
01 Sep 2025
InSQuAD: In-Context Learning for Efficient Retrieval via Submodular Mutual Information to Enforce Quality and Diversity
Souradeep Nanda
Anay Majee
Rishabh K. Iyer
175
0
0
28 Aug 2025
Coresets from Trajectories: Selecting Data via Correlation of Loss Differences
M. Nagaraj
Deepak Ravikumar
Kaushik Roy
303
3
0
27 Aug 2025
Dataset Condensation with Color Compensation
Huyu Wu
Duo Su
Junjie Hou
Guang Li
DD
548
3
0
02 Aug 2025
Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws
Hidetaka Kamigaito
Ying Zhang
Jingun Kwon
Katsuhiko Hayashi
Manabu Okumura
Taro Watanabe
MoE
315
3
0
29 May 2025
A Coreset Selection of Coreset Selection Literature: Introduction and Recent Advances
Brian B. Moser
Arundhati S. Shanbhag
Stanislav Frolov
Federico Raue
Joachim Folz
Andreas Dengel
609
14
0
23 May 2025
Data-efficient LLM Fine-tuning for Code Generation
Weijie Lv
X. Xia
Sheng-Jun Huang
ALM
SyDa
206
4
0
17 Apr 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
International Conference on Learning Representations (ICLR), 2025
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
307
1
0
26 Feb 2025
Challenges of Multi-Modal Coreset Selection for Depth Prediction
Viktor Moskvoretskii
Narek Alvandian
245
0
0
20 Feb 2025
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Computer Vision and Pattern Recognition (CVPR), 2024
Arnav M. Das
Gantavya Bhatt
Lilly Kumari
Sahil Verma
J. Bilmes
439
0
0
23 Dec 2024
Adaptive Dataset Quantization
AAAI Conference on Artificial Intelligence (AAAI), 2024
Muquan Li
Dongyang Zhang
Qiang Dong
Xiurui Xie
Ke Qin
DD
MQ
422
9
0
22 Dec 2024
Color-Oriented Redundancy Reduction in Dataset Distillation
Neural Information Processing Systems (NeurIPS), 2024
Bowen Yuan
Zijian Wang
Mahsa Baktashmotlagh
Yadan Luo
Zi Huang
DD
643
7
0
18 Nov 2024
DELIFT: Data Efficient Language model Instruction Fine Tuning
International Conference on Learning Representations (ICLR), 2024
Ishika Agarwal
Krishnateja Killamsetty
Yatin Nandwani
Marina Danilevsky
ALM
VLM
895
10
0
07 Nov 2024
Theoretically Grounded Pruning of Large Ground Sets for Constrained, Discrete Optimization
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Ankur Nath
Alan Kuhnle
244
0
0
23 Oct 2024
Continual Learning on a Data Diet
Elif Ceren Gok Yildirim
Murat Onur Yildirim
Joaquin Vanschoren
CLL
182
1
0
23 Oct 2024
A CLIP-Powered Framework for Robust and Generalizable Data Selection
International Conference on Learning Representations (ICLR), 2024
Steve Yang
Peng Ye
Wanli Ouyang
Dongzhan Zhou
Jian Zhao
447
21
0
15 Oct 2024
Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
International Conference on Learning Representations (ICLR), 2024
Tianchi Xie
Jiangning Zhu
Guozu Ma
Minzhi Lin
Wei Chen
Weikai Yang
Shixia Liu
465
4
0
03 Oct 2024
Distilling Long-tailed Datasets
Computer Vision and Pattern Recognition (CVPR), 2024
Zhenghao Zhao
Haoxuan Wang
Yuzhang Shang
Kai Wang
Yan Yan
DD
458
6
0
24 Aug 2024
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs
Weijie Lv
Xuan Xia
Sheng-Jun Huang
ALM
255
14
0
05 Aug 2024
Dataset Quantization with Active Learning based Adaptive Sampling
Zhenghao Zhao
Yuzhang Shang
Junyi Wu
Yan Yan
DD
345
12
0
09 Jul 2024
Sub-SA: Strengthen In-context Learning via Submodular Selective Annotation
Jian Qian
Miao Sun
Sifan Zhou
Ziyu Zhao
Ruizhi Hun
Patrick Chiang
395
3
0
08 Jul 2024
GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
Matthew Fahrbach
Srikumar Ramalingam
Morteza Zadimoghaddam
Sara Ahmadian
Gui Citovsky
Giulia DeSalvo
336
1
0
29 May 2024
Dataset Growth
Ziheng Qin
Zhaopan Xu
Yukun Zhou
Zangwei Zheng
Zebang Cheng
...
Xiaojiang Peng
Radu Timofte
Hongxun Yao
Kai Wang
Yang You
DD
221
4
0
28 May 2024
Calibrated Dataset Condensation for Faster Hyperparameter Search
Mucong Ding
Yuancheng Xu
Tahseen Rabbani
Xiaoyu Liu
Brian J Gravelle
Teresa M. Ranadive
Tai-Ching Tuan
Furong Huang
DD
256
3
0
27 May 2024
Spectral Greedy Coresets for Graph Neural Networks
Mucong Ding
Yinhan He
Jundong Li
Furong Huang
285
3
0
27 May 2024
SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
Bowei Tian
Ziyao Wang
Zheyu Shen
Guoheng Sun
Yucong Dai
Yongkai Wu
Hongyi Wang
Ang Li
235
20
0
23 Apr 2024
Coreset Selection for Object Detection
Hojun Lee
Suyoung Kim
Junhoo Lee
Jaeyoung Yoo
Nojun Kwak
308
22
0
14 Apr 2024
On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
Maximilian Böther
Abraham Sebastian
Pranjal Awasthi
Ana Klimovic
Srikumar Ramalingam
392
1
0
26 Feb 2024
STENCIL: Submodular Mutual Information Based Weak Supervision for Cold-Start Active Learning
Nathan Beck
Adithya Iyer
Rishabh K. Iyer
334
1
0
21 Feb 2024
Theoretical Analysis of Submodular Information Measures for Targeted Data Subset Selection
Nathan Beck
Truong Pham
Rishabh K. Iyer
290
2
0
21 Feb 2024
Is Adversarial Training with Compressed Datasets Effective?
Tong Chen
Raghavendra Selvan
AAML
649
1
0
08 Feb 2024
Automatic Combination of Sample Selection Strategies for Few-Shot Learning
Branislav Pecher
Ivan Srba
Maria Bielikova
Joaquin Vanschoren
360
4
0
05 Feb 2024
Contributing Dimension Structure of Deep Feature for Coreset Selection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zhijing Wan
Zhixiang Wang
Yuran Wang
Zheng Wang
Hongyuan Zhu
Shin'ichi Satoh
342
11
0
29 Jan 2024
Fusing Conditional Submodular GAN and Programmatic Weak Supervision
AAAI Conference on Artificial Intelligence (AAAI), 2023
Kumar Shubham
Pranav Sastry
AP Prathosh
390
3
0
16 Dec 2023
Benchmarking of Query Strategies: Towards Future Deep Active Learning
Shiryu Ueno
Yusei Yamada
Shunsuke Nakatsuka
Kunihito Kato
FedML
261
2
0
10 Dec 2023
Dataset Distillation in Latent Space
Yuxuan Duan
Jianfu Zhang
Liqing Zhang
DD
299
7
0
27 Nov 2023
Dataset Quantization
IEEE International Conference on Computer Vision (ICCV), 2023
Daquan Zhou
Kaixin Wang
Jianyang Gu
Xiang Peng
Dongze Lian
Yifan Zhang
Yang You
Jiashi Feng
DD
232
65
0
21 Aug 2023
Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Nathan Beck
Krishnateja Killamsetty
Suraj Kothawade
Rishabh K. Iyer
223
7
0
02 Jun 2023
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning
International Conference on Learning Representations (ICLR), 2023
Patrik Okanovic
R. Waleffe
Vasilis Mageirakos
Konstantinos E. Nikolakakis
Amin Karbasi
Dionysis Kalogerias
Nezihe Merve Gürel
Theodoros Rekatsinas
DD
294
28
0
28 May 2023
STREAMLINE: Streaming Active Learning for Realistic Multi-Distributional Settings
Nathan Beck
Suraj Kothawade
Pradeep Shenoy
Rishabh K. Iyer
275
4
0
18 May 2023
INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
H. S. V. N. S. K. Renduchintala
Krishnateja Killamsetty
S. Bhatia
Milan Aggarwal
Ganesh Ramakrishnan
Rishabh K. Iyer
Balaji Krishnamurthy
AIFin
166
5
0
11 May 2023
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
International Conference on Learning Representations (ICLR), 2023
Ziheng Qin
Kaidi Wang
Zangwei Zheng
Jianyang Gu
Xiang Peng
...
Daquan Zhou
Lei Shang
Baigui Sun
Xuansong Xie
Yang You
424
93
0
08 Mar 2023
Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection
Neural Information Processing Systems (NeurIPS), 2022
A. De
Soumen Chakrabarti
437
8
0
20 Oct 2022
DIAGNOSE: Avoiding Out-of-distribution Data using Submodular Information Measures
Suraj Kothawade
Akshit Shrivastava
V. Iyer
Ganesh Ramakrishnan
Rishabh K. Iyer
133
1
0
04 Oct 2022
CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification
Suraj Kothawade
Atharv Savarkar
V. Iyer
Lakshman Tamil
Ganesh Ramakrishnan
Rishabh K. Iyer
182
12
0
04 Oct 2022
Unifying Approaches in Active Learning and Active Sampling via Fisher Information and Information-Theoretic Quantities
Andreas Kirsch
Y. Gal
FedML
299
30
0
01 Aug 2022
Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios
Xueying Zhan
Zeyu Dai
Qingzhong Wang
Qing Li
Haoyi Xiong
Dejing Dou
Antoni B. Chan
OODD
200
3
0
04 Jul 2022
Active Data Discovery: Mining Unknown Data using Submodular Information Measures
Suraj Kothawade
Shivang Chopra
Saikat Ghosh
Rishabh K. Iyer
192
6
0
17 Jun 2022
DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning
International Conference on Database and Expert Systems Applications (DEXA), 2022
Chengcheng Guo
B. Zhao
Yanbing Bai
OOD
570
205
0
18 Apr 2022