Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1702.01166
Cited By
v1
v2 (latest)
Optimal Subsampling for Large Sample Logistic Regression
3 February 2017
Haiying Wang
Rong Zhu
Ping Ma
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Optimal Subsampling for Large Sample Logistic Regression"
50 / 61 papers shown
Train on Validation (ToV): Fast data selection with applications to fine-tuning
Ayush Jain
Andrea Montanari
Eren Sasoglu
315
2
0
01 Oct 2025
Core-elements Subsampling for Alternating Least Squares
Dunyao Xue
Mengyu Li
Cheng Meng
Jingyi Zhang
193
0
0
22 Sep 2025
Sublinear Algorithms for Wasserstein and Total Variation Distances: Applications to Fairness and Privacy Auditing
Debabrota Basu
Debarshi Chanda
335
0
0
10 Mar 2025
Novel Subsampling Strategies for Heavily Censored Reliability Data
Statistics and its Interface (SII), 2024
Yixiao Ruan
Z. Li
Zhaohui Li
Dennis K. J. Lin
Qingpei Hu
Dan Yu
204
1
0
30 Oct 2024
Refitted cross-validation estimation for high-dimensional subsamples from low-dimension full data
Haixiang Zhang
Haiying Wang
219
1
0
21 Sep 2024
Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
Yijun Dong
Hoang Phan
Xiang Pan
Qi Lei
523
8
0
08 Jul 2024
Multi-resolution subsampling for large-scale linear classification
Haolin Chen
Holger Dette
Jun Yu
315
1
0
08 Jul 2024
General bounds on the quality of Bayesian coresets
Trevor Campbell
249
3
0
20 May 2024
A model-free subdata selection method for classification
Rakhi Singh
278
0
0
29 Apr 2024
Poisson Regression in one Covariate on Massive Data
Torsten Reuter
Rainer Schwabe
156
0
0
27 Mar 2024
A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
Xuetong Li
Yuan Gao
Hong Chang
Danyang Huang
Yingying Ma
...
Ke Xu
Jing Zhou
Xuening Zhu
Yingqiu Zhu
Hansheng Wang
231
18
0
17 Mar 2024
Subsampling for Big Data Linear Models with Measurement Errors
Jiangshan Ju
Mingqiu Wang
Shengli Zhao
245
2
0
07 Mar 2024
A Provably Accurate Randomized Sampling Algorithm for Logistic Regression
Agniva Chowdhury
Pradeep Ramuhalli
269
1
0
26 Feb 2024
Towards a statistical theory of data selection under weak supervision
International Conference on Learning Representations (ICLR), 2023
Germain Kolossov
Andrea Montanari
Pulkit Tandon
354
27
0
25 Sep 2023
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
Yong Lin
Chen Liu
Chen Ye
Qing Lian
Xingtai Lv
Tong Zhang
298
5
0
05 Sep 2023
On the asymptotic properties of a bagging estimator with a massive dataset
Yuan Gao
Riquan Zhang
Hansheng Wang
197
1
0
13 Apr 2023
Optimal subsampling designs
Henrik Imberg
Marina Axelson-Fisk
J. Jonasson
213
3
0
06 Apr 2023
Optimal Sampling Designs for Multi-dimensional Streaming Time Series with Application to Power Grid Sensor Data
Annals of Applied Statistics (AOAS), 2023
Rui Xie
Shuyang Bai
Ping Ma
AI4TS
177
10
0
14 Mar 2023
Gaussian Switch Sampling: A Second Order Approach to Active Learning
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Ryan Benkert
Mohit Prabhushankar
Ghassan Al-Regib
Armin Pacharmi
E. Corona
AAML
318
13
0
16 Feb 2023
Optimal subsampling for the Cox proportional hazards model with massive survival data
Journal of Statistical Planning and Inference (JSPI), 2023
Nan Qiao
Wangcheng Li
Fengjun Xiao
Cunjie Lin
Yong Zhou
218
6
0
05 Feb 2023
A Coreset Learning Reality Check
AAAI Conference on Artificial Intelligence (AAAI), 2023
Fred Lu
Edward Raff
James Holt
179
5
0
15 Jan 2023
Optimal subsampling algorithm for composite quantile regression with distributed data
Computational statistics (Zeitschrift) (CSZ), 2023
Xiaohui Yuan
Shiting Zhou
Yue Wang
126
3
0
06 Jan 2023
Least product relative error estimation for functional multiplicative model and optimal subsampling
Qian Yan
Hanyu Li
145
0
0
03 Jan 2023
Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples
Henrik Imberg
Xiaomi Yang
Carol Flannagan
Jonas Bärgman
508
11
0
20 Dec 2022
Fast Calibration for Computer Models with Massive Physical Observations
Shurui Lv
Yan Wang
Junrong Yu
121
3
0
23 Nov 2022
Approximating Partial Likelihood Estimators via Optimal Subsampling
Journal of Computational And Graphical Statistics (JCGS), 2022
Haixiang Zhang
Lulu Zuo
Haiying Wang
Liuquan Sun
345
17
0
10 Oct 2022
Unweighted estimation based on optimal sample under measurement constraints
Canadian journal of statistics (CJS), 2022
Jing Wang
Haiying Wang
Shifeng Xiong
220
4
0
08 Oct 2022
Model-free Subsampling Method Based on Uniform Designs
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Mei Zhang
Yongdao Zhou
Zhengze Zhou
Aijun Zhang
161
17
0
08 Sep 2022
A sub-sampling algorithm preventing outliers
L. Deldossi
E. Pesce
Chiara Tommasi
133
2
0
12 Aug 2022
Density Regression with Conditional Support Points
Yunlu Chen
N. Zhang
148
0
0
14 Jun 2022
An optimal transport approach for selecting a representative subsample with application in efficient kernel density estimation
Journal of Computational And Graphical Statistics (JCGS), 2022
Jingyi Zhang
Cheng Meng
Jun Yu
Mengrui Zhang
Wenxuan Zhong
Ping Ma
OT
232
21
0
31 May 2022
Sampling with replacement vs Poisson sampling: a comparative study in optimal subsampling
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Jing Wang
Jiahui Zou
Haiying Wang
206
29
0
17 May 2022
Optimal subsampling for functional quantile regression
Statistical Papers (SP), 2022
Qian Yan
Hanyu Li
Chengmei Niu
205
7
0
05 May 2022
Optimal Subsampling for High-dimensional Ridge Regression
Knowledge-Based Systems (KBS), 2022
Hanyu Li
Cheng Niu
218
9
0
18 Apr 2022
Parallel-and-stream accelerator for computationally fast supervised learning
Computational Statistics & Data Analysis (CSDA), 2021
Emily C. Hector
Lan Luo
P. Song
214
10
0
29 Oct 2021
Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data
Neural Information Processing Systems (NeurIPS), 2021
Haiying Wang
Aonan Zhang
Chong-Jun Wang
145
24
0
25 Oct 2021
Functional Principal Subspace Sampling for Large Scale Functional Data Analysis
Electronic Journal of Statistics (EJS), 2021
Shiyuan He
Xiaomeng Yan
343
5
0
08 Sep 2021
Coresets for Classification -- Simplified and Strengthened
Neural Information Processing Systems (NeurIPS), 2021
Tung Mai
Anup B. Rao
Cameron Musco
306
37
0
08 Jun 2021
One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning
Chaosheng Dong
Xiaojie Jin
Weihao Gao
Yijia Wang
Hongyi Zhang
Xiang Wu
Jianchao Yang
Xiaobing Liu
260
6
0
27 Apr 2021
Functional L-Optimality Subsampling for Massive Data
Hua Liu
Jinhong You
Jiguo Cao
305
4
0
08 Apr 2021
On the Subbagging Estimation for Massive Data
Tao Zou
Xian Li
Xuan Liang
Hansheng Wang
154
4
0
28 Feb 2021
Balance-Subsampled Stable Prediction
Kun Kuang
Hengtao Zhang
Leilei Gan
Yueting Zhuang
Aijun Zhang
OOD
155
4
0
08 Jun 2020
Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators with Massive Data
Jun Yu
Haiying Wang
Mingyao Ai
Huiming Zhang
306
134
0
21 May 2020
Statistical inference in massive datasets by empirical likelihood
Computational statistics (Zeitschrift) (CSZ), 2020
Xuejun Ma
Shaochen Wang
Wang Zhou
FedML
212
7
0
18 Apr 2020
Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Ping Ma
Xinlian Zhang
Xin Xing
Jingyi Ma
Michael W. Mahoney
212
71
0
24 Feb 2020
Big Data and model-based survey sampling
Deldossi Laura
Tommasi Chiara
53
3
0
11 Feb 2020
Optimal subsampling for quantile regression in big data
Biometrika (Biometrika), 2020
Haiying Wang
Yanyuan Ma
348
153
0
28 Jan 2020
Randomized Spectral Clustering in Large-Scale Stochastic Block Models
Journal of Computational And Graphical Statistics (JCGS), 2020
Hai Zhang
Xiao Guo
Xiangyu Chang
494
31
0
20 Jan 2020
Communication-Efficient Distributed Estimator for Generalized Linear Models with a Diverging Number of Covariates
Computational Statistics & Data Analysis (CSDA), 2020
Ping Zhou
Zhen Yu
Jingyi Ma
M. Tian
Ye Fan
222
7
0
17 Jan 2020
Logistic regression models for aggregated data
Journal of Computational And Graphical Statistics (JCGS), 2019
Thomas Whitaker
B. Beranger
Scott A. Sisson
224
20
0
09 Dec 2019
1
2
Next
Page 1 of 2