Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.01166
Cited By
v1
v2 (latest)
Optimal Subsampling for Large Sample Logistic Regression
3 February 2017
Haiying Wang
Rong Zhu
Ping Ma
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Optimal Subsampling for Large Sample Logistic Regression"
50 / 59 papers shown
Title
Sublinear Algorithms for Wasserstein and Total Variation Distances: Applications to Fairness and Privacy Auditing
Debabrota Basu
Debarshi Chanda
109
0
0
10 Mar 2025
Novel Subsampling Strategies for Heavily Censored Reliability Data
Yixiao Ruan
Z. Li
Zhaohui Li
Dennis K. J. Lin
Qingpei Hu
Dan Yu
55
0
0
30 Oct 2024
Refitted cross-validation estimation for high-dimensional subsamples from low-dimension full data
Haixiang Zhang
Haiying Wang
52
1
0
21 Sep 2024
Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
Yijun Dong
Hoang Phan
Xiang Pan
Qi Lei
150
6
0
08 Jul 2024
Multi-resolution subsampling for large-scale linear classification
Haolin Chen
Holger Dette
Jun Yu
68
0
0
08 Jul 2024
General bounds on the quality of Bayesian coresets
Trevor Campbell
81
2
0
20 May 2024
A model-free subdata selection method for classification
Rakhi Singh
69
0
0
29 Apr 2024
Poisson Regression in one Covariate on Massive Data
Torsten Reuter
Rainer Schwabe
51
0
0
27 Mar 2024
A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
Xuetong Li
Yuan Gao
Hong Chang
Danyang Huang
Yingying Ma
...
Ke Xu
Jing Zhou
Xuening Zhu
Yingqiu Zhu
Hansheng Wang
68
9
0
17 Mar 2024
Subsampling for Big Data Linear Models with Measurement Errors
Jiangshan Ju
Mingqiu Wang
Shengli Zhao
119
0
0
07 Mar 2024
A Provably Accurate Randomized Sampling Algorithm for Logistic Regression
Agniva Chowdhury
Pradeep Ramuhalli
75
2
0
26 Feb 2024
Towards a statistical theory of data selection under weak supervision
Germain Kolossov
Andrea Montanari
Pulkit Tandon
76
15
0
25 Sep 2023
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
Yong Lin
Chen Liu
Chen Ye
Qing Lian
Yuan Yao
Tong Zhang
102
5
0
05 Sep 2023
On the asymptotic properties of a bagging estimator with a massive dataset
Yuan Gao
Riquan Zhang
Hansheng Wang
42
1
0
13 Apr 2023
Optimal subsampling designs
Henrik Imberg
Marina Axelson-Fisk
J. Jonasson
59
3
0
06 Apr 2023
Optimal Sampling Designs for Multi-dimensional Streaming Time Series with Application to Power Grid Sensor Data
Rui Xie
Shuyang Bai
Ping Ma
AI4TS
48
7
0
14 Mar 2023
Gaussian Switch Sampling: A Second Order Approach to Active Learning
Ryan Benkert
Mohit Prabhushankar
Ghassan Al-Regib
Armin Pacharmi
E. Corona
AAML
90
9
0
16 Feb 2023
Optimal subsampling for the Cox proportional hazards model with massive survival data
Nan Qiao
Wangcheng Li
Fengjun Xiao
Cunjie Lin
Yong Zhou
62
2
0
05 Feb 2023
A Coreset Learning Reality Check
Fred Lu
Edward Raff
James Holt
55
5
0
15 Jan 2023
Optimal subsampling algorithm for composite quantile regression with distributed data
Xiaohui Yuan
Shiting Zhou
Yue Wang
27
2
0
06 Jan 2023
Least product relative error estimation for functional multiplicative model and optimal subsampling
Qian Yan
Hanyu Li
20
0
0
03 Jan 2023
Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples
Henrik Imberg
Xiaomi Yang
Carol Flannagan
Jonas Bärgman
111
10
0
20 Dec 2022
Fast Calibration for Computer Models with Massive Physical Observations
Shurui Lv
Yan Wang
Junrong Yu
41
1
0
23 Nov 2022
Approximating Partial Likelihood Estimators via Optimal Subsampling
Haixiang Zhang
Lulu Zuo
Haiying Wang
Liuquan Sun
53
14
0
10 Oct 2022
Unweighted estimation based on optimal sample under measurement constraints
Jing Wang
Haiying Wang
Shifeng Xiong
59
4
0
08 Oct 2022
Model-free Subsampling Method Based on Uniform Designs
Mei Zhang
Yongdao Zhou
Zhengze Zhou
Aijun Zhang
47
14
0
08 Sep 2022
A sub-sampling algorithm preventing outliers
L. Deldossi
E. Pesce
Chiara Tommasi
16
1
0
12 Aug 2022
Density Regression with Conditional Support Points
Yunlu Chen
N. Zhang
53
0
0
14 Jun 2022
An optimal transport approach for selecting a representative subsample with application in efficient kernel density estimation
Jingyi Zhang
Cheng Meng
Jun Yu
Mengrui Zhang
Wenxuan Zhong
Ping Ma
OT
74
12
0
31 May 2022
Sampling with replacement vs Poisson sampling: a comparative study in optimal subsampling
Jing Wang
Jiahui Zou
Haiying Wang
73
18
0
17 May 2022
Optimal subsampling for functional quantile regression
Qian Yan
Hanyu Li
Chengmei Niu
51
7
0
05 May 2022
Optimal Subsampling for High-dimensional Ridge Regression
Hanyu Li
Cheng Niu
16
3
0
18 Apr 2022
Parallel-and-stream accelerator for computationally fast supervised learning
Emily C. Hector
Lan Luo
P. Song
21
6
0
29 Oct 2021
Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data
Haiying Wang
Aonan Zhang
Chong-Jun Wang
37
19
0
25 Oct 2021
Functional Principal Subspace Sampling for Large Scale Functional Data Analysis
Shiyuan He
Xiaomeng Yan
93
4
0
08 Sep 2021
Coresets for Classification -- Simplified and Strengthened
Tung Mai
Anup B. Rao
Cameron Musco
86
35
0
08 Jun 2021
One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning
Chaosheng Dong
Xiaojie Jin
Weihao Gao
Yijia Wang
Hongyi Zhang
Xiang Wu
Jianchao Yang
Xiaobing Liu
75
5
0
27 Apr 2021
Functional L-Optimality Subsampling for Massive Data
Hua Liu
Jinhong You
Jiguo Cao
55
4
0
08 Apr 2021
On the Subbagging Estimation for Massive Data
Tao Zou
Xian Li
Xuan Liang
Hansheng Wang
34
4
0
28 Feb 2021
Balance-Subsampled Stable Prediction
Kun Kuang
Hengtao Zhang
Leilei Gan
Yueting Zhuang
Aijun Zhang
OOD
65
3
0
08 Jun 2020
Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators with Massive Data
Jun Yu
Haiying Wang
Mingyao Ai
Huiming Zhang
51
116
0
21 May 2020
Statistical inference in massive datasets by empirical likelihood
Xuejun Ma
Shaochen Wang
Wang Zhou
FedML
13
6
0
18 Apr 2020
Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms
Ping Ma
Xinlian Zhang
Xin Xing
Jingyi Ma
Michael W. Mahoney
99
57
0
24 Feb 2020
Big Data and model-based survey sampling
Deldossi Laura
Tommasi Chiara
19
3
0
11 Feb 2020
Optimal subsampling for quantile regression in big data
Haiying Wang
Yanyuan Ma
139
133
0
28 Jan 2020
Randomized Spectral Clustering in Large-Scale Stochastic Block Models
Hai Zhang
Xiao Guo
Xiangyu Chang
115
24
0
20 Jan 2020
Communication-Efficient Distributed Estimator for Generalized Linear Models with a Diverging Number of Covariates
Ping Zhou
Zhen Yu
Jingyi Ma
M. Tian
Ye Fan
26
6
0
17 Jan 2020
Logistic regression models for aggregated data
Thomas Whitaker
B. Beranger
Scott A. Sisson
36
14
0
09 Dec 2019
Less Is Better: Unweighted Data Subsampling via Influence Function
Zifeng Wang
Hong Zhu
Zhenhua Dong
Xiuqiang He
Shao-Lun Huang
TDI
94
54
0
03 Dec 2019
Least Squares Approximation for a Distributed System
Xuening Zhu
Feng Li
Hansheng Wang
48
56
0
14 Aug 2019
1
2
Next