226
v1v2v3v4 (latest)

Another look at Bootstrapping the Student t-statistic

Abstract

Let X, X_1,X_2,... be a sequence of i.i.d. random variables with mean μ=EX\mu=E X. Let v1(n),...,vn(n)n=1{v_1^{(n)},...,v_n^{(n)}}_{n=1}^\infty be vectors of non-negative random variables (weights), independent of the data sequence X1,...,Xnn=1{X_1,...,X_n}_{n=1}^\infty, and put mn=\sumnvi(n)m_n=\sumn v_i^{(n)}. Consider $ X^{*}_1,..., X^{*}_{m_n}$, mn1m_n\geq 1, a bootstrap sample, resulting from re-sampling or stochastically re-weighing a random sample X1,...,XnX_1,...,X_n, n1n\geq 1. Put Xˉn=\sumnXi/n\bar{X}_n= \sumn X_i/n, the original sample mean, and define Xˉmn=\sumnvi(n)Xi/mn\bar{X^*}_{m_n}=\sumn v_i^{(n)} X_i/m_n, the bootstrap sample mean. Thus, XˉmnXˉn=\sumn(vi(n)/mn1/n)Xi\bar{X^*}_{m_n}- \bar{X}_n=\sumn ({v_i^{(n)}}/{m_n}-{1}/{n}) X_i. Put Vn2=\sumn(vi(n)/mn1/n)2V_n^{2}=\sumn ({v_i^{(n)}}/{m_n}-{1}/{n})^2 and let Sn2S_n^{2}, Smn2S_{m_{n}}^{*^{2}} respectively be the the original sample variance and the bootstrap sample variance. The main aim of this exposition is to study the asymptotic behavior of the bootstrapped tt-statistics Tmn:=(XˉmnXˉn)/(SnVn)T_{m_n}^{*}:= (\bar{X^*}_{m_n}- \bar{X}_n)/(S_n V_n) and $T_{m_n}^{**}:= \sqrt{m_n}(\bar{X^*}_{m_n}- \bar{X}_n)/ S_{m_{n}}^{*} $ in terms of conditioning on the weights via assuming that, as n,mnn,m_n\to \infty, max1in(vi(n)/mn1/n)2/Vn2=o(1)\max_{1\leq i \leq n}({v_i^{(n)}}/{m_n}-{1}/{n})^2\big/ V_n^{2}=o(1) almost surely or in probability on the probability space of the weights. This view of justifying the validity of the bootstrap is believed to be new. The need for it arises naturally in practice when exploring the nature of information contained in a random sample via re-sampling, for example. Conditioning on the data is also revisited for Efron's bootstrap weights under conditions on n,mnn,m_n as $n\to \infty $ that differ from requiring mn/nm_n /n to be in the interval (λ1,λ2)(\lambda_1,\lambda_2) with 0< \lambda_1 < \lambda_2 < \infty as in Mason and Shao. Also, the validity of the bootstrapped tt-intervals for both approaches to conditioning is established.

View on arXiv
Comments on this paper