Improved -values for discrete uniform and homogeneous tests: a
comparative study
Large scale discrete uniform and homogeneous -values often arise in applications with multiple testing. For example, this occurs in genome wide association studies whenever a nonparametric one-sample (or two-sample) test is applied throughout the gene loci. In this paper we consider -values for such scenarios based on several existing estimators for the proportion of true null hypothesis, , which take the discreteness of the -values into account. The theoretical guarantees of the several approaches with respect to the estimation of and the false discovery rate control are reviewed. The performance of the discrete -values is investigated through intensive Monte Carlo simulations, including location, scale and omnibus nonparametric tests, and possibly dependent -values. The methods are applied to genetic and financial data for illustration purposes too. Since the particular estimator of used to compute the -values may influence the power, relative advantages and disadvantages of the reviewed procedures are discussed. Practical recommendations are given.
View on arXiv