Bayesian Posteriors for Small Multinomial Probabilities

Each period, either a blue die or a red die is tossed. The two dice land on side with unknown probabilities and , which can be arbitrarily low. Given a data-generating process where , we are interested in how much data is required to guarantee that with high probability the observer's Bayesian posterior mean for exceeds times that for . If the prior densities for the two dice are positive on the interior of the probability simplex and behave like power functions at the boundary, then for every there exists so that the observer obtains such an inference after periods with probability at least whenever . The condition on and is the best possible. The result can fail if one of the prior densities converges to zero exponentially fast at the boundary.
View on arXiv