Linear Hashing with guarantees and two-sided Kakeya bounds
We show that a randomly chosen linear map over a finite field gives a good hash function in the sense. More concretely, consider a set and a randomly chosen linear map with taken to be sufficiently smaller than $ |S|$. Let denote a random variable distributed uniformly on . Our main theorem shows that, with high probability over the choice of , the random variable is close to uniform in the norm. In other words, {\em every} element in the range has about the same number of elements in mapped to it. This complements the widely-used Leftover Hash Lemma (LHL) which proves the analog statement under the statistical, or , distance (for a richer class of functions) as well as prior work on the expected largest 'bucket size' in linear hash functions [ADMPT99]. By known bounds from the load balancing literature [RS98], our results are tight and show that linear functions hash as well as trully random function up to a constant factor in the entropy loss. Our proof leverages a connection between linear hashing and the finite field Kakeya problem and extends some of the tools developed in this area, in particular the polynomial method.
View on arXiv