Improving Dense Crowd Counting Convolutional Neural Networks using
Inverse k-Nearest Neighbor Maps and Multiscale Upsampling
Gatherings of thousands to millions of people occur frequently for an enormous variety of events, and automated counting of these high density crowds is used for safety, management, and measuring significance of these events. In this work, we show that the regularly accepted labeling scheme of crowd density maps for training deep neural networks is less effective than our alternative inverse k-nearest neighbor (iNN) maps, even when used directly in existing state-of-the-art network structures. We also provide a new network architecture MUD-iNN, which uses multi-scale upsampling via transposed convolutions to take full advantage of the provided iNN labeling. This upsampling combined with the iNN maps further outperforms the existing state-of-the-art methods. The full label comparison emphasizes the importance of the labeling scheme, with the iNN labeling being particularly effective. We demonstrate the accuracy of our MUD-iNN network and the iNN labeling scheme on a variety of datasets.
View on arXiv