ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13160
12
2

The Effect of Points Dispersion on the kkk-nn Search in Random Projection Forests

25 February 2023
Mashaan Alshammari
J. Stavrakakis
Adel F. Ahmed
M. Takatsuka
ArXivPDFHTML
Abstract

Partitioning trees are efficient data structures for kkk-nearest neighbor search. Machine learning libraries commonly use a special type of partitioning trees called kkkd-trees to perform kkk-nn search. Unfortunately, kkkd-trees can be ineffective in high dimensions because they need more tree levels to decrease the vector quantization (VQ) error. Random projection trees rpTrees solve this scalability problem by using random directions to split the data. A collection of rpTrees is called rpForest. kkk-nn search in an rpForest is influenced by two factors: 1) the dispersion of points along the random direction and 2) the number of rpTrees in the rpForest. In this study, we investigate how these two factors affect the kkk-nn search with varying kkk values and different datasets. We found that with larger number of trees, the dispersion of points has a very limited effect on the kkk-nn search. One should use the original rpTree algorithm by picking a random direction regardless of the dispersion of points.

View on arXiv
Comments on this paper