Optimal Data-Dependent Hashing for Approximate Near Neighbors

Type: Article

Publication Date: 2015-06-03

Citations: 185

DOI: https://doi.org/10.1145/2746539.2746553

Download PDF

Abstract

We show an optimal data-dependent hashing scheme for the approximate near neighbor problem. For an n-point dataset in a d-dimensional space our data structure achieves query time O(d ⋅ nρ+o(1)) and space O(n1+ρ+o(1) + d ⋅ n), where ρ=1/(2c2-1) for the Euclidean space and approximation c>1. For the Hamming space, we obtain an exponent of ρ=1/(2c-1). Our result completes the direction set forth in (Andoni, Indyk, Nguyen, Razenshteyn 2014) who gave a proof-of-concept that data-dependent hashing can outperform classic Locality Sensitive Hashing (LSH). In contrast to (Andoni, Indyk, Nguyen, Razenshteyn 2014), the new bound is not only optimal, but in fact improves over the best (optimal) LSH data structures (Indyk, Motwani 1998) (Andoni, Indyk 2006) for all approximation factors c>1.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ Optimal Data-Dependent Hashing for Approximate Near Neighbors 2015 Alexandr Andoni
Ilya Razenshteyn
+ Beyond Locality-Sensitive Hashing 2013 Alexandr Andoni
Piotr Indyk
Huy L. Nguyễn
Ilya Razenshteyn
+ Optimal hashing-based time-space trade-offs for approximate near neighbors 2017 Alexandr Andoni
Thijs Laarhoven
Ilya Razenshteyn
Erik Waingarten
+ Tight Lower Bounds for Data-Dependent Locality-Sensitive Hashing 2015 Alexandr Andoni
Ilya Razenshteyn
+ Optimal Hashing-based Time-Space Trade-offs for Approximate Near Neighbors 2017 Alexandr Andoni
Thijs Laarhoven
Ilya Razenshteyn
Erik Waingarten
+ PDF Chat Beyond Locality-Sensitive Hashing 2013 Alexandr Andoni
Piotr Indyk
Huy L. Nguyễn
Ilya Razenshteyn
+ Fast Locality-Sensitive Hashing for Approximate Near Neighbor Search. 2017 Tobias Christiani
+ Fast Locality-Sensitive Hashing Frameworks for Approximate Near Neighbor Search 2017 Tobias Christiani
+ Fast Locality-Sensitive Hashing Frameworks for Approximate Near Neighbor Search 2017 Tobias Christiani
+ A Survey on Locality Sensitive Hashing Algorithms and their Applications 2021 Omid Jafari
Preeti Maurya
Parth Nagarkar
Khandker Mushfiqul Islam
Chidambaram Crushev
+ Low-quality dimension reduction and high-dimensional Approximate Nearest Neighbor 2014 Evangelos Anagnostopoulos
Ioannis Z. Emiris
Ioannis Psarros
+ Practical linear-space Approximate Near Neighbors in high dimension 2016 Georgia Avarikioti
Ioannis Z. Emiris
Ioannis Psarros
Georgios Samaras
+ A Survey on Locality Sensitive Hashing Algorithms and their Applications. 2021 Omid Jafari
Preeti Maurya
Parth Nagarkar
Khandker Mushfiqul Islam
Chidambaram Crushev
+ Practical linear-space Approximate Near Neighbors in high dimension. 2016 Georgia Avarikioti
Ioannis Z. Emiris
Ioannis Psarros
Georgios Samaras
+ Reverse Nearest Neighbors Search in High Dimensions using Locality-Sensitive Hashing 2010 David Arthur
Steve Oudot
+ PDF Chat Data-Dependent LSH for the Earth Mover's Distance 2024 Rajesh Jayaram
Erik Waingarten
Tian Zhang
+ PDF Chat Fast Locality-Sensitive Hashing Frameworks for Approximate Near Neighbor Search 2019 Tobias Christiani
+ Data-Dependent LSH for the Earth Mover’s Distance 2024 Rajesh Jayaram
Erik Waingarten
Tian Zhang
+ Low-quality dimension reduction and high-dimensional Approximate Nearest Neighbor ∗ 2015 Evangelos Anagnostopoulos
Ioannis Z. Emiris
Ioannis Psarros
+ A Refined Analysis of LSH for Well-dispersed Data Points 2016 Wenlong Mou
Liwei Wang