Optimal Hashing-based Time-Space Trade-offs for Approximate Near Neighbors
September 14, 2016
I will show a new data structure for the Approximate Nearest Neighbor problem for Euclidean and Hamming distances, which has the following benefits:
* It achieves a smooth time-space trade-off, with two extremes being *near-linear* space and *sub-polynomial* query time. * It unifies, simplifies, and improves upon all previous data structures for the problem. * It is optimal in an appropriate restricted model.The data structure can be seen as a combination of Spherical Locality-Sensitive Filtering and *data-dependent* Locality-Sensitive Hashing. Joint work with Alexandr Andoni (Columbia), Thijs Laarhoven (IBM Research Zurich) and Erik Waingarten (Columbia). The preprint is available as http://arxiv.org/pdf/1608.03580v1.pdf .