A Dimension Reduction Technique to Preserve Nearest Neighbors on High Dimensional Data
Author: Christos Nestor Chachamis
Total Pages: 74
Release: 2020
Dimension reduction techniques are widely used for tasks such as visualization and data pre-processing. In this project, we develop a new dimension reduction method aimed at Approximate Nearest Neighbor Search on high-dimensional data. It uses a deep neural network to map the data to a lower dimension while preserving nearest neighbors and local structure. We evaluate the network on several synthetic and real datasets and compare our method against other dimension reduction techniques, such as t-SNE. Our experimental results show that the method preserves local structure well in both the training and test data. In particular, for most test points, the distance to the predicted nearest neighbor is within 10% of the distance to the actual nearest neighbor. Another advantage of our method is that it can be applied to new, unseen data without refitting the model from scratch.
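The "within 10%" criterion can be made concrete with a small sketch. The code below is one possible reading of that evaluation, not the author's implementation: for each test point it picks the nearest neighbor in the reduced space, measures that neighbor's distance in the original space, and divides by the distance to the true nearest neighbor. The function names (`pairwise_sq_dists`, `nn_distance_ratio`) are hypothetical, and a random Gaussian projection stands in for the deep neural network purely so the example is self-contained.

```python
import numpy as np


def pairwise_sq_dists(a, b):
    """Squared Euclidean distances between every row of a and every row of b."""
    sq = (a * a).sum(axis=1)[:, None] - 2.0 * (a @ b.T) + (b * b).sum(axis=1)[None, :]
    return np.maximum(sq, 0.0)  # clip tiny negatives caused by rounding


def nn_distance_ratio(x_base, x_query, z_base, z_query):
    """Ratio of predicted-neighbor distance to true-neighbor distance.

    x_* hold the original high-dimensional points, z_* their reduced versions.
    Both distances are measured in the ORIGINAL space, so the ratio is >= 1.
    Assumes no query coincides with a base point (true distance > 0).
    """
    d_orig = np.sqrt(pairwise_sq_dists(x_query, x_base))
    d_red = pairwise_sq_dists(z_query, z_base)
    true_nn = d_orig.argmin(axis=1)  # actual nearest neighbor
    pred_nn = d_red.argmin(axis=1)   # neighbor chosen in the reduced space
    rows = np.arange(len(x_query))
    return d_orig[rows, pred_nn] / d_orig[rows, true_nn]


# Synthetic stand-in data: 200 base points and 50 queries in 50 dimensions.
rng = np.random.default_rng(0)
x_base = rng.normal(size=(200, 50))
x_query = rng.normal(size=(50, 50))

# Assumption: a random linear projection to 10 dimensions replaces the
# trained network here; it is only a placeholder reducer.
proj = rng.normal(size=(50, 10)) / np.sqrt(10)
z_base, z_query = x_base @ proj, x_query @ proj

ratios = nn_distance_ratio(x_base, x_query, z_base, z_query)
frac_within_10pct = float(np.mean(ratios <= 1.10))
print(f"fraction of queries within 10% of the true NN distance: {frac_within_10pct:.2f}")
```

Because the reducer is applied to the queries with the same fixed map used for the base points, the sketch also mirrors the point about handling unseen data without refitting. With a trained network, `z_base` and `z_query` would simply be the network's outputs.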