What is the most efficient way to search/index for nearest neighbor/query within radius #528
nawarhalabi
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I am trying to implement two functions:
Here is my ideas for both problems:
a. Using
k_ring
with an (exponentially) increasing radius from target until I find an intersection between the set of hex ids in thek_ring
with the hex ids in the original data. Problem is thatk_ring
gets slower for higher radius. Performance for this was much faster (10 fold on average) withkdtree
orballtree
b. Using multiple indexes with different resolutions: Problem here is that keeping track of all hex ids in different resolutions for hundreds of millions of data is disk space consuming
a. Using
k_ring
with a fixed radius: Same issue as above, I havek_ring
taking in the order of 100s of milliseconds for radius 50 for example. This is too slowb. Using multiple indexes: Same issue as above
IDEA Is using a tree-like index which allows efficient prefix queries on string an option to use with the hex ids?
Happy to hear your thoughts
Beta Was this translation helpful? Give feedback.
All reactions