Determine the index data structure to be stored in Riak Object

clr / riak_geo

Geospatial indexing built on the Riak Distributed Database

The indices will initially be partitioned and distributed throughout the cluster based on their respective HEALPix pixel IDs. However, that only gets us all the entries in the index at the location and resolution of the pixel, which is essentially a first-order analysis.

A second-order analysis will need to be performed on the entries actually stored in the index to satisfy both range and nearest-neighbor queries.

A naive approach would be to store the entries with their positions as a list, then iterate through them, checking each against a reference point and maintaining a list of matches.
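To make the naive approach concrete, here is a minimal sketch of a linear scan for both query types. The `(key, lat, lon)` entry layout, the function names, and the use of haversine distance in kilometres are assumptions for illustration; nothing here is taken from the riak_geo code.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, assumed for this sketch


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def range_query(entries, ref_lat, ref_lon, radius_km):
    """Linear scan: collect the key of every entry within radius_km of the reference point."""
    matches = []
    for key, lat, lon in entries:
        if haversine_km(ref_lat, ref_lon, lat, lon) <= radius_km:
            matches.append(key)
    return matches


def nearest_neighbor(entries, ref_lat, ref_lon):
    """Linear scan: return the key of the closest entry, or None if there are no entries."""
    best_key, best_dist = None, math.inf
    for key, lat, lon in entries:
        dist = haversine_km(ref_lat, ref_lon, lat, lon)
        if dist < best_dist:
            best_key, best_dist = key, dist
    return best_key


# Hypothetical entries stored in one pixel's index: (key, latitude, longitude)
entries = [("a", 0.0, 0.0), ("b", 0.0, 1.0), ("c", 10.0, 10.0)]
```

Both functions visit every entry once per query, so the cost grows linearly with the number of entries in the pixel.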

That will become extremely inefficient with larger index sizes, since every query has to touch every entry. This could be improved by creating a tree structure that pre-orders the indexed positions to make filtering faster, by partitioning the list and iterating over its sections in parallel (recursively), or both.
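As one possible illustration of the pre-ordering idea, the sketch below keeps entries sorted by latitude and uses binary search to cut the candidate set down before the exact distance check. A sorted list stands in for a real tree here (a k-d tree or R-tree would be the more usual choice); the class name, entry layout, and haversine distance are assumptions, not part of the issue.

```python
import bisect
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, assumed for this sketch
KM_PER_DEG_LAT = math.pi * EARTH_RADIUS_KM / 180.0  # km covered by one degree of latitude


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


class LatSortedIndex:
    """Hypothetical index that pre-orders entries by latitude."""

    def __init__(self, entries):
        # entries: iterable of (key, lat, lon); rows are stored sorted by latitude
        self._rows = sorted((lat, lon, key) for key, lat, lon in entries)
        self._lats = [row[0] for row in self._rows]

    def range_query(self, ref_lat, ref_lon, radius_km):
        # A point within radius_km cannot differ in latitude by more than this band,
        # because great-circle distance is never less than the latitude difference.
        band = radius_km / KM_PER_DEG_LAT + 1e-9  # small slack for float rounding
        lo = bisect.bisect_left(self._lats, ref_lat - band)
        hi = bisect.bisect_right(self._lats, ref_lat + band)
        # Only the rows inside the latitude band get the exact distance check.
        return [
            key
            for lat, lon, key in self._rows[lo:hi]
            if haversine_km(ref_lat, ref_lon, lat, lon) <= radius_km
        ]


index = LatSortedIndex(
    [("a", 0.0, 0.0), ("b", 0.0, 1.0), ("c", 10.0, 10.0), ("d", 1.0, 0.0)]
)
nearby = index.range_query(0.0, 0.0, 150.0)
```

The binary search only narrows by latitude, so entries at similar latitudes but distant longitudes are still checked one by one. A two-dimensional tree would prune on both axes, at the cost of a more involved structure to serialize into the Riak object.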

Another challenge will be how to implement the index such that entries can be updated and/or deleted.
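One way to think about updates and deletes is to key each entry by a stable identifier inside the index, so both become simple key operations. The sketch below is only an assumption about how that might look: the `PixelIndex` class, its methods, and the JSON serialization are hypothetical and do not reflect any existing riak_geo or Riak API.

```python
import json


class PixelIndex:
    """Hypothetical per-pixel index: entries keyed by ID so they can be replaced or removed."""

    def __init__(self, entries=None):
        # Maps entry key -> (lat, lon)
        self._entries = dict(entries or {})

    def upsert(self, key, lat, lon):
        """Insert a new entry or overwrite the position of an existing one."""
        self._entries[key] = (lat, lon)

    def delete(self, key):
        """Remove an entry; return True if it existed, False otherwise."""
        return self._entries.pop(key, None) is not None

    def items(self):
        """Return entries as (key, lat, lon) tuples for scanning."""
        return [(key, lat, lon) for key, (lat, lon) in self._entries.items()]

    def to_json(self):
        """Serialize to a string that could be stored as the object's value."""
        return json.dumps({k: list(v) for k, v in self._entries.items()}, sort_keys=True)

    @classmethod
    def from_json(cls, payload):
        """Rebuild the index from its serialized form."""
        return cls({k: tuple(v) for k, v in json.loads(payload).items()})


pixel_index = PixelIndex()
pixel_index.upsert("a", 1.0, 2.0)
pixel_index.upsert("a", 3.0, 4.0)  # update: same key, new position
pixel_index.upsert("b", 5.0, 6.0)
pixel_index.delete("b")
restored = PixelIndex.from_json(pixel_index.to_json())
```

Because the indices are partitioned by pixel ID, an update that moves an entry into a different pixel would need a delete in one index object and an insert in another. This sketch does not address that case, nor concurrent writes to the same object.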

Determine the index data structure to be stored in Riak Object #4