neo4j-contrib / spatial

Neo4j Spatial is a library of utilities for Neo4j that facilitates spatial operations on data. In particular, you can add spatial indexes to already located data and perform spatial operations on the data, such as searching for data within specified regions or within a specified distance of a point of interest. In addition, classes are provided to expose the data to GeoTools and thereby to GeoTools-enabled applications like GeoServer and uDig.
http://neo4j-contrib.github.io/spatial

improve performance of spatial.import() for large data sets #385

Open jamesdhope opened 3 years ago

jamesdhope commented 3 years ago

I ran an import of the London OSM data from Planet on a 2-core machine with 4 GB of RAM. OSM Nodes and Ways were imported within 3-4 hours. After 30 hours, Relations were still being imported, and in the end I had to kill the process.

It was reassuring that the import ran without running out of memory on such a small machine (possibly by using swap, I'm not sure), but the speed at which Relations were imported was the main issue for us, and it would likely be too slow to maintain a large OSM dataset. One option would be to run this on a larger machine, though further testing would be required to see whether that would be performant enough for our needs.

It would be good to see whether the import routine can be optimised further to improve performance, particularly for OSM Relations.