hunzikp / velox

https://hunzikp.github.io/velox/
119 stars 23 forks source link

Consider implementing "data-on-disk" operations #26

Open kendonB opened 6 years ago

kendonB commented 6 years ago

In my experience, raster can do anything in acceptable time with in-memory operations on machines with up to 200GB of memory.

What really is sluggish compared to optimal is extracting small subsets of data from very large raster stacks or very high resolution rasters.

When I last dug into the raster code, the main reason seems to be that the loading code was 1) written in an R for loop rather than C++, and 2) the code was loading entire rows into memory then subsetting rather than loading chunks of each row. velox would be a very nice place to optimize these sorts of extraction operations.