urbanairship / datacube

Multidimensional data storage with rollups for numerical data
http://urbanairship.com
Apache License 2.0

Space efficiency #33

Open drevell opened 12 years ago

drevell commented 12 years ago

We spend a lot of space on storing row keys. We should figure out a way to reduce this. Possibilities:

We'll want to preserve backward compatibility.

eonnen commented 12 years ago

Do we really, though? Won't block-level compression kick in and effectively remove a lot of the redundancy in the keys?
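A rough way to sanity-check that intuition is sketched below. It assumes a made-up key layout (`cube01|hourly|<hour>|<id>`), which is not datacube's actual row key format, and uses `zlib` as a stand-in for whatever codec the store applies per block, so the numbers are only indicative:

```python
import zlib


def make_row_keys(n):
    """Build n sorted row keys that share a long common prefix.

    The layout is hypothetical and only illustrates keys with
    heavy prefix redundancy; it is not datacube's real format.
    """
    prefix = b"cube01|hourly|2012-06-01T00|"
    keys = [prefix + str(i).zfill(8).encode("ascii") for i in range(n)]
    return sorted(keys)


def block_sizes(keys):
    """Return (raw_bytes, compressed_bytes) for one block of keys."""
    raw = b"".join(keys)
    # zlib stands in for the block codec (assumption, not HBase's codec list).
    compressed = zlib.compress(raw)
    return len(raw), len(compressed)


keys = make_row_keys(1000)
raw_size, compressed_size = block_sizes(keys)
print("raw bytes:", raw_size)
print("compressed bytes:", compressed_size)
```

If the compressed size comes out far below the raw size for keys shaped like ours, most of the key overhead is already being absorbed by compression on disk, though not in memory or on the wire.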

drevell commented 12 years ago

Yeah, we can assume HBase compression will remove a lot of the cruft. In theory, though, datacube is not HBase-specific :)

This is something I'll code on my own time rather than UA's time since space consumption isn't a problem for us, and we don't have any plans for non-HBase backends.