blaze / castra

Partitioned storage system based on blosc. **No longer actively maintained.**
BSD 3-Clause "New" or "Revised" License
153 stars 21 forks source link

Add extend_sequence #40

Closed jcrist closed 9 years ago

jcrist commented 9 years ago

Performs repartitioning of sequence of dataframes by freq.

freq can be either a pandas offset string, or None. If None, extend_sequence will do minimal repartitioning to ensure partitions don't split duplicate indices (i.e. [[1, 2, 3], [3, 4, 5]] -> [[1, 2, 3, 3], [4, 5]]).

Addresses #3, and part of #36.

jcrist commented 9 years ago

Addressed the comments above. Could use another review.

mrocklin commented 9 years ago

Was looking at it just now. It looks good to me