Open vanitu opened 4 years ago
As I understand it, reindexing after sorting may help: df.index = Daru::Index.new(Array.new(df.size) { |i| i })
I'm running into a similar issue that occurs when you remove rows from a dataset using filter before calling group_by - it looks like get_group does not respect non-standard indices on rows, so grouping operations will only work if your rows are indexed the default way (zero-based, consecutive integers). I don't know the Daru internals well, but the issue appears to be here: https://github.com/SciRuby/daru/blob/v0.2.2/lib/daru/core/group_by.rb#L258-L267
The conversion of @context to elements throws away @context's original indices, and references into elements.transpose assume that the indices are the defaults (i.e. 0, 1, 2, 3, ...).
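To make the suspected mechanism concrete, here is a minimal plain-Ruby sketch of the label-versus-position mix-up described above. It does not use Daru and is not the library's actual code: every name in it (labels, rows, groups_by_label, position_of) is hypothetical, and it only models the behaviour I am assuming from reading the linked lines.

```ruby
# Plain-Ruby model of the suspected failure mode; no Daru involved.
# All names below are hypothetical and chosen for illustration only.

# Rows that survived a filter: the remaining row labels are 1, 3 and 4
# (labels 0 and 2 were filtered out), so labels != positions.
labels = [1, 3, 4]
rows   = [["a", 10], ["b", 20], ["a", 30]]

# Grouping remembers, for each key, the row LABELS that belong to it.
groups_by_label = Hash.new { |hash, key| hash[key] = [] }
labels.zip(rows).each { |label, row| groups_by_label[row[0]] << label }
# groups_by_label is now {"a" => [1, 4], "b" => [3]}

# Suspected bug: the rows are converted to a plain array (labels lost),
# and each stored label is then used as if it were an array position.
buggy = groups_by_label["a"].map { |label| rows[label] }
puts buggy.inspect    # [["b", 20], nil]

# Label-aware lookup: translate each label back to its position first.
position_of = labels.each_with_index.to_h   # {1 => 0, 3 => 1, 4 => 2}
correct = groups_by_label["a"].map { |label| rows[position_of[label]] }
puts correct.inspect  # [["a", 10], ["a", 30]]
```

In this model, resetting the labels to 0, 1, 2 makes labels and positions coincide, which would be consistent with the reindexing workaround mentioned earlier in the thread.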
When group_by is applied to a sorted DataFrame, get_group returns the wrong entries.