Open cmdlineluser opened 4 weeks ago
Right, field
expansions writes expanded expressions. This is correct, but now we get the effects of calling shuffle
multiple times.
I think we must assign a seed upon shuffle
expression creation to circumvent these problems. We could do that implictly.
Hmm... it is more complicated. As we want to reseed groups in a group by. :thinking:
Checks
Reproducible example
Log output
No response
Issue description
Noticed while experimenting with https://github.com/pola-rs/polars/issues/18233
Using a struct should keep each "row" intact when shuffling:
Adding field extraction seems to break this:
It should be equivalent to
unnest
in this case?Expected behavior
field extraction and unnest should produce the same result?
Installed versions