twitter-archive / torch-dataset

An extensible and high performance method of reading, sampling and processing data for Torch
Apache License 2.0
76 stars 24 forks source link

IndexCSV Datasets are not partitioned correctly #17

Closed willfrey closed 8 years ago

willfrey commented 8 years ago

I'm trying to split a dataset with 12800 examples across four nodes. Instead of each node receiving 3200 examples, it appears that they receive 0, 12771, 27, and 2, respectively.

Can you help me understand this behavior and try to resolve it?

Thanks.