jpgard / federated

Federated learning experiments
MIT License
0 stars 1 forks source link

Generate dataset #7

Closed jpgard closed 4 years ago

jpgard commented 4 years ago

Fixes data-joining issues which occurred (silently) due to improper indexing and joining of different dataframes; these have been resolved and also made explicit (if they do occur) through use of inner rather than left joins.

Additionally, this PR explicitly conducts the sharding of training data into multiple smaller CSV files (100 by default).