SymbioticLab / FedScale

FedScale is a scalable and extensible open-source federated learning (FL) platform.
https://fedscale.ai
Apache License 2.0
383 stars 119 forks source link

Potential dataset #109

Open mosharaf opened 2 years ago

mosharaf commented 2 years ago

https://aml.engr.tamu.edu/book-dswe/dswe-datasets/

@dywsjtu can you look it up? Most of them are really small, but the 4. Wind Spatio-Temporal Dataset2 can be useful with 200 clients.

We can discuss here.

fanlai0990 commented 1 year ago

+1. We should add MovieLens1M and Yelp for the emerging recommendation task. Please help when you are free (@dywsjtu ). Thanks!

Also, we should plan for cross-silo datasets. Do you think we should add those GDA network b/w and latency traces used in Sol? (@mosharaf ) If so, I can take the system trace part.

mosharaf commented 1 year ago

Definitely, we should keep adding datasets.

As for cross-silo, it should certainly be in the roadmap. Before that however, we need to fix simple things like data format #118