aws-samples / awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
MIT No Attribution
205 stars 86 forks source link

Deprecated dataset "c4" #479

Open KeitaW opened 3 weeks ago

KeitaW commented 3 weeks ago
3:   warnings.warn(
2: dataset=c4, name=en
2: /home/ubuntu/.cache/huggingface/modules/datasets_modules/datasets/c4/584d57ebe81c209b6c7f31727066d2c4b4bba37cb7092cdd83083d5ec11207db/c4.py:53: FutureWarning: Dataset 'c4' is deprecated and will be deleted. Use 'allenai/c4' instead.

in FSDP test case.