Open thremilien opened 1 year ago
Same problem, any ideas?
it looks like windows is deleting files that contains js shellcode exploits causing the load to fail.
Same problem, any idea?
Set num_proc = 1 and shut down All Windows Virus & threat protection and Firewall &network protection solved the problem.
Same issue here.
Hello I've an issue while loading my dataset in prepare.py (for obenwebtext). The download and the extraction complete successfully but the generation of train split raise an error.
I've already try to look for the file 0180327-a95f1342cd685fb7d22805aa720870d2.txt in the archive and add it manually to the extracted dataset but it doesn't work. The ignore_verification is False.
If you need more informations I can give you whatever you need
Thanks for your help
Config :