Closed skaulintel closed 2 years ago
Hi, I didn't encounter error like this. May be you can try :
requirements.txt
Hi @skaulintel
Recently I updated hf/datasets for another project and found the same problem in a similar setting. After some debugging, I found a potential bug in hf/datasets (https://github.com/huggingface/datasets/pull/3782). And I have modified data processor here by turn disable_nullable
to False here. https://github.com/richarddwang/electra_pytorch/blob/ba35cf6b85ba1c3264c44c2f67e18d46d5e84f52/_utils/utils.py#L124
It should work now. If there is any else I can help, please tag me to reopen the issue.
Hi Richard,
I get the following pyarrow issue when trying to load the openwebtext corpus dataset>
Any ideas?
Best, Shiv