Closed frankiejarrett closed 3 years ago
Hello @fjarrett !
Thanks for reporting this issue.
The error is most likely raised by the huggingface_hub dependency that we use to interact with HuggingFace's models and datasets hub. More precisely, I suspect the error is raised from this function in AutoNLP, which clones the project's dataset repo on your machine. Indeed, cloning dataset repos is broken in the latest huggingface_hub release (see this issue).
We just released AutoNLP 0.3.2, which pins an earlier version of the huggingface_hub package (namely, 0.12.0). Would you mind updating AutoNLP and retrying to see if it solves your issue?
pip install -U autonlp
autonlp upload ...
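To confirm that the upgrade actually took effect before retrying the upload, the installed versions can be read from package metadata. This is a minimal sketch using only the standard library; the helper name `installed_version` is made up for illustration and is not part of AutoNLP or huggingface_hub.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    `installed_version` is a hypothetical helper, not an AutoNLP API.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Print what is installed in the current environment for the two
    # packages discussed above.
    for name in ("autonlp", "huggingface_hub"):
        print(f"{name}: {installed_version(name) or 'not installed'}")
```

If the printed autonlp version is older than the release mentioned above, the `pip install -U` step probably ran in a different environment.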
@SBrandeis Your hunch was right, I upgraded to 0.3.2 and the uploads worked. I am training our project models now. Thank you for the assist!
@SBrandeis it looks like our training failed, I don't see an error code/message so not sure how to debug. Any chance someone can take a look and see what could be the issue? We have used this exact dataset on Amazon Comprehend in the past.
📁 training_set_out.csv (id # 385)
• Split: train
• Processing status: ✅ Success!
• Last update: 2021-06-30 13:58 Z
📁 validation_set_out.csv (id # 386)
• Split: valid
• Processing status: ✅ Success!
• Last update: 2021-06-30 13:59 Z
~~~~~~~~~~~~ Models ~~~~~~~~~~~
+----+--------+--------+--------------------+--------------------+
| | ID | Status | Creation date | Last update |
+----+--------+--------+--------------------+--------------------+
| ❌ | 302893 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302894 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302895 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302896 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302897 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302898 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302899 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302900 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302901 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
| ❌ | 302902 | failed | 2021-06-30 14:03 Z | 2021-06-30 14:12 Z |
+----+--------+--------+--------------------+--------------------+
@fjarrett looking into this now!
@SBrandeis thank you! and sorry for the head fake 😸
I had a look at the parameters of your project - the problem is that you're trying to train a model that's not compatible with 🤗 Transformers' TextClassification pipeline.
You can browse the TextClassification-compatible models here: https://huggingface.co/models?pipeline_tag=text-classification
If you want to try out that specific model on your use case, you might want to have a look at 🤗's Inference API if you haven't already: https://huggingface.co/inference-api
Let me know if it helps!
@SBrandeis I tried again, this time using --hub_model textattack/facebook-bart-large-MNLI instead, since that model was tagged for TextClassification, but the training failed again.
@SBrandeis trying again with --hub_model roberta-large-mnli this time 🤞
That might not work either. For text-classification it's best to select models which are not fine-tuned on a downstream task. Try: roberta-large :)
@SBrandeis ok trying that now
@abhishekkrthakur @SBrandeis I tried roberta-large but that also failed. I finally resorted to training without any --hub_model specified and that did work. Could it be that fine-tuning existing hub models is broken right now?
FAILED facebook/bart-large-mnli
FAILED textattack/facebook-bart-large-MNLI
FAILED roberta-large-mnli
FAILED roberta-large
Oh, it seems I was wrong. The roberta-large run did, in fact, succeed. I must have been looking at the wrong project. Thank you both for your help! I learned a lot. 🙌
When attempting to upload a CSV training set for my model I receive a JSONDecodeError. I tried uploading my smaller validation set too, but it also failed. I'm not entirely sure why JSON decoders are even being run against a CSV file.
At first I thought maybe the CSV was invalid, but it checks out. I am not sure how to debug this problem.
Any help is greatly appreciated! Thank you.
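One way to rule out a malformed CSV locally is to parse it strictly and check that every row has as many fields as the header. This is a rough sketch using Python's standard `csv` module; the function name `check_csv` and the `text`/`label` columns in the usage note are hypothetical, since the actual file contents are not shown in this thread.

```python
import csv
import io
from typing import Tuple


def check_csv(text: str) -> Tuple[int, int]:
    """Strictly parse CSV text and return (data_row_count, column_count).

    Raises csv.Error on malformed quoting and ValueError when the input is
    empty or a row's field count differs from the header's.
    """
    # newline="" leaves line endings untouched so the csv module can handle
    # newlines embedded in quoted fields itself.
    reader = csv.reader(io.StringIO(text, newline=""), strict=True)
    header = next(reader, None)
    if header is None:
        raise ValueError("CSV is empty")
    n_rows = 0
    for row in reader:
        if not row:
            # Completely blank lines are skipped rather than treated as errors.
            continue
        if len(row) != len(header):
            raise ValueError(
                f"line {reader.line_num}: expected {len(header)} fields, got {len(row)}"
            )
        n_rows += 1
    return n_rows, len(header)
```

For example, `check_csv('text,label\nhello,pos\n"a, b",neg\n')` returns `(2, 2)`, while a row with a missing field raises `ValueError`. A file that passes this check can still fail to upload for unrelated reasons, as turned out to be the case here.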
Valid CSV
Example CSV data
Upload attempt
Environment