Closed coltonflowers1 closed 3 years ago
Thought I should also add that I am working in a Databricks notebook.
Hm, that's weird. How long have you waited while it was hanging?
The easiest thing to try, to start debugging this, is to create a much smaller sample of your data. It's probably easiest to just cut the (copied) jsonl files after a few examples, export them again to much smaller binary files, and try the training again with otherwise the exact same parameters. I'd expect that would run through? (with obviously bad accuracy, but never mind that for now)
Thank you for your reply @svlandeg . If I do the same from a web terminal using the same OS, I don't get the same hanging issue, so I think that this may be an issue with Databrick's notebooks not correctly fetching the results and not a Spacy issue
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
How to reproduce the behaviour
I have been trying to train a blank Spacy pipeline with NER and Transformer components.
First, I do the set-up/installation for pytorch and spacy provided on the embeddings-transformers page with cuda 10.1:
After exporting my prodigy dataset of facebook posts (which each possibly contain more than one sentence) using the data-to-spacy recipe and then converting the resulting json files to .spacy files using:
python -m spacy convert "/dbfs/FileStore/train-data.json" "training"
python -m spacy convert "/dbfs/FileStore/eval-data.json" "training"
I pass the following spacy train command using verbose mode, with full_config.cfg produced by taking the default base cfg for ner with GPU/transformers provided by the Quickstart widget and autofilling using
init fill-config
.-m spacy train /dbfs/FileStore/full_config.cfg --paths.train training/train-data.spacy --paths.dev training/eval-data.spacy --gpu-id 0 --nlp.batch_size 64 -V
and the command hangs after it has printed the "Loading corpus" messages.
with full_config.cfg given here:
Your Environment