-
Thanks for sharing.
I want to train a different language model (Hindi).
How did you train your bert-base-italian-* models? Are those steps covered anywhere?
-
Indic
- [x] Hindi - To be done later
- [ ] Gujarati - To be done later
- [ ] Tamil
- [ ] Telegu
- [ ] Bengali
Asian
- [ ] Chinese
- [ ] Korean
- [ ] Japanese
We should be able to add con…
-
I have been using the CLI training on the spacy-nightly versions. They are extremely powerful. There are 2 suggestions from my side:
1. The choice of JSON as the dataset extension doesn't scale wel…
-
To make it easier for the spaCy community to contribute to new languages, I've started adding language data skeletons, including the language class setup and the minimum amount of data required to mak…
-
## Feature description
[Universal Language Model Fine-tuning for Text Classification](https://arxiv.org/pdf/1801.06146.pdf) presented a novel method to fine tune a pre-trained universal language m…
-
## How to reproduce the behaviour
Basically following the tutorial in https://spacy.io/usage/linguistic-features#sbd-component with the addition of calling has_pipe() in a loop:
```
import spacy
f…