Open YanLiang1102 opened 6 years ago
For Khaled
Do you need to do this first? Or does it just create the folders? (seems to just be a nice way of setting things up)
"SHOULD I EVER UPDATE THE GLOBAL DATA?" (link is broken). create_tokenizer: what's the "vocab Vocab"?
https://stackoverflow.com/questions/47219639/spacy-2-0-ner-training (might be useful)
Things we need to do:
Train an Arabic language model
Train an Arabic NER model
Using OntoNotes together with the Prodigy data we have, we should be able to get about 66K records of training data. We need to write a customized NER model for Arabic in spaCy and get it trained.
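
As a starting point for combining the two sources, here is a minimal sketch of turning Prodigy-style NER records into the `(text, {"entities": [(start, end, label)]})` tuples used in spaCy 2.x training examples, then merging them with OntoNotes examples. The function names, the file name in the usage note, and the assumption that the OntoNotes data has already been converted to the same tuple shape are all hypothetical. It also assumes each Prodigy record has `text`, `spans` (with `start`, `end`, `label`), and an `answer` field, and that records without `answer` count as accepted.

```python
import json


def load_jsonl(path):
    """Read one JSON object per non-empty line (Prodigy exports JSONL)."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def prodigy_to_spacy(records):
    """Convert Prodigy-style records to (text, {"entities": [...]}) tuples."""
    examples = []
    for rec in records:
        # Assumption: only keep annotations accepted in Prodigy;
        # a missing "answer" field is treated as accepted.
        if rec.get("answer", "accept") != "accept":
            continue
        entities = [
            (span["start"], span["end"], span["label"])
            for span in rec.get("spans", [])
        ]
        examples.append((rec["text"], {"entities": entities}))
    return examples


def merge_examples(*sources):
    """Concatenate example lists, keeping the first copy of each text."""
    seen = set()
    merged = []
    for source in sources:
        for text, annotations in source:
            if text not in seen:
                seen.add(text)
                merged.append((text, annotations))
    return merged
```

Usage would look roughly like `train_data = merge_examples(prodigy_to_spacy(load_jsonl("prodigy_ner.jsonl")), ontonotes_examples)`, where `prodigy_ner.jsonl` and `ontonotes_examples` are placeholders for our actual export and converted OntoNotes data. Dropping duplicate texts is a design choice for this sketch; if the same sentence is annotated differently in both sources we would need to decide which one wins.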
@ahalterman @khaledJabr