Closed prashant334 closed 6 years ago
Yes, spaCy's NER (and other models) uses subword features, although it doesn't use a character-based CNN to extract them. Instead, the word vectors are learned by concatenating embeddings of NORM
, PREFIX
, SUFFIX
and SHAPE
lexical attributes. A hidden layer is then used to allow a non-linear combination of the information in these concatenated vectors. The function for this can be found in spacy._ml.Tok2Vec
.
The best reference for this embedding strategy is currently the NER algorithm video: https://www.youtube.com/watch?v=sqDHBH9IjRU
To add to @honnibal's comment above, there's also a section in the API docs that describes the neural network model architecture in more detail: https://spacy.io/api/#nn-model
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Python version: 2.7.6 Platform: Linux-3.16.0-77-generic-x86_64-with-Ubuntu-14.04-trusty spaCy version: 2.0.0a17 Models: en, en_core_web_sm, xx_ent_wiki_sm
during training of model. does charCNN used for capturing morphological features from characters?