facebookresearch / fastText

Library for fast text representation and classification.
https://fasttext.cc/
MIT License
25.9k stars 4.71k forks source link

fastText for NER #531

Closed PrashantRanjan09 closed 6 years ago

PrashantRanjan09 commented 6 years ago

I read that using fastText embeddings can improve the NER. Can you please tell me how can we use fastText embeddings to get the NERs?

loretoparisi commented 6 years ago

@PrashantRanjan09 can you point us to the referred paper?

sughodke commented 6 years ago

It must be this one. https://arxiv.org/abs/1709.04820

EdouardGrave commented 6 years ago

Hi @PrashantRanjan09,

A standard approach to improve named entity recognition with continuous word representations is to use word embeddings, trained on large amount of unlabeled data, as feature in the NER system. Examples of this approach can be found in the following papers (in addition to the one from @sughodke's comment):

This general idea was previously explored with discrete word representation (based on word clustering) instead of word embeddings:

Best, Edouard.

DavidGOrtega commented 6 years ago

I would recommend this paper also which is the foundation of polyglot ner

https://arxiv.org/pdf/1410.3791.pdf

omerarshad commented 6 years ago

any code available for https://arxiv.org/abs/1709.04820?