XuezheMax / LasagneNLP

NLP tools on Lasagne
Apache License 2.0
61 stars 31 forks source link

Data format for NER task #8

Closed khoaipx closed 7 years ago

khoaipx commented 7 years ago

Thank you so much for making this code available. Could you teach me about data format for NER task? (data/conll2003/eng.train.conll-eng.dev.conll-eng.test.conll aren't pushed in this repo.)

XuezheMax commented 7 years ago

Hi, Here is an instance in the data

After IN I-PP O the DT I-NP O frustration NN I-NP O of IN I-PP O seeing VBG I-VP O the DT I-NP O opening VBG I-NP O day NN I-NP O of IN I-PP O their PRP$ I-NP O match NN I-NP O badly RB I-ADVP O affected VBN I-VP O by IN I-PP O the DT I-NP O weather NN I-NP O , , O O Kent NNP I-NP I-ORG stepped VBD I-VP O up RP I-PRT O a DT I-NP O gear NN I-NP O to TO I-VP O dismiss VB I-VP O Nottinghamshire NNP I-NP I-ORG for IN I-PP O 214 CD I-NP O . . O O

Due to the licence, I cannot upload the data to the repo.

On Tue, Feb 28, 2017 at 2:33 AM, khoaipx notifications@github.com wrote:

Thank you so much for making this code available. Could you teach me about data format for NER task? (data/conll2003/eng.train. conll-eng.dev.conll-eng.test.conll aren't pushed in this repo.)

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/XuezheMax/LasagneNLP/issues/8, or mute the thread https://github.com/notifications/unsubscribe-auth/ADUtlgw7Ho4oWrXQSyjBdC00JLOUoxM5ks5rg83DgaJpZM4MODza .

--

Best regards, Ma,Xuezhe Language Technologies Institute, School of Computer Science, Carnegie Mellon University Tel: +1 206-512-5977

khoaipx commented 7 years ago

Thank you so much for reply. I know about limit of the dataset's licence.