marekrei / sequence-labeler

Neural network sequence labeling model
252 stars 74 forks source link

Could you provide some parts of the data? #1

Closed GuangChen2016 closed 6 years ago

GuangChen2016 commented 7 years ago

@marekrei Hello, marekrei. It's very nice work, and I want to reproduce your work. But I have some problem in preparing the required format. So could you please provide some parts of the data for me? Thank you very much.

marekrei commented 7 years ago

Hi @GuangChen2016 I'm in the process of releasing the error detection dataset. It should be online by next week. Until then, the input files should look like this:

This        c
is          c
an          i
sentence    c

In this case the task is error detection and there are two possible labels - c and i. But you can replace that with POS tags, NER tags, etc. Make sure to leave an empty line between sentences. I hope this helps.

GuangChen2016 commented 7 years ago

@marekrei Thank you.

liuyichaosoftware commented 7 years ago

hi marekrei, I would like to reproduce your work, but I have a problem to get the FCE data, could you please give me a help with the data? thank you very much.

marekrei commented 7 years ago

The error detection data is now available here: http://www.ilexir.co.uk/datasets/index.html

You might also be interested in the blog post I wrote about this topic: http://www.marekrei.com/blog/attending-to-characters-in-neural-sequence-labeling-models/

liuyichaosoftware commented 7 years ago

thank you very much~