neulab / ExplainaBoard

Interpretable Evaluation for AI Systems
MIT License
360 stars 36 forks source link

New Task: Recognition of General BRAT Span Format #158

Open neubig opened 2 years ago

neubig commented 2 years ago

BRAT is an annotation tool that covers a wide variety of NLP analysis tasks by expressing them as annotation of spans and relations between the spans. This generalized format enables easy handling of a bunch of tasks, such as in edge probing and GLAD. It would be nice to implement this in ExplainaBoard. Here is a roadmap towards doing so, aimed at people who are getting started with ExplainaBoard so it can be a good first issue.

First, it would be a good idea to read the tutorials on implementing new tasks, features, and formats.

A first good step would be to add BRAT format input to NER as a different option in addition to CoNLL. You can start by looking at the NER Loader, and:

Once this is done, we can:

A good pointer for where to do this is the conditional generation, machine translation, and summarization processors. Summarization and machine translation are specific instantiations of the general task of conditional generation, so this can give a template of a general task (span identification) and specific instantiations (NER, frame semantic parsing, chunking, etc.).

divija96 commented 2 years ago

@neubig I'd like to take this up. Please could you add me as an assignee.

pfliu-nlp commented 2 years ago

Hi, @divija96 thanks for your interest, just did it.

woshiyyya commented 2 years ago

Hi @pfliu-nlp ,

@divija96 and I will be working on this issue, can you add me to the assignees as well? Thanks!

neubig commented 2 years ago

↑done