Thank you for sharing such a large and saturated with annotations collection!
Since the original collection represent a BRAT-formatted document, for the quick-starting cases and work with relations, it might be found in writing an addtional service for parsing and extracting text parts with mentioned relations in it.
To address this limitation, I am writing to contribute and propose a handy and quick solution for a quick extraction of most relations between mentioned objects just within a single command line with the following opensource framework:
Basically, it converts the BRAT-based representation of NEREL collection into jsonl.
Other formats, such as csv or sqlite3, entities masking, are supported and the complete list of the formats could be found here
Proposal for a quick README modification
I hope this is both for the beneficial for a quick application of your collection by other as well as personal interest in maintaining opensource solutions to contribute in studies, based on semantic relations in texts.
Here is an example on how to add the reading info into the README:
[![](https://img.shields.io/badge/AREkit--ss_Compatible-0.23.1-purple.svg)](https://github.com/nicolay-r/arekit-ss#usage)
> π **Update 25 October 2023**: this collection **is now available in [arekit-ss](https://github.com/nicolay-r/arekit-ss)**
> for a [quick sampling](https://github.com/nicolay-r/arekit-ss#usage) of contexts with most subject-object relation mentions with just **single script into
> `JSONL/CSV/SqLite`** including (optional) language transfering π₯ [[Learn more ...]](https://github.com/nicolay-r/arekit-ss#usage)
Which will look as follows:
π Update 25 October 2023: this collection is now available in arekit-ss
for a quick sampling of contexts with most subject-object relation mentions with just single script into
JSONL/CSV/SqLite including (optional) language transfering π₯ [Learn more ...]
Dear resource maintaners,
Thank you for sharing such a large and saturated with annotations collection! Since the original collection represent a BRAT-formatted document, for the quick-starting cases and work with relations, it might be found in writing an addtional service for parsing and extracting text parts with mentioned relations in it. To address this limitation, I am writing to contribute and propose a handy and quick solution for a quick extraction of most relations between mentioned objects just within a single command line with the following opensource framework:
Basically, it converts the BRAT-based representation of NEREL collection into
jsonl
. Other formats, such ascsv
orsqlite3
, entities masking, are supported and the complete list of the formats could be found hereProposal for a quick README modification
I hope this is both for the beneficial for a quick application of your collection by other as well as personal interest in maintaining opensource solutions to contribute in studies, based on semantic relations in texts.
Here is an example on how to add the reading info into the README:
Which will look as follows: