#89 --- RFB gold processing

clamsproject / aapb-annotations

Repository to store manual annotation dataset developed for CLAMS-AAPB collaboration

3 stars 0 forks source link

#89 --- RFB gold processing #91

Open MrSqually opened 1 week ago

MrSqually commented 1 week ago

This PR is to add a process.py file for converting a directory of gold JSON annotations into a single CSV file. This satisfies the first two objectives of #89.

additions

role-filler-binding/process.py
gold files generated from process.py on 231117-aapb-annotations-44 data

discussion before merging

is this a suitable format for the data (newline-separated CSV)?
do we want to discard any of the non-RF data from this batch? do we actually want all these duplicates?

keighrim commented 1 day ago

For the second discussion point, I think we should keep the duplicate labels, since they are also human generated (valuable) data that have potential usages.