clamsproject / aapb-annotations

Repository to store manual annotation dataset developed for CLAMS-AAPB collaboration

`aapb-collaboration-7` batch contains *way* more GUIDs than actual annotations #79

Closed keighrim closed 4 months ago

keighrim commented 4 months ago

Bug Description

I'm not sure this is a bug, but I found that there are ~4k GUIDs in the batch, while the slate annotation has only been done on ~850 of those videos. To be clear, january-slates is the only annotation project in which the batch is used.

This thread is to discuss future plans for the batch. Specifically,

  1. Do we want to continue "slate" annotation on the same batch?
  2. Do we want to re-use the batch (as a whole, 4k videos) in another concretely planned annotation project in the foreseeable future?

If the answer is no & no, I think we can just reduce the batch file to those ~850 lines.
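
For reference, the reduction could look roughly like the sketch below. It rests on two assumptions that this issue does not confirm: each gold file is named `<GUID>.tsv`, and each line of the batch file is exactly one GUID. The function name `filter_batch` is hypothetical and only for illustration.

```shell
# Sketch only: print the batch lines whose GUID has a gold annotation file.
# Assumption: gold files are named <GUID>.tsv.
# Assumption: each batch line is exactly one GUID (matched as a whole line).
filter_batch() {
    batch_file="$1"
    golds_dir="$2"
    guid_list=$(mktemp)

    # Collect the GUIDs of annotated videos from the gold file names.
    for f in "$golds_dir"/*.tsv; do
        [ -e "$f" ] || continue   # skip the unexpanded glob when no .tsv exists
        basename "$f" .tsv
    done > "$guid_list"

    # -F: fixed strings, -x: whole-line match, -f: read patterns from a file.
    grep -F -x -f "$guid_list" "$batch_file"

    rm -f "$guid_list"
}
```

Something like `filter_batch batches/aapb-collaboration-7.txt january-slates/golds > reduced.txt` would then write the reduced list to a separate file, which can be checked with `wc -l` before replacing the original batch file.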

Reproduction steps

$ wc -l batches/aapb-collaboration-7.txt 
4019 aapb-collaboration-7.txt

$ ls january-slates/golds/*.tsv | wc -l 
848

Additional context

Original source of the batch compilation: https://github.com/clamsproject/aapb-collaboration/issues/7