Open willbeason opened 2 weeks ago
Just realized I forgot: there should also be a "source" enum that specifies the actual object the processing was run on (LaTeX, TEI, etc.). That makes it an additional primary key column on all tables.
Yeah, the UUID is the article (aka "work") but a single article can have multiple sources, so we'll need both.
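To make the "we'll need both" point concrete, here is a rough sketch of a composite key built from the work UUID plus the source. The names `Source`, `PaperKey`, `work_id`, and the enum members are assumptions for illustration only; the real enum values and field names aren't settled in this thread.

```python
from dataclasses import dataclass
from enum import Enum
from uuid import UUID


class Source(Enum):
    # Hypothetical members: the thread mentions LaTeX and TEI; "etc." is left open.
    LATEX = "latex"
    TEI = "tei"


@dataclass(frozen=True)
class PaperKey:
    # Hypothetical composite key: neither field is unique on its own.
    work_id: UUID   # identifies the article ("work")
    source: Source  # identifies which object the processing was run on


# Made-up UUID, used only to show two sources for the same work.
work = UUID("12345678-1234-5678-1234-567812345678")
rows = {
    PaperKey(work, Source.LATEX): "results from the LaTeX source",
    PaperKey(work, Source.TEI): "results from the TEI source",
}
```

Keying on the UUID alone would make the second entry overwrite the first, which is why the source has to be part of the key.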
Are you able to map everything to this documentation? https://github.com/softcite/software-mentions/blob/master/doc/annotation_schema.md
Here's the normalized form of the software mentions files. Note that bounding boxes for references must be in their own table since their unique key additionally requires the id of the reference within the paper.
I'll need to get clarity on what some of these fields mean.
Bolded entries form a primary key, possibly composite.
PaperSchema
MentionsSchema
PagesSchema
ReferencesSchema
MentionsBoundingBoxes
ReferencesBoundingBoxes
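As a rough illustration of why reference bounding boxes need their own table, here is a minimal SQLite sketch of how the key might look. Every column name here (`work_id`, `source`, `reference_id`, `box_index`, and the coordinate columns) is an assumption, not the actual schema; `box_index` in particular is a guess at how multiple boxes for one reference would be told apart.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Hypothetical layout: the key extends (work_id, source) with the
    -- id of the reference within the paper.
    CREATE TABLE references_bounding_boxes (
        work_id      TEXT    NOT NULL,
        source       TEXT    NOT NULL,
        reference_id INTEGER NOT NULL,
        box_index    INTEGER NOT NULL,  -- assumed: position of the box within the reference
        page INTEGER, x REAL, y REAL, w REAL, h REAL,
        PRIMARY KEY (work_id, source, reference_id, box_index)
    );
    """
)


def insert_box(work_id, source, reference_id, box_index):
    """Insert one bounding-box row, leaving the coordinates NULL for brevity."""
    conn.execute(
        "INSERT INTO references_bounding_boxes "
        "(work_id, source, reference_id, box_index) VALUES (?, ?, ?, ?)",
        (work_id, source, reference_id, box_index),
    )


insert_box("w1", "tei", 0, 0)    # first reference in the TEI source
insert_box("w1", "tei", 1, 0)    # second reference, same work and source
insert_box("w1", "latex", 0, 0)  # same work, different source

try:
    insert_box("w1", "tei", 0, 0)  # repeats the full key of the first row
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

row_count = conn.execute(
    "SELECT COUNT(*) FROM references_bounding_boxes"
).fetchone()[0]
```

Without `reference_id` in the key, the first two inserts would collide even though they describe different references in the same paper.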