plazi / GoldenGATE-Imagine

A GUI Tool For Freeing Text and Data from PDF Documents
Other
5 stars 0 forks source link

include an XLS supplementary material into a DWCA #42

Open myrmoteras opened 1 year ago

myrmoteras commented 1 year ago

@gsautter what do you suggest to include this table (S1) into the DWCA of this article?

Article Tables.S1-S52.xlsx

s41559-023-02041-9.pdf

gsautter commented 1 year ago

Well, considering that the article proper doesn't really contain any treatments at all, and that said XLS has 50+ tabs, I think how to turn the XLS into a DwCA all on its own is the more sensible question to ask ... maybe an IPT might be able to pull that off, not sure, never used this thing.

This is definitely beyond the "How to include supplementary tables with an article" kind of question, as it's more like a vast amount of data with a tiny article written on top of it (like so often in Nature).

myrmoteras commented 1 year ago

This is exactly what we discussed at GBIF in Copenhagen. There is only S1 which is relevant here with the 2000+ specimens that is relevant.

There is also in one of the sheets - Joe seems to have found it - where the tree is in form of a nexus file that we want to include in the DWCA.

Also, each of the material citation is linked to a tip in the phylogeny - which is a very important information only we can provide at this moment - and we should make it happen. So it is opening up a specimen to the tree.

This is a very relevant piece of work, and if we can get it done, would be very helpful.

It is not in PMC so this will be another interesting example for biodiversityPMC if we get it into JATS so we can import into SIBiLS.

gsautter commented 1 year ago

Description of tree format: https://en.wikipedia.org/wiki/Newick_format