bio-guoda / preston

a biodiversity dataset tracker
MIT License
24 stars 1 forks source link

hot off the press - Elliott et al. Signing data citations enables data verification and citation persistence. Sci Data 10, 419 (2023). https://doi.org/10.1038/s41597-023-02230-y hash://sha256/f849c870565f608899f183ca261365dce9c9f1c5441b1c779e0db49df9c2a19d #250

Closed jhpoelen closed 7 months ago

jhpoelen commented 1 year ago

I am pretty excited to be listed as a co-author on our recent paper [1]. I believe that our paper has some roots in classic 1945 essay "As We May Think" by Vannevar Bush [2] as well as the 1974 Computer Lib/Dream Machines by Ted Nelson [3] - in advocating for a more "intertwingled" approach to recording the complex webs of the knowledge we keep and explore.

Big thanks to @mielliott and many others that have contributed to the development of these (not so new?) ideas, including, but not limited to @btrask @cboettig @seltmann @slint @mbjones @tkuhn, @Daniel-Mietchen and more recently, @GregPost-ASU, @nfranz, and @n8upham. Also, thanks for open infrastructures such as Zenodo @slint (#210 ), The Internet Archive, Software Heritage Library (#70 ) @zacchiro @rdicosmo, DataOne (#181 ) @mbjones, and last but not least, GitHub and their excellent platform to develop open source software/data*.

Please do share your thoughts on the topic of building a media and format agnostic content-addressed knowledge universe.

References

[1] Elliott, M.J., Poelen, J.H. & Fortes, J.A.B. Signing data citations enables data verification and citation persistence. Sci Data 10, 419 (2023). https://doi.org/10.1038/s41597-023-02230-y hash://sha256/f849c870565f608899f183ca261365dce9c9f1c5441b1c779e0db49df9c2a19d [2] Bush, V.. 1945. As We May Think, The Atlantic . See also https://en.wikipedia.org/wiki/As_We_May_Think . [3] Nelson, T. 1972. Computer Lib/Dream Machines. See also https://en.wikipedia.org/wiki/Computer_Lib/Dream_Machines and https://archive.org/details/computer-lib-dream-machines .

* I hope, at some point, that GitHub itself could become a little more open, including having a way to easily re-publish issues in archives or other external platforms (#156 ).

mbjones commented 1 year ago

Congrats @jhpoelen, I read it yesterday and it's a great paper. I've already forward it to several folks. Nice work! 🎉

cboettig commented 1 year ago

Thanks for sharing @jhpoelen , this is wonderful.

Regarding the footnote about GitHub -- where do you see SoftwareHeritage fitting into this? They can already resolve SHA-256 hashes to a substantial fraction of content found on GitHub, as well as archival copies of said content... (Though they do impose rate limiting on their API, for obvious reasons).

jhpoelen commented 1 year ago

Thanks @cboettig !

Regarding the footnote about GitHub -- where do you see SoftwareHeritage fitting into this? They can already resolve SHA-256 hashes to a substantial fraction of content found on GitHub, as well as archival copies of said content... (Though they do impose rate limiting on their API, for obvious reasons).

Software Heritage does a great job in making content in repositories accessible. And, in the footnote, I was referring to valuable content in GitHub issues, not just the content in their versioned repositories. I wonder whether Software Heritage is interested in archiving GitHub issues as well as the digital content in their repositories.

jhpoelen commented 7 months ago

closing publication announcement issue.

[1] Elliott, M.J., Poelen, J.H. & Fortes, J.A.B. Signing data citations enables data verification and citation persistence. Sci Data 10, 419 (2023). https://doi.org/10.1038/s41597-023-02230-y hash://sha256/f849c870565f608899f183ca261365dce9c9f1c5441b1c779e0db49df9c2a19d

is already being cited . . . :tada: