AtlasOfLivingAustralia / biocache-store

Occurrence processing, indexing and batch processing
Other
7 stars 24 forks source link

Record license #374

Closed charvolant closed 4 years ago

charvolant commented 4 years ago
elywallis commented 4 years ago

Hi Doug, comment/query about presentation style for records in Biocache re licences. Museums Victoria supplies a URL as its licence statement instead of just a shorthand "CC-0" - will you be able to account for those? See https://biocache.ala.org.au/occurrences/c0df6f93-977f-4c39-a2a6-654e5d418c2e

Actually I see that iNaturalist looks like it provides the licence as a URL as well so you've probably go that covered https://biocache.ala.org.au/occurrences/8368efa8-78be-41a0-b514-e99e315e2be8

Note also that Museums Victoria assigns a separate licence to any image attached to the record - trying to get across the principle that the data is not copyrightable (hence CC-0) but that the image is (hence CC-BY or CC-BY-NC) What is the image store doing with licence statements - do they just pass through?

charvolant commented 4 years ago

Hi @elywallis

The regular expressions used match a pattern, so CC-0 will match .*(cc|creativecommons)(-)?0.* as will CC0 CreativeCommons-0 etc. We may have to tweak these as special cases come our way. Similarly, http://creativecommons.org/licenses/by-nc/4.0/ will match .*(cc|creativecommons).*by.*nc.*(4\.0)?.*(int)?.* and map it on to CC-BY-NC 4.0 (Int)

We may need to do some additional work on these as other edge cases appear. But the patterns are in a text file.

If the MV stuff comes in as a DwCA, the seperate licence stuff comes in via the multimedia extension and gets passed on through to the image service.