wpoa / open-access-media-importer

A tool for harvesting media files from Open Access articles for upload into Wikimedia Commons
http://commons.wikimedia.org/wiki/User:Open_Access_Media_Importer_Bot
24 stars 8 forks source link

CC0-licensed articles at PLOS #8

Open Daniel-Mietchen opened 12 years ago

Daniel-Mietchen commented 12 years ago

Currently, we set permission= {{PLOS}} for all PLOS articles. This defaults to CC BY 2.5, but quite a few PLOS articles are actually CC0, so that should be indicated. Perhaps best to add a license parameter to the PLOS template.

erlehmann commented 11 years ago

Postponed until I see the updated template.

Daniel-Mietchen commented 11 years ago

Just noticed that 10.1371/journal.ppat.1002998 is CC0 but not being processed:

Removing “/home/daniel/.local/share/open-access-media-importer/pmc_doi.sqlite” … done. Input DOIs, delimited by whitespace: Getting PubMed Central IDs for given DOIs … found: 3486900 Downloading “http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=pmc&id=3486900”, saving into directory “/home/daniel/.cache/open-access-media-importer/metadata/raw/pmc_doi” … 100% |##########################################################################################################################################################################################################| Unknown license: This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. /usr/lib/python2.7/dist-packages/sqlalchemy/engine/default.py:463: SAWarning: Unicode type received non-unicode bind param value. param.append(processorskey) “Identification of a Novel Splice Variant Form of the Influenza A Virus M2 Ion Channel with an Antigenically Distinct Ectodomain”: 3 × video/quicktime 1 × image/tiff

Checking MIME types … 100% |##########################################################################################################################################################################################################| Unknown, possibly non-free license: Unknown, possibly non-free license: Unknown, possibly non-free license: Unknown, possibly non-free license:

Daniel-Mietchen commented 11 years ago

The license text that was not recognized above (with "Creative Commons CC0 public domain dedication" as the key element) seems to be roughly as common ( cf. http://www.plosone.org/search/advancedSearch.action?startPage=0&volume=&eLocationId=&id=&filterArticleType=&filterKeyword=&unformattedQuery=%28%28%28%28supporting_information%3Avideo%29+OR+supporting_information%3Amovie%29+OR+supporting_information%3Aaudio%29+OR+supporting_information%3Asound%29+AND+everything%3Acc0&sort=Relevance&pageSize=50 ) as "Creative Commons Public Domain declaration" (cf. http://www.plosone.org/search/advancedSearch.action?startPage=0&volume=&eLocationId=&id=&filterArticleType=&filterKeyword=&unformattedQuery=%28%28%28%28supporting_information%3Avideo%29+OR+supporting_information%3Amovie%29+OR+supporting_information%3Aaudio%29+OR+supporting_information%3Asound%29+AND+everything%3A%22%5C%22Creative+Commons+Public+Domain+declaration%5C%22%22&sort=Relevance&pageSize=50 ).

Daniel-Mietchen commented 11 years ago

An example file from a "Creative Commons Public Domain declaration" article is at http://commons.wikimedia.org/wiki/File:Microglial-Morphology-and-Dynamic-Behavior-Is-Regulated-by-Ionotropic-Glutamatergic-and-GABAergic-pone.0015973.s001.ogv .

An example file from a "Creative Commons CC0 public domain dedication" article is at http://commons.wikimedia.org/wiki/File:Connexin43-Modulates-Cell-Polarity-and-Directional-Cell-Migration-by-Regulating-Microtubule-Dynamics-pone.0026379.s003.ogv .

Daniel-Mietchen commented 11 years ago

To sum up, I think we should search the content of the tags in PLOS articles for "Creative Commons" and "public domain" (irrespective of capitalization), and if both can be found, we put permission= {{PLOS|CC0}} instead of permission= {{PLOS}}. This will not affect the functionality of the {{PLOS}} template as it is now, but I will adapt it such that it will display a CC0 licensing for anything with permission= {{PLOS|CC0}}.

erlehmann commented 11 years ago

Probably fixed as of d02a8b40eaaef2333055a3386c46e174c5024a17 and 4c7ded40e54e89507b1acb10f7c94374c2dd2468.

Daniel-Mietchen commented 10 years ago

CC0 articles at PLOS still seem to be recognized as "unknown, possibly non-free license". Recent examples: 10.1371/journal.pone.0104186 10.1371/journal.pone.0103152