creativecommons / quantifying

quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain
MIT License
24 stars 34 forks source link

Automating Quantifying the Commons (Google Summer of Code 2024) #123

Closed naishasinha closed 3 months ago

naishasinha commented 3 months ago

Description

This pull request documents all of the comprehensive contributions made towards the completion of Automating Quantifying the Commons during the 2024 Google Summer of Code program.

Context

Quantifying the Commons, an initiative pioneered by the UC Berkeley Data Science Discovery Program, aims to quantify the frequency of open domain and Creative Commons license usage for future accessibility and analysis purposes. Till date, prior advancements for this endeavor did not include automation or integrated reporting, both of which are crucial to minimize the potential for human error and facilitate timely updates — especially for handling substantial data streams. The primary objective of this project was to develop automation software for data collection, processing, and report generation, ensuring that the Quantifying reports are consistently updated and are never three months out-of-date.

For more details on the implementation and deliverables, please refer to my final report.

Checklist

Developer Certificate of Origin

For the purposes of this DCO, "license" is equivalent to "license or public domain dedication," and "open source license" is equivalent to "open content license or public domain dedication."

Developer Certificate of Origin ``` Developer Certificate of Origin Version 1.1 Copyright (C) 2004, 2006 The Linux Foundation and its contributors. 1 Letterman Drive Suite D4700 San Francisco, CA, 94129 Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Developer's Certificate of Origin 1.1 By making a contribution to this project, I certify that: (a) The contribution was created in whole or in part by me and I have the right to submit it under the open source license indicated in the file; or (b) The contribution is based upon previous work that, to the best of my knowledge, is covered under an appropriate open source license and I have the right under that license to submit that work with modifications, whether created in whole or in part by me, under the same open source license (unless I am permitted to submit under a different license), as indicated in the file; or (c) The contribution was provided directly to me by some other person who certified (a), (b) or (c) and I have not modified it. (d) I understand and agree that this project and the contribution are public and that a record of the contribution (including all personal information I submit with it, including my sign-off) is maintained indefinitely and may be redistributed consistent with this project or the open source license(s) involved. ```