WordPress / openverse-catalog

Identifies and collects data on cc-licensed content across web crawl data and public apis.
https://openverse.org
MIT License
59 stars 54 forks source link

Update Freesound to quarterly, extend timeout #1068

Closed stacimc closed 1 year ago

stacimc commented 1 year ago

Fixes

Related to WordPress/openverse#1308 by @AetherUnbound

Description

See discussion starting here on WordPress/openverse#1308 for context. This updates the schedule for Freesound to quarterly, given its extreme duration, and extends the timeout for the pull task to 50 days (from the existing 10).

50 days was chosen because the total count returned by the Freesound API when last checked was 593,447, and we were able to process 152,500 records in 10 days.

When this is merged another run in production can be kicked off starting at page 1150. We should hold off on actually starting the run pending current work on the image data refresh.

Testing Instructions

Check that the properties are updated in the Airflow UI.

Checklist

[best_practices]: https://git-scm.com/book/en/v2/Distributed-Git-Contributing-to-a-Project#_commit_guidelines

Developer Certificate of Origin

Developer Certificate of Origin ``` Developer Certificate of Origin Version 1.1 Copyright (C) 2004, 2006 The Linux Foundation and its contributors. 1 Letterman Drive Suite D4700 San Francisco, CA, 94129 Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Developer's Certificate of Origin 1.1 By making a contribution to this project, I certify that: (a) The contribution was created in whole or in part by me and I have the right to submit it under the open source license indicated in the file; or (b) The contribution is based upon previous work that, to the best of my knowledge, is covered under an appropriate open source license and I have the right under that license to submit that work with modifications, whether created in whole or in part by me, under the same open source license (unless I am permitted to submit under a different license), as indicated in the file; or (c) The contribution was provided directly to me by some other person who certified (a), (b) or (c) and I have not modified it. (d) I understand and agree that this project and the contribution are public and that a record of the contribution (including all personal information I submit with it, including my sign-off) is maintained indefinitely and may be redistributed consistent with this project or the open source license(s) involved. ```