scientist-softserv / adventist_knapsack

Apache License 2.0
2 stars 0 forks source link

Dropbox related_url through Bulkrax #673

Open KatharineV opened 3 months ago

KatharineV commented 3 months ago

We expect Bulkrax importers to be able to ingest publicly available files from Dropbox if we put the right kind of link into the related_url field in a CSV. In testing this functionality, we used a file and link from Dropbox that we confirmed is publicly available. We opened the link in a browser that wasn't logged into our account, and the URL automatically downloaded the PDF. However, when we ran the CSV through Bulkrax, we saw errors. The work created nicely, the metadata imported correctly, but the fileset failed to attach the PDF to the work.

Kirk talked us through some variations of testing on Slack, so the importer and the work page have been modified since the first error. Kirk gave us a modified URL to try. When we edited the importer, it removed the weird placeholder not-a-pdf thing that was attached to the work. But the importer failed with a TypeError. So that's the state of the work and the importer right now.

Here's a link to our Slack thread: https://samvera.slack.com/archives/C7E4KK8ER/p1716307192307609

Importer: https://adl.b2.adventistdigitallibrary.org/importers/314?locale=en Work page: https://adl.b2.adventistdigitallibrary.org/concern/published_works/car_003001_b_why_i_am_now_a_seventh_day_adventist_and_an_outline_of_the_re?locale=en Dropbox URL: https://www.dropbox.com/scl/fi/wbb1qxgi4352ldz9jjsm4/003001.pdf?rlkey=lxn2qlzp9zh6ozfstp498h5ft&st=b6xhd0u3&dl=1&/003001.pdf