scientist-softserv / britishlibrary

Other
3 stars 0 forks source link

SOW: Asynchronous Import of URLs #518

Open jillpe opened 8 months ago

jillpe commented 8 months ago

Summary

SoftServ will move importing of files that come from cloud resources to be asynchronous of the initial processing of the CSV row via Bulkrax. This will likely involve wrapping the original import job with a background job that first fetches the remote resource then begins importing.

Testing Instructions

jillpe commented 8 months ago

SoftServ QA: ✅ Work ingested and file is able to be downloaded Importer

Screenshots ![Image](https://github.com/scientist-softserv/britishlibrary/assets/84697174/ae07193d-05d9-47a7-92de-9679b9761744) ![Image](https://github.com/scientist-softserv/britishlibrary/assets/84697174/0b08aab7-1bb9-4fcc-8a42-ffb4c3efcdf8)
NoraRamsey commented 8 months ago

BL QA: ✅ Work ingested and file is able to be downloaded Importer

grahamjevon commented 8 months ago

Will continue testing once Rory has got AWS buckets ready.

cziaarm commented 7 months ago

Confirmed (by me) that bulkrax import of large file from browse-everything s3 was successful: https://bl.bl-staging.notch8.cloud/concern/articles/2f1325f5-f962-4279-b83b-6dc22ef259db?locale=en

j-basford commented 7 months ago

@NoraRamsey can you test this on Thursday please?

ShanaLMoore commented 6 months ago

@cziaarm Can we close this ticket? If not, what is remaining? cc @kirkkwang

cziaarm commented 6 months ago

@cziaarm Can we close this ticket? If not, what is remaining? cc @kirkkwang

Hi @ShanaLMoore nothing is remaining on this ticket, but because the BL are testing with larger files and these are failing due to (i think) modeshape version in use by fcrepo https://assaydepot.slack.com/archives/C0313NK2LJ0/p1712662620970479?thread_ts=1712156637.197069&cid=C0313NK2LJ0)

I'll make a new ticket for that (#534) and then this one can be moved along