VisSieve / main

https://vissieve.github.io/main/documentation/site

apply hpc pdf grabber to princeton url list #13

Closed DevinBayly closed 1 week ago

DevinBayly commented 2 weeks ago

Once Devin has developed the code, apply the HPC version of the PDF grabber to the Princeton URL list.

DevinBayly commented 2 weeks ago

Note that the fix for Playwright not working in the container under Singularity comes down to this: when we install the Playwright browsers we do it as root, so the Chrome binary isn't accessible to the user.
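One way to test the permissions hypothesis above is to check, from inside the container as the normal user, whether the browser files are readable at all. This is a minimal sketch, not code from this repo: the helper names `browsers_dir` and `inaccessible_entries` are made up for illustration, and it assumes Playwright's default Linux browser location (`~/.cache/ms-playwright`, or whatever `PLAYWRIGHT_BROWSERS_PATH` points to).

```python
import os
from pathlib import Path
from typing import List, Mapping, Optional


def browsers_dir(env: Optional[Mapping[str, str]] = None) -> Path:
    """Guess where Playwright keeps its browser binaries on Linux.

    Assumption: the default cache location is used unless
    PLAYWRIGHT_BROWSERS_PATH is set. The special value "0" is not handled.
    """
    env = os.environ if env is None else env
    override = env.get("PLAYWRIGHT_BROWSERS_PATH")
    if override:
        return Path(override)
    return Path.home() / ".cache" / "ms-playwright"


def inaccessible_entries(root: Path) -> List[str]:
    """Return paths under root that the current user cannot read or enter."""
    if not root.exists():
        return [f"{root} (missing)"]
    if not os.access(root, os.R_OK | os.X_OK):
        return [str(root)]
    problems = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames:
            path = os.path.join(dirpath, name)
            # Directories need both read and execute (traverse) permission.
            if not os.access(path, os.R_OK | os.X_OK):
                problems.append(path)
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.access(path, os.R_OK):
                problems.append(path)
    return problems


if __name__ == "__main__":
    root = browsers_dir()
    bad = inaccessible_entries(root)
    print(f"checked {root}: {len(bad)} inaccessible entries")
    for path in bad:
        print("  ", path)
```

An empty result would suggest that permissions on the browser directory are not the problem and the cause lies elsewhere.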

DevinBayly commented 2 weeks ago

Actually this isn't the fix; it's still something else.

But I was able to start with singularity shell -f -B $PWD:/opt/work playwright.sif and then run playwright install, and this appears to persist between runs.
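For scripting that step in a batch job, here is a rough sketch that assembles an equivalent non-interactive command. It makes several assumptions beyond what was tried above: it uses `singularity exec` instead of `singularity shell`, it expects the `playwright` CLI to be on the container's PATH, and the helper name `playwright_install_command` is invented here. Whether an install done through `exec` persists the same way as the interactive one is untested.

```python
import shlex
from pathlib import Path
from typing import List, Optional


def playwright_install_command(
    image: str = "playwright.sif",
    workdir: Optional[str] = None,
    mount_point: str = "/opt/work",
) -> List[str]:
    """Build the argv for running `playwright install` inside the image.

    --fakeroot and --bind are the long forms of the -f and -B flags used
    in the interactive command.
    """
    host_dir = Path.cwd() if workdir is None else Path(workdir)
    return [
        "singularity", "exec",
        "--fakeroot",
        "--bind", f"{host_dir}:{mount_point}",
        image,
        "playwright", "install",
    ]


if __name__ == "__main__":
    # Print the command for review rather than executing it; pass the list
    # to subprocess.run(..., check=True) on a node where singularity exists.
    print(shlex.join(playwright_install_command()))
```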

DevinBayly commented 1 week ago

Got this to work. I was able to run the 2022 works list and got about 2133 PDFs.