cern-sis / issues-scoap3


create a script to check how many PDFAs are missing from Elsevier and attach them to the records #103

Closed drjova closed 1 year ago

drjova commented 1 year ago
ErnestaP commented 1 year ago

script

no_files_at_all in total: 4; no_pdfas in total: 924

elsevier_no_pdfas_before_update.txt
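For readers without access to the linked script, here is a minimal sketch of how the two counts above could be computed. It is not the script itself. It assumes each record is a dict with a `_files` list whose entries carry a `filetype` key, and that PDF/A files are labelled `"pdf/a"`; both the layout and the label are assumptions, and the sample DOIs are made up.

```python
from collections import Counter

# Assumed filetype label for PDF/A files (hypothetical, not confirmed by the thread).
PDFA = "pdf/a"


def classify(record):
    """Return 'no_files_at_all', 'no_pdfas', or 'ok' for one record dict."""
    files = record.get("_files") or []
    if not files:
        return "no_files_at_all"
    if not any(f.get("filetype") == PDFA for f in files):
        return "no_pdfas"
    return "ok"


def summarize(records):
    """Count how many records fall into each category."""
    return Counter(classify(r) for r in records)


# Illustrative records only; real ones would come from the repository.
sample = [
    {"doi": "10.0000/example.1", "_files": []},
    {"doi": "10.0000/example.2", "_files": [{"filetype": "pdf"}, {"filetype": "xml"}]},
    {"doi": "10.0000/example.3", "_files": [{"filetype": "pdf/a"}, {"filetype": "xml"}]},
]
counts = summarize(sample)
```

Keeping `no_files_at_all` separate from `no_pdfas` mirrors the two totals reported above: a record with no files at all needs a different follow-up than one that only lacks the PDF/A.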

ErnestaP commented 1 year ago

Attaching records with found pdfas

elsevier_records_with_found_pdfas.txt

ErnestaP commented 1 year ago

Updated: 527. Left without update: 395.

List of updated DOIs, all of them from 2017-2019: list_of_updated_elsevier_articles.txt

List of not updated records, from years 2016, 2019, 2022: cannot_find_pdfas.txt

Couldn’t find any files for these four: {u'10.1016/j.nuclphysb.2022.115733': u'68468', u'10.1016/j.nuclphysb.2022.115745': u'68567', u'10.1016/j.physletb.2020.136001': u'58561', u'10.1016/j.physletb.2022.137453': u'72778'}

agentilb commented 1 year ago

Thank you very much! Having checked the list of recent articles with no PDF/A, I see that some have now been published, so we should have the PDF/A for them. Is there a way to know how long after publication the PDF/A is sent to us?

ErnestaP commented 1 year ago

Statistics for the interval between record creation and the date when the pdfa file was received (2020-2022): https://docs.google.com/spreadsheets/d/1u2YsA_EvcgQjugnN3_iGhnI5JdmtW5Fu9lleLdbM_2E/edit?usp=sharing
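As a rough illustration of how such interval statistics could be derived, the sketch below computes the delay in days between a record's creation date and the date its PDF/A arrived. The function names and the sample dates are hypothetical; the real figures are in the spreadsheet linked above.

```python
from datetime import date
from statistics import mean, median


def delay_days(created, pdfa_received):
    """Number of whole days between record creation and PDF/A arrival."""
    return (pdfa_received - created).days


def summarize_delays(pairs):
    """Summarize delays for an iterable of (created, pdfa_received) date pairs."""
    delays = [delay_days(created, received) for created, received in pairs]
    return {
        "count": len(delays),
        "min": min(delays),
        "median": median(delays),
        "mean": mean(delays),
        "max": max(delays),
    }


# Made-up dates for illustration only.
sample_pairs = [
    (date(2021, 1, 1), date(2021, 1, 4)),
    (date(2021, 2, 1), date(2021, 2, 11)),
    (date(2021, 3, 1), date(2021, 3, 3)),
]
stats = summarize_delays(sample_pairs)
```

Reporting the median alongside the mean is a deliberate choice here: a few very late deliveries would otherwise dominate the average.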

drjova commented 1 year ago

Filter the 2022 records and check on Crossref whether each DOI is published (waiting for @agentilb to confirm the field). If it is, add it to a list, and at the end send the list to Anne to check with Elsevier.
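A possible shape for that check, as a sketch only: the Crossref REST API serves metadata at `https://api.crossref.org/works/{doi}`, and the code below treats a DOI as published when the returned metadata has a `published-print` or `published-online` date. Which field to rely on is exactly what is still pending in this thread, so `PUBLISHED_FIELDS` is an assumption, and the helper names are hypothetical. Filtering the records down to 2022 is left to the caller.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

CROSSREF_WORKS = "https://api.crossref.org/works/"

# Assumption: either of these date fields being present means "published".
# The thread has not yet settled which field should be used.
PUBLISHED_FIELDS = ("published-print", "published-online")


def is_published(message):
    """Return True if the Crossref metadata dict carries a publication date field."""
    return any(message.get(field) for field in PUBLISHED_FIELDS)


def fetch_message(doi):
    """Fetch the Crossref metadata for one DOI (performs a network request)."""
    with urlopen(CROSSREF_WORKS + quote(doi), timeout=30) as response:
        return json.load(response)["message"]


def published_dois(dois, fetch=fetch_message):
    """Return the DOIs from `dois` whose metadata looks published.

    `fetch` can be swapped for a stub so the logic is testable offline.
    """
    return [doi for doi in dois if is_published(fetch(doi))]
```

Calling `published_dois(list_of_2022_dois)` would yield the list to forward to Elsevier, where `list_of_2022_dois` stands for whatever the 2022 filter produces.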

agentilb commented 1 year ago

We can close this issue; there is no problem with PDF/A.