cern-sis / issues-scoap3


Halted articles because arXiv id not recognised #175

Closed agentilb closed 9 months ago

agentilb commented 11 months ago

Hi,

We have a bunch of articles that are halted because the system does not recognise their arXiv IDs. The message is: "Could not determine arXiv category based on id." But the arXiv IDs seem to be fine:

Doi: 10.1103/PhysRevD.108.012003
Doi: 10.1103/PhysRevLett.131.021801
Doi: 10.1103/PhysRevD.108.015011
Doi: 10.1103/PhysRevD.108.014006
Doi: 10.1103/PhysRevD.108.014503
Doi: 10.1103/PhysRevC.108.014906
Doi: 10.1140/epjc/s10052-023-11737-y
Doi: 10.1007/JHEP07(2023)079
Doi: 10.1007/JHEP07(2023)081
Doi: 10.1007/JHEP07(2023)082
Doi: 10.1007/JHEP07(2023)087

Could you have a look?

Thanks!

Anne

ErnestaP commented 11 months ago

These articles are in the repo and have arXiv IDs and categories:

Doi: 10.1103/PhysRevD.108.012003
Doi: 10.1103/PhysRevLett.131.021801
Doi: 10.1103/PhysRevD.108.015011
Doi: 10.1103/PhysRevD.108.014006
Doi: 10.1103/PhysRevD.108.014503
Doi: 10.1103/PhysRevC.108.014906

These are not:

Doi: 10.1140/epjc/s10052-023-11737-y - arxiv id: 2208.07695v2, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2208.07695v2

Doi: 10.1007/JHEP07(2023)079 - arxiv id: 2304.08509, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2304.08509

Doi: 10.1007/JHEP07(2023)081 - arxiv id: 2305.01736, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2305.01736

Doi: 10.1007/JHEP07(2023)082 - arxiv id: 2303.10237, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2303.10237

Doi: 10.1007/JHEP07(2023)087 - arxiv id: 2304.03663, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2304.03663

None of these arXiv IDs are found in the arXiv API, which is where we get the categories from. Could it be that these articles are not yet available through https://export.arxiv.org/api?

For DEVELOPERS: The API looks like this: https://export.arxiv.org/api/query?search_query=id:VALUE_OF_ARXIV_ID
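For anyone who wants to check an ID by hand, here is a minimal sketch of the lookup, not the actual SCOAP3 workflow code. It builds the query URL in the form quoted above and reads the category terms out of the response. The function names are hypothetical, and the sketch assumes the API answers with an Atom feed whose entries carry `category` elements with a `term` attribute.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ARXIV_API_URL = "https://export.arxiv.org/api/query"
# Assumption: the response is an Atom feed in the standard Atom namespace.
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(arxiv_id: str) -> str:
    """Build the query URL in the form search_query=id:VALUE_OF_ARXIV_ID."""
    # safe=":" keeps the colon readable instead of encoding it as %3A.
    query = urllib.parse.urlencode({"search_query": f"id:{arxiv_id}"}, safe=":")
    return f"{ARXIV_API_URL}?{query}"


def parse_categories(feed_xml: str) -> list[str]:
    """Return the category terms of all entries, in order, without duplicates."""
    root = ET.fromstring(feed_xml)
    categories: list[str] = []
    for entry in root.findall("atom:entry", ATOM_NS):
        for category in entry.findall("atom:category", ATOM_NS):
            term = category.get("term")
            if term and term not in categories:
                categories.append(term)
    return categories


def fetch_categories(arxiv_id: str, timeout: float = 30.0) -> list[str]:
    """Query the API for one ID; an empty list means no categories were returned."""
    with urllib.request.urlopen(build_query_url(arxiv_id), timeout=timeout) as response:
        return parse_categories(response.read().decode("utf-8"))
```

Calling `fetch_categories("2304.08509")` needs network access. An empty result would correspond to the situation described above, where the article is visible on arxiv.org but the API returns no entry for it.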

ErnestaP commented 9 months ago

Only one article is still missing:

Doi: 10.1007/JHEP07(2023)079 - arxiv id: 2304.08509, link to workflows
This article can be found on arxiv: https://arxiv.org/abs/2304.08509

I tried to restart the parsing, because the categories are now available in the arXiv API. However, it crashed: https://sentry.siscern.org/scoap3/scoap3/issues/319393/?referrer=alert_email

ErnestaP commented 9 months ago

Fixed: the JSON output was missing the key "categories". With the fix in place, the article is now in the repo: https://repo.scoap3.org/records/80487
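The actual fix is not shown in this thread. As an illustration only, a crash on a missing "categories" key can be avoided by reading the key defensively. The function name and the record shape below are hypothetical.

```python
def get_categories(record: dict) -> list[str]:
    """Read "categories" from a parsed JSON record without raising KeyError."""
    # Hypothetical record shape: a dict that may or may not contain "categories".
    categories = record.get("categories")
    if not categories:
        # Key missing, None, or empty: report "no categories" instead of crashing.
        return []
    if isinstance(categories, str):
        # Tolerate a single category given as a plain string.
        return [categories]
    return list(categories)
```

The caller can then decide whether an empty list should halt the article or trigger a retry later.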