Closed JIBSN closed 2 months ago
The problem is this loop in parse_pubmed_web_tree()
:
https://github.com/titipata/pubmed_parser/blob/15c477a579bda06642a436e56c149fbf89546ba6/pubmed_parser/pubmed_web_parser.py#L135-L140
The loop traverses all references (!) and returns the last DOI.
Instead, the attribute DOI should directly extract the <ELocationID EIdType="doi" ValidYN="Y"></ELocationID>
tag. See https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=pubmed&retmode=xml&id=32145645 for the present document.
Describe the bug
To Reproduce
Expected behavior A clear and concise description of what you expected to happen. The correct doi for PMID 32145645 is DOI: 10.1016/j.ejmech.2020.112186
Screenshots If applicable, add screenshots to help explain your problem.![image](https://github.com/titipata/pubmed_parser/assets/10517288/682cfa79-d7eb-4a4c-ac4d-a709581ee58b)