inspirehep / hepcrawl

Scrapy project for feeds into INSPIRE-HEP
http://inspirehep.net
Other
17 stars 30 forks source link

parsers: create an NLM parser #209

Open szymonlopaciuk opened 6 years ago

szymonlopaciuk commented 6 years ago

Description

This is an implementation of a parser for the NLM format, it takes a very similar approach to the JATS parser which we already have, using LiteratureBuilder to build HEP records.

Related Issue

This is a step towards refreshing the IOP spider (#205)

Motivation and Context

IOP uses NLM format to publish their citation records. Currently the IOP spider uses web-scraping, however we will move to using OAI-PMH and this instead.

Checklist: