cwrc / RDF-extraction

0 stars 0 forks source link

Investigate place extraction in bibliography #52

Closed alliyya closed 1 year ago

alliyya commented 1 year ago

See https://gitlab.com/calincs/conversion/metadata-conversion/-/issues/202 for more context.

Double check that split_place_parts(place) is working.

alliyya commented 1 year ago

Also need to review date extraction: Only dates with encoding should be extracted: encoding="iso8601"

Unless a significant amount of publications don't have dates with encoding. ex from: 51a0f1ac-a935-4030-af4c-1efa386311e2.xml

January 1756 - June 1817
  <dateIssued type="start"/>
  <dateIssued type="end" encoding="iso8601">1817-06</dateIssued>