WaxCylinderRevival / frus-dates-project

Project repository for FRUS date extraction and normalization initiative
https://history.state.gov
GNU General Public License v3.0

Find/Convert dates in `postscript` of attachments -> `dateline` #1050

Closed WaxCylinderRevival closed 7 years ago

WaxCylinderRevival commented 7 years ago

RegEx used in the XPath/XQuery Builder:

//div[attribute::type='document'][not(attribute::subtype='editorial-note')]//*[local-name()='attachment'][not(descendant::date)]//postscript[matches(.,'\d{1,2}(st|d|nd|rd|th)?\s+(January|February|March|April|May|June|July|August|September|October|November|December),*\s+\d{4}|(January|February|March|April|May|June|July|August|September|October|November|December)\s+\d{1,2}(st|d|nd|rd|th)?,\s+\d{4}')]
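
To check what the regex above accepts before running it against the volumes, the two alternatives (day-month-year and month-day-year) can be exercised outside the XML database. The following is a minimal sketch in Python rather than XQuery, with the parentheses balanced and named groups added. The names `DATE_RE` and `find_iso_date` are hypothetical, and the ISO `YYYY-MM-DD` output is an assumption about the normalized form, not the project's actual conversion code.

```python
import re
from datetime import date

MONTHS = [
    "January", "February", "March", "April", "May", "June", "July",
    "August", "September", "October", "November", "December",
]
MONTH_ALT = "|".join(MONTHS)

# Same two alternatives as the XPath regex, with named groups added
# (an illustrative change) so the parts of the date can be read back out.
DATE_RE = re.compile(
    # e.g. "3d March, 1865" (comma optional)
    r"(?P<d1>\d{1,2})(?:st|d|nd|rd|th)?\s+(?P<m1>" + MONTH_ALT + r"),*\s+(?P<y1>\d{4})"
    # e.g. "July 4th, 1946" (comma required)
    r"|(?P<m2>" + MONTH_ALT + r")\s+(?P<d2>\d{1,2})(?:st|d|nd|rd|th)?,\s+(?P<y2>\d{4})"
)


def find_iso_date(text):
    """Return the first matching date in `text` as YYYY-MM-DD, or None."""
    m = DATE_RE.search(text)
    if m is None:
        return None
    # Exactly one alternative matched, so one group of each pair is None.
    day = m.group("d1") or m.group("d2")
    month = m.group("m1") or m.group("m2")
    year = m.group("y1") or m.group("y2")
    try:
        return date(int(year), MONTHS.index(month) + 1, int(day)).isoformat()
    except ValueError:
        # The text matched the pattern but is not a real calendar date.
        return None


if __name__ == "__main__":
    for sample in ("Washington, 3d March, 1865.", "P.S. July 4th, 1946", "July 4 1946"):
        print(repr(sample), "->", find_iso_date(sample))
```

Note that the month-day-year alternative requires a comma before the year, so a postscript reading "July 4 1946" is not matched by this pattern.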