IsisCB / IsisCBLegacyDigitization

MIT License

Unrecognised cross-references #45

Open Conal-Tuohy opened 8 years ago

Conal-Tuohy commented 8 years ago

@StephenPWeldon says:

- Cross-references beginning with "FOR ".
- Cross-references found by looking for a short FullCitation (< 69 characters long) containing the word " See ".
- Cross-references with a short FullCitation (< 34 characters long) containing the string ". See".
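The three heuristics above can be sketched as a single predicate. This is only an illustrative Python sketch, not the project's XSLT: the function name `looks_like_cross_reference` is hypothetical, and it assumes the length limits apply to the citation text after surrounding whitespace has been trimmed.

```python
def looks_like_cross_reference(full_citation: str) -> bool:
    """Apply the three heuristics to one FullCitation string.

    Assumption: lengths are measured on the whitespace-trimmed text.
    """
    text = full_citation.strip()

    # Heuristic 1: the entry begins with "FOR ".
    if text.startswith("FOR "):
        return True

    # Heuristic 2: short entry (< 69 characters) containing " See ".
    if len(text) < 69 and " See " in text:
        return True

    # Heuristic 3: very short entry (< 34 characters) containing ". See".
    if len(text) < 34 and ". See" in text:
        return True

    return False


if __name__ == "__main__":
    # Made-up sample entries, for illustration only.
    for sample in (
        "FOR astronomy in China see 153",
        "Alchemy. See Chemistry",
        "Smith, John. A history of chemistry. London: Macmillan, 1923.",
    ):
        print(looks_like_cross_reference(sample), sample)
```

The third heuristic mainly catches entries where "See" is the last word, since those lack the trailing space that the second heuristic requires.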

In vols 4, 5 and 7 there are almost 1000 "FOR" cross-references which are not in bold. They are therefore not captured as cross-references, and are instead mistaken for citations (since "FOR" looks like it could be an author name). Solution: modify recognise-see-cross-reference.xsl to also recognise such references, and tag them so that the citation recogniser will later ignore them.

In vols 1-6 there are almost 1000 cross-references which are NOT in bold (as recognise-see-cross-reference.xsl expects), though they do contain the italicised word "See". This is the same issue as the "FOR" references above, and it has the same solution.
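As a rough illustration of the "tag, then ignore" approach, here is a Python sketch using `xml.etree.ElementTree`. The real fix would live in recognise-see-cross-reference.xsl; the element names `entry` and `i`, the `type="cross-reference"` attribute, and both function names are assumptions made up for this example, not the project's actual markup.

```python
import xml.etree.ElementTree as ET


def tag_unbolded_cross_references(root: ET.Element) -> int:
    """Tag entries that start with "FOR " or contain an italicised "See".

    Returns the number of entries tagged. Bold is deliberately not consulted.
    """
    tagged = 0
    for entry in root.iter("entry"):  # hypothetical element name
        text = "".join(entry.itertext()).strip()
        has_italic_see = any(
            (italic.text or "").strip() == "See" for italic in entry.iter("i")
        )
        if text.startswith("FOR ") or has_italic_see:
            entry.set("type", "cross-reference")  # hypothetical marker
            tagged += 1
    return tagged


def citation_candidates(root: ET.Element) -> list:
    """Entries a later citation recogniser should still look at."""
    return [e for e in root.iter("entry") if e.get("type") != "cross-reference"]


# Made-up sample data, for illustration only.
sample = ET.fromstring(
    "<volume>"
    "<entry>FOR astrology see 1234</entry>"
    "<entry>Alchemy. <i>See</i> Chemistry</entry>"
    "<entry>Smith, J. <i>A history of optics</i>. London, 1923.</entry>"
    "</volume>"
)

if __name__ == "__main__":
    print(tag_unbolded_cross_references(sample))
    print(len(citation_candidates(sample)))
```

Tagging rather than deleting keeps the cross-references in the output, so a later step can still process them separately from citations.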

Probably a couple of hours' work.