rdmpage / pdf-figure-extraction

Extract figures from born-digital PDFs and render in JATS XML

Memoirs of Museum Victoria missing bits of composite figures, and encoding issues #4

Open rdmpage opened 5 years ago

rdmpage commented 5 years ago

In S1447-25542003006000207 one or more parts of a composite figure are missing, or are treated as separate figures, e.g. Fig. 7 and Fig. 6

rdmpage commented 5 years ago

Also note the encoding problem in the caption, e.g.

Figure 12. Localities for males of G. austrinum sp. nov. (n south), G. extremum sp. nov. (+, far south), G. imber sp. nov. (s, west and south- west), G. rusticum sp. nov. (n, north-central), G. tarkinense sp. nov. (*, north-west) and G. wynyardense sp. nov. (•, north-west), and for all specimens of G. plomleyi sp. nov. (half-filled squares, north-east).

There should be symbols where the caption shows stray letters such as "n" and "s".
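
One possible cause, offered only as a guess: the symbols were set in a dingbat font, and the extractor emitted the raw character codes as Latin letters. The sketch below assumes the font is ZapfDingbats, where "n" encodes a black square and "s" a black up-pointing triangle. The font name, the `decode_run` helper, and the idea that the extractor exposes a font name per text run are all assumptions for illustration and not taken from this repository.

```python
# Partial ZapfDingbats-to-Unicode table (assumption: the caption symbols
# were set in ZapfDingbats; another symbol font would need its own table).
ZAPF_DINGBATS_TO_UNICODE = {
    "l": "\u25CF",  # black circle
    "n": "\u25A0",  # black square
    "s": "\u25B2",  # black up-pointing triangle
    "t": "\u25BC",  # black down-pointing triangle
    "u": "\u25C6",  # black diamond
}


def decode_run(text: str, font_name: str) -> str:
    """Map characters of a text run to Unicode if the run uses a dingbat font.

    `font_name` is a hypothetical per-run attribute; runs in ordinary text
    fonts are returned unchanged.
    """
    if "dingbats" not in font_name.lower():
        return text
    return "".join(ZAPF_DINGBATS_TO_UNICODE.get(ch, ch) for ch in text)
```

With a mapping like this, a run containing "n" in the dingbat font would come out as a square, while the same letter in the body font would be left alone.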