punkish / bomfim

extract tags from xml files
Creative Commons Zero v1.0 Universal

Comments Variance-on-Attr-Type-treatmentCitationAuthors #9

Open myrmoteras opened 5 years ago

myrmoteras commented 5 years ago

Only this one is from Plazi PDF XML:

<subSubSection type="nomenclature"> | TRUE | 287774

All the rest are from Pensoft:

tag with attr="type" | authors&ranks <subSubSection type="no + | frequency
-- | -- | --
<subSubSection type="authors of description"> | TRUE | 1
<subSubSection type="authors of the description"> | TRUE | 1
<subSubSection type="authors’ contributions"> | TRUE | 1
<subSubSection type="nomenclatorial note"> | TRUE | 1
<subSubSection type="nomenclatorial remark"> | TRUE | 1
<subSubSection type="nomenclatural and taxonomic emendations"> | TRUE | 2
<subSubSection type="nomenclatural and taxonomic remarks"> | TRUE | 3
<subSubSection type="nomenclatural and taxonomical notes"> | TRUE | 2
<subSubSection type="nomenclatural comment"> | TRUE | 2
<subSubSection type="nomenclatural comments"> | TRUE | 5
<subSubSection type="nomenclatural note"> | TRUE | 16
<subSubSection type="nomenclatural remarks"> | TRUE | 11
<subSubSection type="nomenclature citation"> | TRUE | 1
<subSubSection type="nomenclature notes"> | TRUE | 1
<subSubSection type="nomenclature of the type species"> | TRUE | 1
<subSubSection type="nomenclature remarks"> | TRUE | 2
<subSubSection type="nomenclature-citation"> | TRUE | 5
<subSubSection type="non-type materials examined"> | TRUE | 1
<subSubSection type="non-type specimens (not collected) photographed in situ"> | TRUE | 1
<subSubSection type="non-type specimens examined"> | TRUE | 1
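
A frequency table like the one above could be produced with something along these lines. This is only a rough sketch, not how the counts in this issue were actually generated: the function names `normalize_type` and `count_types`, the `<treatment>` root element, and the sample strings are all made up for illustration. It collapses whitespace and lowercases each `type` value before counting, so that variants differing only in spacing or case fall into the same bucket.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter


def normalize_type(value):
    """Collapse runs of whitespace, trim, and lowercase an attribute value."""
    return re.sub(r"\s+", " ", value).strip().lower()


def count_types(xml_strings, tag="subSubSection", attr="type"):
    """Tally the normalized `attr` values of every `tag` element.

    `xml_strings` is an iterable of XML documents given as strings
    (hypothetical input; real files would be read from disk first).
    """
    counts = Counter()
    for text in xml_strings:
        root = ET.fromstring(text)
        for elem in root.iter(tag):
            value = elem.get(attr)
            if value is not None:
                counts[normalize_type(value)] += 1
    return counts


# Made-up sample documents, only to exercise the functions above.
sample = [
    '<treatment><subSubSection type="nomenclature"/>'
    '<subSubSection type="nomenclatural   note"/></treatment>',
    '<treatment><subSubSection type="Nomenclatural note"/></treatment>',
]

counts = count_types(sample)
for value, n in counts.most_common():
    print(f"{value} | {n}")
```

Normalizing this way would not merge the real spelling variants listed above (for example "nomenclatorial note" versus "nomenclatural note"); those would still need an explicit mapping.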