jhpoelen / hmw

(experimental) Machine readable version of Handbook of the Mammals of the World
https://jhpoelen.nl/hmw/
Creative Commons Zero v1.0 Universal
5 stars 2 forks source link

split hmw.csv into separate volumes for ease of processing #6

Closed jhpoelen closed 1 year ago

jhpoelen commented 2 years ago

@ajacsherman and @n8upham suggested to split the hmw into separate files per volume.

Here's the unique docOrigins as provided by Plazi's transcriptions. Note that volume 9 has various spelling variations. @myrmoteras are variations in docOrigins descriptions expected? Should I use a different field to partition the corpus by?

$ cat hmw.json | jq --raw-output .docOrigin | sort | uniq -c | sort -nr
   1711 Handbook of the Mammals of the World – Volume 7 Rodents II, Barcelona: Lynx Edicions
   1233 Handbook of the Mammals of the World – Volume 9 Bats, Barcelona: Lynx Edicions
    845 Handbook of the Mammals of the World – Volume 6 Lagomorphs and Rodents I, Barcelona: Lynx Edicions
    635 Handbook of the Mammals of the World – Volume 8 Insectivores, Sloths and Colugos, Barcelona: Lynx Edicions
    493 Handbook of the Mammals of the World – Volume 3 Primates, Barcelona: Lynx Edicions
    374 Handbook of the Mammals of the World – Volume 5 Monotremes and Marsupials, Barcelona: Lynx Edicions
    362 Handbook of the Mammals of the World – Volume 2 Hoofed Mammals, Barcelona: Lynx Edicions
    259 Handbook of the Mammals of the World – Volume 1 Carnivores, Barcelona: Lynx Edicions
    140 Handbook of the Mammals of the World – Volume 4 Sea Mammals, Barcelona: Lynx Edicions
    110 Handbook of the Mammals of the World – Volume 9 Bats, Lynx Edicions
     54 Handbook of the Mammals of the World – Volume 9 Bats, Lyny Edicions
      1 Handbook of the Mammals of the World, Vol. 9, Lyny Edicions

without records count:

$ cat hmw.json | jq --raw-output .docOrigin | sort | uniq
Handbook of the Mammals of the World, Vol. 9, Lyny Edicions
Handbook of the Mammals of the World – Volume 1 Carnivores, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 2 Hoofed Mammals, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 3 Primates, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 4 Sea Mammals, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 5 Monotremes and Marsupials, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 6 Lagomorphs and Rodents I, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 7 Rodents II, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 8 Insectivores, Sloths and Colugos, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 9 Bats, Barcelona: Lynx Edicions
Handbook of the Mammals of the World – Volume 9 Bats, Lynx Edicions
Handbook of the Mammals of the World – Volume 9 Bats, Lyny Edicions
flsimoes commented 2 years ago
110 Handbook of the Mammals of the World – Volume 9 Bats, Lynx Edicions
     54 Handbook of the Mammals of the World – Volume 9 Bats, Lyny Edicions
      1 Handbook of the Mammals of the World, Vol. 9, Lyny Edicions

I fixed the entry that was causing those 54 issues named as "Lyny" However, I have now clue where the third row is coming from.

jhpoelen commented 2 years ago

@flsimoes thanks!

Here's the provenance of the third entry -

{
  "http://www.w3.org/ns/prov#wasDerivedFrom": "zip:hash://sha256/15afc61b5914e4b6899b43a5eda355497d5c5d0c4b401c884c8056f418c9665b!/treatments-xml-main/data/03/D5/87/03D587F2FFC94C03F8F13AECFBD8F765.xml",
  "http://www.w3.org/1999/02/22-rdf-syntax-ns#type": "application/plazi+xml",
  "docId": "03D587F2FFC94C03F8F13AECFBD8F765",
  "docName": "hbmw-9.emballorunidae.pdf.imd",
  "docOrigin": "Handbook of the Mammals of the World, Vol. 9, Lyny Edicions",
  "docMasterId": "hash://md5/ffecff8affcf4c04ffa53577fff8ffe9",
  "docISBN": "978-84-16728-19-0",
  "docPageNumber": "355",
  "verbatimText": "14 . Troughton’s Sheath-tailed Bat Taphozous troughtoni French : Taphien deTroughton I German : Troughton-Grabfledermaus I Spanish : Tafozo de Troughton Other common names: Troughton'sTomb Bat Taxonomy . Taphozous troughtoni Tate, 1952 , “ Rifle Creek, Mt. Isa, northwest Queensland ,” Australia . Taphozous troughtoni is in the subgenus Taphozous . It was considered ajunior synonym of T georgianus , but. T. Chimimba and D. J. Kitchener in 1991 raised it to a distinct species. Monotypic. Distribution. NE Australia endemic, in WC, C & E Queensland. Descriptive notes. Head-body 79-4-86-3 mm, tail 31-5-36-9 mm, ear 22-4-27-1 mm, hindfoot 9-8-10-3 mm, forearm 73-76 mm; weight. 20-29 g. Dorsum of Troughton’s Sheath-tailed Bat is predominately olive­ brown, with pale mouse-gray guard hairs. Venter surface hairs are olive-brown from chin to shoulders and posteriorly dark yellow-brown, with guard hairs of pale mouse-gray. Uropatagium close to abdomen is heavily furred. Throat pouches are absent, and radio-metacarpal sacs are present in both sexes. Skin of rhinarium, wings, uropatagium, lips, face, and tragus are fuscous (pale yellow). Habitat . Wide variety of habitats and bioregions of interior Queensland. Food and Feeding . Troughton’s Sheath-tailed Bats forage for insects well above tree canopies and high over open habitats. Large, high-flying grasshoppers are preferred food items and often taken back to cave roosts to eat. Breeding . No information. Activity patterns. Troughton’s Sheath-tailed Bat roosts in caves, mines and tunnels, rock crevices, and rocky escarpments. Echolocation call is less than 25 kHz and distinguishes it from the Common Sheath-tailed Bat (. georgianus ) where they co-occur . Movements, Home range and Social organization. Large colonies of Troughton’s Sheath-tailed Bat can be found in landscapes with abundant rocky outcrops, especially in tower karst. Colony size might be limited by roosting structures, especially in more arid areas where there are few caves deep enough to support large colonies. Status and Conservation . Classified as Least Concern on TheIUCNRed List. Troughton’s Sheath-tailed Bat has a large distribution and presumably large and stable overall population, uses a wide variety of habitats, occurs in protected areas, and does not face significant threats. It was originally recorded only from a small area in the Mount Isa Inland bioregion of Queensland, but recent studies based on isozymes and echolocation calls extend distribution further east throughout much of interior and near coastal region of central Queensland, formerly attributed to the Common Sheathtailed Bat. Recent reports of absence of Troughton’s Sheath-tailed Bat in western parts of its distribution require additional verification, possibly leading to re-evaluation of its conservation status after taxonomic issues are clarified. Bibliography. Chimimba & Kitchener (1991), Hall (2008b), McKean & Price (1967), Reardon & Thomson (2002), Tate (1952),Thomson eta /. (2001), Woinarski eta/. (2014).",
  "taxonomy": "Taphozous troughtoni Tate, 1952 , “ Rifle Creek, Mt. Isa, northwest Queensland ,” Australia . Taphozous troughtoni is in the subgenus Taphozous . It was considered ajunior synonym of T georgianus , but. T. Chimimba and D. J. Kitchener in 1991 raised it to a distinct species. Monotypic.",
  "commonNames": "Troughton’s Sheath-tailed Bat @en | Taphien de Troughton @fr | Troughton-Grabfledermaus @de | Tafozode Troughton @es | Troughton's Tomb Bat @en",
  "interpretedAuthorityName": "Tate",
  "interpretedAuthorityYear": "1952",
  "interpretedClass": "Mammalia",
  "interpretedFamily": "Emballonuridae",
  "interpretedGenus": "Taphozous",
  "interpretedKingdom": "Animalia",
  "interpretedOrder": "Chiroptera",
  "interpretedPageId": "6",
  "interpretedPageNumber": "355",
  "interpretedPhylum": "Chordata",
  "interpretedRank": "species",
  "interpretedSpecies": "troughtoni",
  "name": "Taphozous troughtoni",
  "subspeciesAndDistribution": "NE Australia endemic, in WC, C & E Queensland.",
  "distributionImageURL": "https://zenodo.org/record/3747930/files/figure.png",
  "bibliography": "Chimimba & Kitchener (1991) | Hall (2008b) | McKean & Price (1967) | Reardon & Thomson (2002) | Tate (1952) | Thomson et al. (2001) | Woinarski et al. (2014)",
  "foodAndFeeding": "Troughton’s Sheath-tailed Bats forage for insects well above tree canopies and high over open habitats. Large, high-flying grasshoppers are preferred food items and often taken back to cave roosts to eat.",
  "breeding": "No information.",
  "activityPatterns": "Troughton’s Sheath-tailed Bat roosts in caves, mines and tunnels, rock crevices, and rocky escarpments. Echolocation call is less than 25 kHz and distinguishes it from the Common Sheath-tailed Bat (. georgianus ) where they co-occur .",
  "movementsHomeRangeAndSocialOrganization": "Large colonies of Troughton’s Sheath-tailed Bat can be found in landscapes with abundant rocky outcrops, especially in tower karst. Colony size might be limited by roosting structures, especially in more arid areas where there are few caves deep enough to support large colonies.",
  "statusAndConservation": "Classified as Least Concern on TheIUCNRed List. Troughton’s Sheath-tailed Bat has a large distribution and presumably large and stable overall population, uses a wide variety of habitats, occurs in protected areas, and does not face significant threats. It was originally recorded only from a small area in the Mount Isa Inland bioregion of Queensland, but recent studies based on isozymes and echolocation calls extend distribution further east throughout much of interior and near coastal region of central Queensland, formerly attributed to the Common Sheathtailed Bat. Recent reports of absence of Troughton’s Sheath-tailed Bat in western parts of its distribution require additional verification, possibly leading to re-evaluation of its conservation status after taxonomic issues are clarified.",
  "descriptiveNotes": "Head-body 79-4-86-3 mm, tail 31-5-36-9 mm, ear 22-4-27-1 mm, hindfoot 9-8-10-3 mm, forearm 73-76 mm; weight. 20-29 g. Dorsum of Troughton’s Sheath-tailed Bat is predominately olive­ brown, with pale mouse-gray guard hairs. Venter surface hairs are olive-brown from chin to shoulders and posteriorly dark yellow-brown, with guard hairs of pale mouse-gray. Uropatagium close to abdomen is heavily furred. Throat pouches are absent, and radio-metacarpal sacs are present in both sexes. Skin of rhinarium, wings, uropatagium, lips, face, and tragus are fuscous (pale yellow).",
  "habitat": "Wide variety of habitats and bioregions of interior Queensland."
}

with

$ preston cat 'zip:hash://sha256/15afc61b5914e4b6899b43a5eda355497d5c5d0c4b401c884c8056f418c9665b!/treatments-xml-main/data/03/D5/87/03D587F2FFC94C03F8F13AECFBD8F765.xml'

producing:

<document ID-DOI="http://doi.org/10.5281/zenodo.3740269" ID-GBIF-Dataset="34e4a5a6-881c-4a1a-b64d-6868e12df7fb" ID-GBIF-Taxon="163435150" ID-ISBN="978-84-16728-19-0" ID-Zenodo-Dep="3740269" approvalRequired="518" approvalRequired_for_document="1" approvalRequired_for_originalDoi="1" approvalRequired_for_taxonomicNames="152" approvalRequired_for_textStreams="311" approvalRequired_for_treatments="53" checkinTime="1586037971521" checkinUser="plazi" docAuthor="Bonaccorso, Frank" docDate="2019" docId="03D587F2FFC94C03F8F13AECFBD8F765" docLanguage="en" docName="hbmw-9.emballorunidae.pdf.imd" docOrigin="Handbook of the Mammals of the World, Vol. 9, Lyny Edicions" docStyle="DocumentStyle{}" docTitle="Taphozous troughtoni Tate 1952" docType="treatment" docVersion="11" lastPageId="7" lastPageNumber="5" masterDocId="FFECFF8AFFCF4C04FFA53577FFF8FFE9" masterDocTitle="Family Emballonuridae (Sheath-tailed Bats)" masterLastPageNumber="373" masterPageNumber="350" pageId="6" pageNumber="355" updateTime="1637932782092" updateUser="ExternalLinkService">
<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
<mods:titleInfo>
<mods:title>Family Emballonuridae (Sheath-tailed Bats)</mods:title>
</mods:titleInfo>
<mods:name type="personal">
<mods:role>
<mods:roleTerm>Author</mods:roleTerm>
</mods:role>
<mods:namePart>Bonaccorso, Frank</mods:namePart>
</mods:name>
<mods:typeOfResource>text</mods:typeOfResource>
<mods:relatedItem type="host">
<mods:originInfo>
<mods:dateIssued>2019</mods:dateIssued>
<mods:dateOther type="pubDate">2019-10-31</mods:dateOther>
<mods:publisher>Lyny Edicions</mods:publisher>
</mods:originInfo>
<mods:name type="personal">
<mods:role>
<mods:roleTerm>Editor</mods:roleTerm>
</mods:role>
<mods:namePart>Wilson, Don E.</mods:namePart>
</mods:name>
<mods:name type="personal">
<mods:role>
<mods:roleTerm>Editor</mods:roleTerm>
</mods:role>
<mods:namePart>Mittermeier, Russel A.</mods:namePart>
</mods:name>
<mods:titleInfo>
<mods:title>Handbook of the Mammals of the World, Vol. 9</mods:title>
</mods:titleInfo>
<mods:part>
<mods:extent unit="page">
<mods:start>350</mods:start>
<mods:end>373</mods:end>
</mods:extent>
</mods:part>
</mods:relatedItem>
<mods:classification>book chapter</mods:classification>
<mods:identifier type="DOI">http://doi.org/10.5281/zenodo.3740269</mods:identifier>
<mods:identifier type="GBIF-Dataset">34e4a5a6-881c-4a1a-b64d-6868e12df7fb</mods:identifier>
<mods:identifier type="ISBN">978-84-16728-19-0</mods:identifier>
<mods:identifier type="Zenodo-Dep">3740269</mods:identifier>
</mods:mods>
<treatment ID-GBIF-Taxon="163435150" LSID="urn:lsid:plazi:treatment:03D587F2FFC94C03F8F13AECFBD8F765" httpUri="http://treatment.plazi.org/id/03D587F2FFC94C03F8F13AECFBD8F765" lastPageId="7" lastPageNumber="5" pageId="6" pageNumber="355">
<subSubSection box="[1876,2811,3995,4056]" pageId="6" pageNumber="355" type="vernacular_names">
<paragraph blockId="6.[1876,3420,3995,4056]" box="[1876,2811,3995,4056]" pageId="6" pageNumber="355">
<figureCitation box="[1876,1943,3995,4056]" captionStart="On" captionStartId="2.[173,216,4419,4446]" captionTargetBox="[177,3496,132,4697]" captionTargetPageId="1" captionText="On following pages: 4. Hamilton'sTomb Bat [Taphozous hamiltoni); 5. Hildegarde'sTomb Bat (Taphozous hildegardeae}; 6. Long-winged Sheath-tailed Bat (Taphozous longimanus); 7. Black-beardedTomb Bat (Taphozous melanopogoh); 8. Theobald'sTomb Bat (Taphozous theobaldi); 9. Indonesian Tomb Bat (Taphozous achates); 10. Arnhem Sheath-tailed Bat (Taphozous kapalgensis}; 11. Coastal Sheath-tailed Bat (Taphozous australis}; 12. Common Sheath-tailed Bat (Taphozous georgianus); 13. Hill's Sheath-tailed Bat (Taphozous hilli); 14. Troughton's Sheath-tailed Bat (Taphozous troughtoni); 15. Pel's Pouched Bat (Saccolaimus peli); 16. Bare-rumped Sheathtailed Bat (Saccolaimus saccolaimus); 17. Papuan Sheath-tailed Bat (Saccolaimus mixtus); 18. Yellow-bellied Sheath-tailed Bat (Saccolaimus flaviventris)." figureDoi="http://doi.org/10.5281/zenodo.3740271" httpUri="https://zenodo.org/record/3740271/files/figure.png" pageId="6" pageNumber="355">14</figureCitation>
. 
<emphasis box="[1972,2811,3995,4056]" pageId="6" pageNumber="355">Troughton’s Sheath-tailed Bat</emphasis>
</paragraph>
</subSubSection>
<subSubSection box="[2881,3420,3995,4056]" pageId="6" pageNumber="355" type="nomenclature">
<paragraph blockId="6.[1876,3420,3995,4056]" box="[2881,3420,3995,4056]" pageId="6" pageNumber="355">
<taxonomicName authorityName="Tate" authorityYear="1952" box="[2881,3420,3995,4056]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="6" pageNumber="355" phylum="Chordata" rank="species" species="troughtoni">
<emphasis box="[2881,3420,3995,4056]" italics="true" pageId="6" pageNumber="355">Taphozous troughtoni</emphasis>
</taxonomicName>
</paragraph>
</subSubSection>
<subSubSection pageId="6" pageNumber="355" type="vernacular_names">
<paragraph blockId="6.[1871,3203,4070,4150]" box="[1872,3203,4070,4097]" pageId="6" pageNumber="355">
<emphasis bold="true" box="[1872,1971,4070,4097]" pageId="6" pageNumber="355">French</emphasis>
: Taphien 
<emphasis box="[2093,2284,4070,4097]" italics="true" pageId="6" pageNumber="355">deTroughton I</emphasis>
<emphasis bold="true" box="[2292,2411,4070,4097]" pageId="6" pageNumber="355">German</emphasis>
: Troughton-Grabfledermaus 
<emphasis box="[2787,2798,4070,4097]" italics="true" pageId="6" pageNumber="355">I</emphasis>
<emphasis bold="true" box="[2806,2926,4070,4097]" pageId="6" pageNumber="355">Spanish</emphasis>
: 
<emphasis box="[2935,3024,4070,4097]" italics="true" pageId="6" pageNumber="355">Tafozo</emphasis>
de 
<emphasis box="[3069,3203,4070,4097]" italics="true" pageId="6" pageNumber="355">Troughton</emphasis>
</paragraph>
<paragraph blockId="6.[1871,3203,4070,4150]" box="[1871,2496,4123,4150]" pageId="6" pageNumber="355">
<emphasis bold="true" box="[1871,2203,4126,4149]" pageId="6" pageNumber="355">Other common names:</emphasis>
Troughton'sTomb Bat
</paragraph>
</subSubSection>
<subSubSection pageId="6" pageNumber="355" type="reference_group">
<paragraph blockId="6.[1869,3485,4220,4645]" pageId="6" pageNumber="355">
<emphasis bold="true" box="[1870,2081,4221,4260]" pageId="6" pageNumber="355">
<emphasis box="[1870,2073,4221,4260]" pageId="6" pageNumber="355">Taxonomy</emphasis>
.
</emphasis>
<taxonomicName authority="Tate, 1952" authorityName="Tate" authorityYear="1952" box="[2102,2682,4221,4260]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="6" pageNumber="355" phylum="Chordata" rank="species" species="troughtoni">
<emphasis box="[2102,2470,4221,4260]" italics="true" pageId="6" pageNumber="355">Taphozous troughtoni</emphasis>
Tate, 1952
</taxonomicName>
, “ 
<materialsCitation ID-GBIF-Occurrence="2595781813" country="Australia" location="Rifle Creek, Mt. Isa, northwest" pageId="6" pageNumber="355" specimenCount="1" stateProvince="Queensland" typeStatus="type">
<location LSID="urn:lsid:plazi:treatment:03D587F2FFC94C03F8F13AECFBD8F765:8EA3603FFFC94C02F507250AF30AEF4D" box="[2722,3314,4221,4260]" country="Australia" name="Rifle Creek, Mt. Isa, northwest" pageId="6" pageNumber="355" stateProvince="Queensland">Rifle Creek, Mt. Isa, northwest</location>
<collectingRegion country="Australia" name="Queensland" pageId="6" pageNumber="355">Queensland</collectingRegion>
,” 
<collectingCountry box="[1992,2164,4272,4311]" name="Australia" pageId="6" pageNumber="355">Australia</collectingCountry>
</materialsCitation>
.
</paragraph>
</subSubSection>
<subSubSection pageId="6" pageNumber="355" type="discussion">
<paragraph blockId="6.[1869,3485,4220,4645]" pageId="6" pageNumber="355">
<taxonomicName authorityName="Tate" authorityYear="1952" box="[1875,2241,4326,4365]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="6" pageNumber="355" phylum="Chordata" rank="species" species="troughtoni">
<emphasis box="[1875,2241,4326,4365]" italics="true" pageId="6" pageNumber="355">Taphozous troughtoni</emphasis>
</taxonomicName>
is in the subgenus 
<emphasis box="[2611,2794,4326,4365]" italics="true" pageId="6" pageNumber="355">
<taxonomicName box="[2611,2789,4326,4365]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="6" pageNumber="355" phylum="Chordata" rank="genus">Taphozous</taxonomicName>
.
</emphasis>
It was considered ajunior synonym of 
<emphasis box="[1926,2161,4378,4417]" italics="true" pageId="6" pageNumber="355">
<taxonomicName authorityName="Thomas" authorityYear="1915" box="[1926,2155,4378,4417]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="6" pageNumber="355" phylum="Chordata" rank="species" species="georgianus">T georgianus</taxonomicName>
,
</emphasis>
but. T. Chimimba and D. J. Kitchener in 1991 raised 
<emphasis box="[3222,3250,4378,4417]" italics="true" pageId="6" pageNumber="355">it</emphasis>
to a distinct species. Monotypic.
</paragraph>
</subSubSection>
<subSubSection pageId="6" pageNumber="355" type="distribution">
<caption ID-DOI="http://doi.org/10.5281/zenodo.3747930" ID-Zenodo-Dep="3747930" box="[1870,3058,4483,4522]" httpUri="https://zenodo.org/record/3747930/files/figure.png" inLine="true" pageId="6" pageNumber="355" targetBox="[209,1001,354,916]" targetPageId="7">
<paragraph blockId="6.[1869,3485,4220,4645]" box="[1870,3058,4483,4522]" pageId="6" pageNumber="355">
<emphasis bold="true" box="[1870,2107,4483,4522]" pageId="6" pageNumber="355">Distribution.</emphasis>
NE Australia endemic, in WC, C &amp; E Queensland.
</paragraph>
</caption>
<paragraph blockId="6.[1869,3485,4220,4645]" lastBlockId="7.[1030,1829,349,920]" lastPageId="7" lastPageNumber="5" pageId="6" pageNumber="355">
Descriptive 
<emphasis bold="true" box="[2095,2207,4536,4575]" pageId="6" pageNumber="355">notes.</emphasis>
<emphasis box="[2223,2442,4536,4575]" italics="true" pageId="6" pageNumber="355">Head-body</emphasis>
79-4-86-3 mm, tail 31-5-36-9 mm, ear 22-4-27-1 mm, hindfoot 9-8-10-3 mm, forearm 73-76 mm; weight. 20-29 g. Dorsum of Troughton’s 
<subSubSection pageId="7" pageNumber="5" type="biology_ecology">
Sheath-tailed Bat is predominately olive­ brown, with pale mouse-gray guard hairs. Venter surface hairs are 
<emphasis box="[1502,1722,456,495]" italics="true" pageId="7" pageNumber="5">olive-brown</emphasis>
from chin to shoulders and posteriorly dark yellow-brown, with guard hairs of pale mouse-gray. Uropatagium close to abdomen is heavily furred. Throat pouches are absent, and radio-metacarpal sacs are present in both sexes. Skin of rhinarium, wings, uropatagium, lips, face, and 
<taxonomicName box="[1714,1827,828,867]" class="Liliopsida" family="Poaceae" genus="Tragus" kingdom="Plantae" order="Poales" pageId="7" pageNumber="354" phylum="Tracheophyta" rank="genus">tragus</taxonomicName>
are fuscous (pale yellow). 
<paragraph blockId="7.[207,1830,934,2188]" box="[209,1569,934,973]" pageId="7" pageNumber="5">
<emphasis box="[209,352,934,973]" pageId="7" pageNumber="5">Habitat</emphasis>
. Wide variety of habitats and bioregions of interior Queensland.
</paragraph>
<paragraph blockId="7.[207,1830,934,2188]" pageId="7" pageNumber="5">
<emphasis bold="true" box="[209,569,987,1026]" pageId="7" pageNumber="5">
<emphasis box="[209,561,987,1026]" pageId="7" pageNumber="5">Food and Feeding</emphasis>
.
</emphasis>
Troughton’s Sheath-tailed Bats forage for 
<emphasis box="[1399,1524,987,1026]" italics="true" pageId="7" pageNumber="5">insects</emphasis>
well above tree canopies and high over open habitats. Large, high-flying grasshoppers are preferred food items and often taken back to cave roosts to eat.
</paragraph>
<paragraph blockId="7.[207,1830,934,2188]" box="[210,709,1147,1186]" pageId="7" pageNumber="5">
<emphasis box="[210,383,1147,1186]" pageId="7" pageNumber="5">Breeding</emphasis>
. No information.
</paragraph>
<paragraph blockId="7.[207,1830,934,2188]" pageId="7" pageNumber="5">
<emphasis bold="true" box="[207,531,1200,1239]" pageId="7" pageNumber="5">Activity patterns.</emphasis>
Troughton’s Sheath-tailed Bat roosts in caves, mines and tunnels, rock crevices, and 
<emphasis box="[570,673,1253,1292]" italics="true" pageId="7" pageNumber="5">rocky</emphasis>
escarpments. Echolocation call is less than 25 kHz and distinguishes it from the Common Sheath-tailed Bat (. 
<emphasis box="[1219,1418,1306,1345]" italics="true" pageId="7" pageNumber="5">
<taxonomicName authorityName="Thomas" authorityYear="1915" box="[1219,1406,1306,1345]" class="Mammalia" family="Emballonuridae" genus="Taphozous" kingdom="Animalia" order="Chiroptera" pageId="7" pageNumber="5" phylum="Chordata" rank="species" species="georgianus">georgianus</taxonomicName>
)
</emphasis>
where they 
<emphasis box="[1653,1813,1306,1345]" italics="true" pageId="7" pageNumber="5">co-occur</emphasis>
. 
<emphasis bold="true" box="[210,1203,1360,1399]" pageId="7" pageNumber="5">Movements, Home range and Social organization.</emphasis>
Large colonies of Troughton’s Sheath-tailed Bat can be found in landscapes with abundant 
<emphasis box="[1348,1452,1413,1452]" italics="true" pageId="7" pageNumber="5">rocky</emphasis>
outcrops, especially in tower karst. Colony size might be limited by roosting structures, especially in more arid areas where there are few caves deep enough to support large colonies.
</paragraph>
</subSubSection>
</paragraph>
</subSubSection>
<subSubSection pageId="7" pageNumber="5" type="conservation">
<paragraph blockId="7.[207,1830,934,2188]" pageId="7" pageNumber="5">
<emphasis box="[211,667,1572,1611]" pageId="7" pageNumber="5">
Status 
<emphasis bold="true" box="[332,667,1572,1611]" pageId="7" pageNumber="5">and Conservation</emphasis>
</emphasis>
. Classified as Least Concern on 
<emphasis box="[1261,1587,1572,1611]" italics="true" pageId="7" pageNumber="5">TheIUCNRed List.</emphasis>
Troughton’s Sheath-tailed Bat has a large distribution and 
<emphasis box="[1068,1289,1625,1664]" italics="true" pageId="7" pageNumber="5">presumably</emphasis>
large and stable overall population, uses a wide variety of habitats, occurs in protected areas, and does not face significant threats. It was originally recorded only from a small area in the Mount Isa Inland bioregion of Queensland, but recent studies based on isozymes and echolocation calls extend distribution further east throughout much of interior and near coastal region of central Queensland, formerly attributed to the Common Sheathtailed Bat. 
<emphasis box="[412,545,1943,1982]" italics="true" pageId="7" pageNumber="5">Recent</emphasis>
reports of absence of Troughton’s Sheath-tailed Bat in western parts of its distribution require additional verification, possibly leading to re-evaluation of its conservation status after taxonomic issues are clarified.
</paragraph>
</subSubSection>
<subSubSection pageId="7" pageNumber="5" type="reference_group">
<paragraph blockId="7.[207,1830,934,2188]" pageId="7" pageNumber="5">
<emphasis bold="true" box="[213,419,2108,2135]" pageId="7" pageNumber="5">Bibliography.</emphasis>
Chimimba &amp; Kitchener (1991), Hall (2008b), McKean &amp; Price (1967), Reardon &amp; 
<emphasis box="[1579,1718,2108,2135]" italics="true" pageId="7" pageNumber="5">Thomson</emphasis>
(2002), Tate (1952),Thomson 
<emphasis box="[539,589,2161,2188]" italics="true" pageId="7" pageNumber="5">eta</emphasis>
/. (2001), 
<emphasis box="[725,869,2161,2188]" italics="true" pageId="7" pageNumber="5">Woinarski</emphasis>
eta/. (2014).
</paragraph>
</subSubSection>
</treatment>
</document>

please let me know if you need more info to find the origin of the docOrigin text.

flsimoes commented 2 years ago

Still everything looks alright from our end. All the other treatments from the same "chapter" have the correct metadata, this should also have it. Just in case, I've re-triggered the upload with a new version. Let's see if it helps.

jhpoelen commented 1 year ago

HMW are now also available in separate csv files: hmw-volume-1.csv, hmw-volume-2.csv, ... . as discussed.

See also attached screenshot.

Screenshot from 2022-08-30 14-12-39