M-Nicholls closed 1 year ago
Created DwCA transformation code - https://github.com/charvolant/dwca-ala-transform
Waiting for comments. Also, would this be a good thing to put into the biocache store while we're about it?
DWCA end-point for sample http://root.ala.org.au/bdrs-core/nswww/webservice/application/downloadDwca.htm
Developed mapping service
Script for ACT and Southern Tablelands Weed Spotter built.
Loaded ACT BioBlitz to http://collections.ala.org.au/public/show/dr4577
There are a large number (~15%) of 'Unlisted species' entries. These are loaded, since someone saw something there. There are catalogNumbers so subsequent correct identifications can be uploaded.
Loaded ACT and Southern Tablelands Weed Spotter (just surveys 169 and 206) to http://collections.ala.org.au/public/show/dr1109. Updated the jenkins script on mccartney to load using the DwCA transformer on dury.
Loaded Atlas of Life in the Coastal Wilderness to http://collections.ala.org.au/public/show/dr702 and updated jenkins job
Loaded Condamine Alliance to http://collections.ala.org.au/public/show/dr4608
Weed observation and Animal observation surveys only. Filter is datasetID == "urn:lsid:bdrs.ala.org.au:Survey:89" or datasetID == "urn:lsid:bdrs.ala.org.au:Survey:188" or datasetID == "urn:lsid:bdrs.ala.org.au:Survey:211"
CCSA Feral Or Peril has absence records for survey 86. Access via http://feralperil.ala.org.au/bdrs-core/portal/25/webservice/application/downloadDwca.htm
CSIRO Yellow Box: access via http://yellowbox.ala.org.au/bdrs-core/portal/22/webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4611
Flora and Fauna of Wamboin and Bywong: available at http://root-uat.ala.org.au/bdrs-core/ihv/webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4612
GER-K2C loaded to http://collections.ala.org.au/public/show/dr4614
GER-S2S at http://s2s.ala.org.au/bdrs-core/portal/12/home.htm, access via http://s2s.ala.org.au/bdrs-core/portal/12/webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4621. Set up jenkins job for monthly harvesting.
GER-SH at http://sh-birds.ala.org.au/bdrs-core/portal/10/home.htm, access via http://sh-birds.ala.org.au/bdrs-core/portal/10/webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4625. Set up jenkins job for monthly harvesting.
Mangrove Watch at http://www.mangrovewatch.org.au No direct access to data; need contact.
Contacted. Data is available at http://root.ala.org.au/bdrs-core/mangroves/webservice/application/downloadDwca.htm. To be loaded to http://collections.ala.org.au/dataResource/show/dr4672 once rufus gets on its funky way.
Bird Atlas NSW at http://root-uat.ala.org.au/bdrs-core/nsw-ba/home.htm, access via http://root-uat.ala.org.au/bdrs-core/nsw-ba/webservice/application/downloadDwca.htm. There's an existing upload at http://collections.ala.org.au/public/show/dr1089 with 3m-odd records. There's also another, empty DR at http://collections.ala.org.au/public/showDataResource/dr3378. All records from BDRS look like they've arrived after the main upload, so adding to dr3378.
Bathing Birds at root.ala.org.au/bdrs-core/npansw/home.htm access via root.ala.org.au/bdrs-core/npansw/webservice/application/downloadDwca.htm
Only including records with datasetID == "urn:lsid:bdrs.ala.org.au:Survey:289"
It would be nice to add: and not deleteThisRecord == "true"
But the filter works on the original record, not the transformed record, so manually deleting these from the CSV file.
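The manual clean-up of the CSV could be scripted along these lines. This is only a sketch: it assumes deleteThisRecord appears as a column in the transformed CSV, and the function name drop_deleted and the sample columns are made up for illustration, not part of the transformer.

```python
import csv
import io

def drop_deleted(src, dst, flag_column="deleteThisRecord"):
    """Copy CSV rows from src to dst, skipping rows flagged for deletion.

    Assumption: the flag lives in a column of the transformed CSV and
    holds the string "true" for records that should be removed.
    """
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    dropped = 0
    for row in reader:
        if (row.get(flag_column) or "").strip().lower() == "true":
            dropped += 1
            continue
        writer.writerow(row)
    return dropped

# Illustrative data only; real files have many more columns.
sample = io.StringIO(
    "catalogNumber,datasetID,deleteThisRecord\n"
    "1,urn:lsid:bdrs.ala.org.au:Survey:289,false\n"
    "2,urn:lsid:bdrs.ala.org.au:Survey:289,true\n"
)
cleaned = io.StringIO()
dropped = drop_deleted(sample, cleaned)
```

With real files, the two StringIO objects would be replaced by files opened with newline="" as the csv module recommends.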
NSW Waterwatch at http://root.ala.org.au/bdrs-core/nswww/home.htm, loaded to http://collections.ala.org.au/public/show/dr4645. Set up jenkins job to do monthly harvesting.
Note that the mapping file must be encoded in ISO-8859-1 for tomcat to interpret the file correctly. Otherwise, the micro symbol is encoded incorrectly. See https://wiki.apache.org/tomcat/FAQ/CharacterEncoding
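For reference, this is what the mismatch looks like at the byte level. A small sketch, assuming the mapping file had been saved as UTF-8 and was then read as ISO-8859-1; it does not involve tomcat itself.

```python
micro = "\u00b5"  # MICRO SIGN

# One byte in ISO-8859-1, two bytes in UTF-8.
latin1_bytes = micro.encode("iso-8859-1")  # b'\xb5'
utf8_bytes = micro.encode("utf-8")         # b'\xc2\xb5'

# Reading UTF-8 bytes as ISO-8859-1 turns the single symbol into two
# characters: a capital A with circumflex followed by the micro sign.
misread = utf8_bytes.decode("iso-8859-1")
```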
Redland City Council at http://root.ala.org.au/bdrs-core/rcc/home.htm, via root.ala.org.au/bdrs-core/rcc//webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4652. Set up jenkins job for monthly harvesting.
Scenic Rim Regional Council at http://root.ala.org.au/bdrs-core/scenicrim/home.htm has one record in it. Created data resource and loaded to http://collections.ala.org.au/public/show/dr4664. No automated upload, since any further work will be via biocollect.
Scotland Island at http://root.ala.org.au/bdrs-core/scotis/home.htm has one record. Uploaded to http://collections.ala.org.au/public/show/dr4665. No automated upload, since any further work will be via biocollect.
Tweed Council Koala count http://root.ala.org.au/bdrs-core/tweed/home.htm and Tidbinbilla http://root.ala.org.au/bdrs-core/tbbilla/home.htm also only have a few records between them.
Tweed council uploaded to http://collections.ala.org.au/public/show/dr4669
Tidbinbilla uploaded to http://collections.ala.org.au/public/show/dr4667
Upper Murrumbidgee Waterwatch at http://root.ala.org.au/bdrs-core/umww/home.htm, via http://root.ala.org.au/bdrs-core/umww/webservice/application/downloadDwca.htm, loaded to http://collections.ala.org.au/public/show/dr4659. Active site, so set up jenkins harvesting job.
Weed biological control at http://root.ala.org.au/bdrs-core/wbiocont/home.htm via http://root.ala.org.au/bdrs-core/wbiocont/webservice/application/downloadDwca.htm
Requires generating a second record for the host
BDRS for some reason encodes John O'Grady as 'John O"'Grady'. This doesn't pass muster in the GBIF DWCA reader for some completely inexplicable reason totally unrelated, no doubt, to it not being a sensible encoding.
Added fixBdrs option to dwc-archive-plugin to fix escape errors.
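A rough illustration of the kind of repair involved. This is not the code behind the fixBdrs option; it is a guess at one workable approach, assuming the problem is always a stray double quote sitting between a word character and an apostrophe. The name fix_bdrs_escapes is hypothetical.

```python
import re

# Assumption: the bad escape is a double quote wedged between a word
# character and an apostrophe, as in O"'Grady. A quoted CSV field that
# legitimately starts with an apostrophe ("'tis") is left alone because
# its opening quote is not preceded by a word character.
_STRAY_QUOTE = re.compile(r'(?<=\w)"(?=\')')

def fix_bdrs_escapes(text):
    """Remove stray double quotes that precede an apostrophe mid-word."""
    return _STRAY_QUOTE.sub("", text)

fixed = fix_bdrs_escapes('John O"\'Grady')
```

Running the repair over the raw text before parsing matters here, since the reader rejects the file before any per-field clean-up could happen.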
http://root.ala.org.au/bdrs-core/nswww/home.htm