tdwg / dwc-qa

Public question and answer site for discussions about Darwin Core
Apache License 2.0
49 stars 8 forks source link

Added ALA directory and data files #125

Closed nickdos closed 6 years ago

nickdos commented 6 years ago

As requested in https://github.com/VertNet/dwc-qa-manage/issues/39.

ansell commented 6 years ago

These are the processed values, not the raw data that we are collecting from data providers and giving to GBIF. (Not sure if that is an issue)

debpaul commented 6 years ago

Raw would be great too.

Sent from Shoe (my iPhone)

On Jul 4, 2018, at 5:49 PM, Peter Ansell notifications@github.com<mailto:notifications@github.com> wrote:

These are the processed values, not the raw data that we are collecting from data providers and giving to GBIF. (Not sure if that is an issue)

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHubhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_tdwg_dwc-2Dqa_pull_125-23issuecomment-2D402563736&d=DwMCaQ&c=HPMtquzZjKY31rtkyGRFnQ&r=ODXYRdWm1Oqf5-w5G2NjQw&m=uQ1tBJUFflkJcBoyWojx0pIPQZo37WdQ6mr0R_mGQ18&s=KdsIQpFqgC4G0pWVVQpGEA8jvq_4K3Jv57JJxYCuoHs&e=, or mute the threadhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_AC2gS2V8reKoT4SolQlv0PSTV30aL49Rks5uDThugaJpZM4VB-5Fv9&d=DwMCaQ&c=HPMtquzZjKY31rtkyGRFnQ&r=ODXYRdWm1Oqf5-w5G2NjQw&m=uQ1tBJUFflkJcBoyWojx0pIPQZo37WdQ6mr0R_mGQ18&s=fZTjIrnlVQLCVrXOl91lZ6nS9KIZtb7pRN2THpoJoaQ&e=.

nickdos commented 6 years ago

They are actually a mixture of raw and processed values as I used any field that was indexed in SOLR. I've pasted the mapping below - key is ALA version and value is DwC term:

DWCMAP[basis_of_record]=basisOfRecord
DWCMAP[country_code]=countrycode
DWCMAP[country]=country
DWCMAP[month]=month
DWCMAP[year]=year
DWCMAP[establishment_means]=establishmentMeans
DWCMAP[raw_identification_qualifier]=identificationQualifier
DWCMAP[license]=license
DWCMAP[occurrence_status_s]=occurrenceStatus
DWCMAP[reproductive_condition_s]=reproductiveCondition
DWCMAP[raw_sex]=sex
DWCMAP[rank]=taxonRank
DWCMAP[type_status]=typeStatus
tucotuco commented 6 years ago

Maybe a README.md file to explain the content? In particular how to get the raw data from the data set. This exercise is mostly to determine what one finds in the wild. And thanks!

On 00:50, Thu, Jul 5, 2018 Nick dos Remedios notifications@github.com wrote:

They are actually a mixture of raw and processed values as I used any field that was indexed in SOLR. I've pasted the mapping below - key is ALA version and value is DwC term:

DWCMAP[basis_of_record]=basisOfRecord DWCMAP[country_code]=countrycode DWCMAP[country]=country DWCMAP[month]=month DWCMAP[year]=year DWCMAP[establishment_means]=establishmentMeans DWCMAP[raw_identification_qualifier]=identificationQualifier DWCMAP[license]=license DWCMAP[occurrence_status_s]=occurrenceStatus DWCMAP[reproductive_condition_s]=reproductiveCondition DWCMAP[raw_sex]=sex DWCMAP[rank]=taxonRank DWCMAP[type_status]=typeStatus

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/tdwg/dwc-qa/pull/125#issuecomment-402599394, or mute the thread https://github.com/notifications/unsubscribe-auth/AAcP60PkxNmQNqYoZV6W-6SIjHx9Qgs5ks5uDY0CgaJpZM4VB_v9 .