This PR builds upon functionality created by @damianooldoni in the get_taxa_populate_spreadsheet branch, with major revisions.
get_taxa.Rmd
Reviewed. Now outputs ALL checklist taxa, with a validDistribution column.
Output:
data/raw/checklists.csv
data/raw/taxa.csv
get_information.Rmd
Replaces standardize_save_unified_checklist.Rmd (I think) and queries related information for taxa that have a valid distribution.
Output:
data/raw/distributions.csv
data/raw/speciesprofiles.csv
data/raw/descriptions.csv
verify_taxa.Rmd
Renamed from verify_synonyms.Rmd, but otherwise untouched. Should be updated in a next step.
unify_information.Rmd
Replaces unify_info_from_checklists.Rmd and uses a simplified way of unifying the information (first within each checklist, then across checklists, for all related information), but generally still follows the description in the Google Doc.
Output:
data/unified/distributions.csv
data/unified/speciesprofiles.csv
data/unified/descriptions.csv
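The two-pass idea (within checklist first, then across checklists) can be sketched as follows. This is an illustration only, not the code in unify_information.Rmd: it is written in Python rather than R, it assumes "unifying" means collapsing rows that share a key and keeping the first non-empty value per field, and the column names checklist_key, habitat and country are hypothetical. Only bb_key comes from this PR.

```python
def unify(rows, key_fields):
    """Collapse rows sharing the same key, keeping the first non-empty value per field.

    Assumption: this merge rule is illustrative, not the rule used in the notebook.
    """
    merged = {}
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        target = merged.setdefault(key, {})
        for field, value in row.items():
            # Only fill a field that is still missing or empty.
            if value not in (None, "") and target.get(field) in (None, ""):
                target[field] = value
    return list(merged.values())


def unify_information(rows):
    # Pass 1: unify within each checklist (hypothetical checklist_key column).
    within = unify(rows, ("checklist_key", "bb_key"))
    # Pass 2: unify across checklists on bb_key alone.
    across = unify(within, ("bb_key",))
    for row in across:
        row.pop("checklist_key", None)  # no longer meaningful after pass 2
    return across


# Hypothetical sample records, not taken from the real data files.
rows = [
    {"checklist_key": "A", "bb_key": 1, "habitat": "", "country": "BE"},
    {"checklist_key": "A", "bb_key": 1, "habitat": "freshwater", "country": "BE"},
    {"checklist_key": "B", "bb_key": 1, "habitat": "marine", "country": "BE"},
    {"checklist_key": "B", "bb_key": 2, "habitat": "terrestrial", "country": "NL"},
]
unified = unify_information(rows)
```

With this rule, earlier checklists take precedence over later ones for conflicting values, so input order acts as a priority order. The actual precedence is whatever the Google Doc describes.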
Further
Unused or renamed data files have been removed
Some data directories still need cleanup
Added a test directory
The verify_taxa.Rmd step is skipped for now (unify_information.Rmd unifies on bb_key)
taxa.csv itself is not yet unified
DwC data can be built from the information in data/unified