ternaustralia / ausplotsR

R package to interact with TERN AusPlots data
GNU General Public License v3.0
10 stars 6 forks source link

Update species_table to reflect name filtering options #18

Closed GregGuerin closed 4 years ago

GregGuerin commented 4 years ago

With new taxonomy fields from the database, add user option to use herbarium_determination or new standardised name field(s) - making it easy to get a table of standardised names or even genus or family, species level or lowest available level.

Sammunroe commented 4 years ago

I will try and do some prelim work on adding user option to use herbarium_determination or new standardised name field(s), but we will have to wait to see the new fields to strip out non-vascular plants and incomplete records.

Sammunroe commented 4 years ago

I have created some new code to select the herbarium_determination or new standardised name field. But will we want to think some more about how to approach incomplete names. Beyond the obvious ones, such as "no_id", how do we want to treat ids that are only down to the genus level? Excluding an accurate genus ID would needlessly lower diversity estimates, but presumably the new standardized name field will only include genus_species if available? I think the standardised name field may need to be re-thought of as the "lowest possible identification", then we can have other columns that only give the genus_species if available?

GregGuerin commented 4 years ago

The standardised name data should be able to return Genus sp. records - need to see an example of the output though! They should be included by default but with an option/argument for stripping them out for 'clean' species lists/matrices (because for example they could be treated as the same taxon in turnover analysis). Again, hard to know the best way to filter on that without seeing the fields - could be a flag for ID level, 'sp.' as epithet etc - remembering that 'phrase names' also have sp. in the name.

Sammunroe commented 4 years ago

I have some code that can "strip" the species name down to genus_species, I wrote that for my own work last year

I could create something which then deletes all the "hits" from veg.PI that do not provide a genus_species, but I am not sure that would easier than just having another column that flags the ID level. Much cleaner code if we add a flag for ID level.