PanamaForestGEO / Panama_plant_species_lists

Databases of scientific names of plants (mostly woody plants) in Panama, together with the 6-letter and 4-letter codes used for these species in Smithsonian research, and histories of changes in the species names associated with these codes.

Write code to harmonize / bring together information in the different files. #2

Open hmullerlandau opened 3 years ago

hmullerlandau commented 4 months ago

This should be in the form of an R Markdown document that generates a report and includes the following:

(1) Check each file individually for duplicate names or codes. Any duplicates should be enumerated in tables within the report and flagged in the files.

(2) Run each species name in each file against TNRS (Taxonomic Name Resolution Service), using the TNRS R package, to match it with the current accepted name. Search for cases where the same accepted name is associated with more than one current name, both within each file and across files. Include tables of these cases in the report.

(3) Check across the files for cases in which the same 6-letter code or 4-letter code is associated with different current species names, and for cases in which it is associated with different current accepted names. Include tables of these cases in the report.

(4) Merge all records from all files. Where a single file has more than one row with the same value in the field used for merging, keep just one of those rows in the merged file and write the excluded duplicates to separate files.
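To make the logic of checks (1), (3) and (4) concrete, here is a minimal sketch. It is written in plain Python only to show the logic; the report itself would do the same thing in R Markdown. The column names `code6` and `name`, the file labels, and the sample rows are all invented for illustration and are not taken from the actual files. The TNRS lookup in (2) is left out because it needs a network call.

```python
from collections import defaultdict

def find_duplicates(rows, field):
    """Check (1): map each repeated value of `field` to the row indices where it occurs."""
    seen = defaultdict(list)
    for i, row in enumerate(rows):
        value = row.get(field, "").strip()
        if value:  # ignore blank cells
            seen[value].append(i)
    return {value: idx for value, idx in seen.items() if len(idx) > 1}

def find_code_conflicts(files, code_field, name_field):
    """Check (3): map each code linked to more than one name to {name: [file labels]}."""
    names_by_code = defaultdict(lambda: defaultdict(list))
    for label, rows in files.items():
        for row in rows:
            code = row.get(code_field, "").strip()
            name = row.get(name_field, "").strip()
            if code and name and label not in names_by_code[code][name]:
                names_by_code[code][name].append(label)
    return {code: dict(names) for code, names in names_by_code.items()
            if len(names) > 1}

def split_duplicates(rows, field):
    """Step (4): keep the first row per value of `field`; return (kept, excluded)."""
    kept, excluded, seen = [], [], set()
    for row in rows:
        value = row.get(field, "").strip()
        if value and value in seen:
            excluded.append(row)  # later repeat: goes to the excluded-duplicates file
        else:
            kept.append(row)      # first occurrence, or a blank key we cannot judge
            seen.add(value)
    return kept, excluded

# Invented sample rows; the column names are assumptions, not the real headers.
file_a = [
    {"code6": "abcdef", "name": "Genus1 species1"},
    {"code6": "abcdef", "name": "Genus1 species1"},
    {"code6": "ghijkl", "name": "Genus2 species2"},
]
file_b = [
    {"code6": "abcdef", "name": "Genus1 species3"},
    {"code6": "ghijkl", "name": "Genus2 species2"},
]

duplicates = find_duplicates(file_a, "code6")
conflicts = find_code_conflicts({"file_a": file_a, "file_b": file_b}, "code6", "name")
kept, excluded = split_duplicates(file_a, "code6")

print(duplicates)
print(conflicts)
print(len(kept), len(excluded))
```

Keeping the first row and setting the rest aside is only one possible rule for (4). Which row should survive when duplicates differ in their other columns is a decision the sketch does not make.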