njdowdy / tpt-taxonomy

Foundational taxonomic resources for the TPT project
GNU General Public License v3.0
6 stars 1 forks source link

Having 4 versions of each file is extremely confusing #26

Open zygoballus opened 1 year ago

zygoballus commented 1 year ago

I understand that different users may want different formats of data, but why, for example, do we have both Siphonaptera-standardized-v2.csv and siphonaptera-standardized.csv? Would there be any harm in deleting siphonaptera-standardized.csv (and the equivalent in each directory)? It would certainly help reduce confusion. @Jegelewicz

zygoballus commented 1 year ago

Also, it looks like the only formatting difference between "standardized" and the original format is the addition of more granular higher-level taxonomy, thus it should be trivial to generate a "v2" version of the data in the original data format (by just deleting the extra columns). Then we could dispense with the "v2"s altogether since all the files would be v2 (assuming we delete the old standardized versions).

Jegelewicz commented 1 year ago

@zygoballus I believe this is the versioning process for all of the TPT taxonomies. @njdowdy