alrichardbollans / wcvpy

Packages for downloading the World Checklist of Vascular Plants (WCVP) and resolving plant names to it
GNU General Public License v3.0
2 stars 0 forks source link

Space-like characters #8

Closed alrichardbollans closed 1 year ago

alrichardbollans commented 2 years ago

Characters that may look like spaces in data can cause more conservative matching, as the characters differ from WCVP data

alrichardbollans commented 2 years ago

E.G. Currently 'Strophanthus vanderijstii[NBSP]Staner' resolves to the genus. Fix this by either raising an error with these kinds of spaces or replacing them with v = value.replace("\xa0", " ")

alrichardbollans commented 2 years ago

Similarly, 'nan's in taxa lists cause errors in get_acc_info_from_matches as nans are in in_df but not in final_resolved_df