NCIOCPL / glossary-api

API for Dictionary of Cancer Terms, Dictionary of Genetics Terms, and other Glossary documents.
0 stars 5 forks source link

Letters with diacritics are treated as "not a letter." #137

Closed blairlearn closed 1 year ago

blairlearn commented 3 years ago

Issue description

Letters with diacritics, e.g. Á, are grouped with "not a letter" instead of the normalized letter.

This is an issue in the loader.

ESTIMATE 20

Steps to reproduce the issue

  1. Go to https://www.cancer.gov/espanol/publicaciones/diccionario
  2. Click on the # in the AtoZ list.
  3. Scroll down past the terms starting with 1

What's the expected result?

What's the actual result?

Additional details / screenshot

Related Tickets

bkline commented 3 years ago

Changed requirements implemented on DEV. New test set attached.

glossary.zip

blairlearn commented 1 year ago

This has already been implemented.