Describe the taskApplicant Name column is inconsistent. For instance, "ABC & Company" could also be spelled "ABC and Co". Understanding which spelling or name is correct is impossible without human judgment. The person uploading a dataset should get a warning message (before the dataset is finally uploaded). This warning message should contain a list of lists of values that are very similar, indicating that it is possible the same person/entity. From there the user can work on their spreadsheet to fix issues and try uploading again. The user should also have an option to ignore this message and proceed to upload regardless.
Purpose
It normalizes the data for easier retrieval and proper groupings.
Acceptance Criteria
[ ] Similar sounding names are displayed to the user in a warning before the final upload
[ ] User can ignore the warning and proceed to the upload
Additional context
Please see GER_notes_for_Agile_team, section 3 for my implementation using python library difflib
Describe the task
Applicant Name
column is inconsistent. For instance, "ABC & Company" could also be spelled "ABC and Co". Understanding which spelling or name is correct is impossible without human judgment. The person uploading a dataset should get a warning message (before the dataset is finally uploaded). This warning message should contain a list of lists of values that are very similar, indicating that it is possible the same person/entity. From there the user can work on their spreadsheet to fix issues and try uploading again. The user should also have an option to ignore this message and proceed to upload regardless.Purpose It normalizes the data for easier retrieval and proper groupings.
Acceptance Criteria
Additional context