ShelterApp / AddResources

http://shelterapp.org/
11 stars 10 forks source link

Manually spot check tmp collections for false positives. #3

Open prabhushrikant opened 4 years ago

prabhushrikant commented 4 years ago
prabhushrikant commented 4 years ago

Following checks can be done:

  1. If the resource is providing more than one service is it duplicated in the csv data? Should we then also match on category while dupe checking?
  2. Abbreviations (St. vs Saint etc)
  3. Spelling mistakes
  4. Spit out the list of dropped services for manual checking.
prabhushrikant commented 3 years ago

We have now collected a lot of tmp data collection but we would like to improve our partial matching logic. Spot check the data collected in tmp collections like tmpIRS or tmpHUD and see if those resources are already found in other collections such as services or tmpILMS

Also check if tmp duplicates are actaully duplicates found in some collection and not false positives