Closed mao1756 closed 8 months ago
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
@ayhncgty Please review what I found, and if you agree, please approve the pull request. Once approved I will merge the PR.
@abdullahnaeemmalik If you have time, please have a look on it as well.
I added an analysis of which data points might be misleading and how to remove them. This addresses Issue #26. Summary:
Law enforcement facilities
It seems that wide variety of reports are concentrated at law enforcement facilities. Although we wouldn't check with TPD, it is possibly because they are where enforcement officers received the report. That is, they are not the location of the incident, rather where the incident report was received.
Following this hypothesis, we suggest not using data at LE facilities for
pix2pix
, but I believe using them for statistical analysis that does not involve the location of the crime is fine.Misleading categories
"COMMUNITY POLICING", "ADMIN", "RECOVERED PROP", "AUTO THEFT RECOVERY", "UNABLE TO VERIFY" and "TRAFFIC" should not be included in our dataset.
I also have a code on how to remove these reports in the notebook.