Open timothyclemansinsea opened 8 years ago
Aaaargh! They deleted data I used??? Noooo... I don't think I downloaded all the columns -- I just got the ones I needed. Rats.
What are RMS and DSG and CAD here?
RMS is Report Management System, CAD is Computer Aided Dispatch, and DSG is data.seattle.gov
On Tue, Apr 26, 2016 at 9:34 PM, Pat Tressel notifications@github.com wrote:
Aaaargh! They deleted data I used??? Noooo... I don't think I downloaded all the columns -- I just got the ones I needed. Rats.
What are RMS and DSG and CAD here?
— You are receiving this because you authored the thread. Reply to this email directly or view it on GitHub https://github.com/ptressel/seattle_crime_vs_rain/issues/2#issuecomment-214967548
Ah, thanks!
We should re-start this issue over in the openseattle/crime_analysis repo.
Since your paper all of the data was deleted by SPD in attempt in deal with duplicates caused by beats boundary change. Not all the events there were there before have been reuploaded. They should have made a primary key and done upserts to prevent the problem. And instead of deleting all the data and rerunning the slow process, very slow due to geocoding, they should have used an "intelligent script" to remove duplicates. Additionally SPD's crime data on data.seattle.gov has been problematic since day one because they filter out such a wide range incidents. Unfortantely I stupidly deleted my records requested copy of the filtering rules. But there are so many ways an event could get filtered. I have a records request in for the complete listing of events without address column. https://queryplayground.com/#/playset/-KGFf-gNziWJ8x2bDXp8 charts the RMS data on DSG now and you can see it doesn't go as far back as it used too and Jan/Feb 16 are too low.
I'd like to see a discussion of why one should use RMS data instead of CAD data which has so many more events. I take it is because RMS events are more likely to be considered "real crimes".