Closed bepetersn closed 10 years ago
@bepetersn, I took a quick look at your example and I suspect that your intuition is right, these records were probably created with a less good version of our ILCS parsing code. Rerunning statute2iucr should fix this. We would also have to recreate the convictions from the disposition records, but that's no big deal. Thanks for looking into this.
While going very carefully over the data in our database, I found that there is some mysterious data whose origin I can't understand. Not too much, but a little. For example, searching by the charge description
HARASSMENT BY TELEPHONE
...The output is:
Fine, right? Except that doing an equivalent search over the IUCR crosswalk yields totally different IUCR codes and categories.
The output is:
This doesn't seem possible, at least to my current understanding of how we generated our IUCR categories and codes. My understanding is that the
statute2iucr
management command was originally run to generate codes and categories, which itself relies on just the same method I used,get_iucr()
, and the crosswalk behind the scenes. However, as you can see, the crosswalk doesn't have these values.The reason this is worth pointing out is that, at least by the crosswalk, it would seem we can map from the charge description
HARASSMENT BY TELEPHONE
to the IUCR categoryDisorderly Conduct
reliably, which is part of the task of #6.Perhaps the
statute2iucr
command simply needs to be run again. In the meantime, it's easy to just work around this by doing a double check in the way I did above, thatget_iucr()
also contains an IUCR category, given a statute.