thekingofkings / chicago-crime

Crime correlation analysis
MIT License

Figure out the setting of KDD submission #25

Closed thekingofkings closed 7 years ago

thekingofkings commented 7 years ago

There is a table "comparing different feature settings for overall crime" in the KDD paper. The experiment settings used to produce this table are no longer clear.

Download the release v2.0 to figure this out.

thekingofkings commented 7 years ago

Confirmed Settings

  1. Demos are from Corina. There is no issue with demo features.
  2. POI features are the POI counts per category for each region.
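
For reference, a minimal sketch of what "POI count per category for each region" could look like. The function name `poi_count_features`, the `(region, category)` input format, and the integer region indices are assumptions for illustration, not the repository's actual code.

```python
import numpy as np

def poi_count_features(pois, n_regions, categories):
    """Build a (n_regions x n_categories) matrix of POI counts.

    pois: iterable of (region_index, category_name) pairs (hypothetical format).
    """
    col = {c: k for k, c in enumerate(categories)}
    X = np.zeros((n_regions, len(categories)), dtype=int)
    for region, category in pois:
        # One POI adds one to its region's count for that category.
        X[region, col[category]] += 1
    return X
```

Each row is then the POI feature vector for one region, with one column per category.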

Known Difference

The leaveOneOut evaluation in the KDD submission has a minor issue. It generates and normalizes all features first, and only then leaves the testing sample out. As a result, the geo and taxi features of the training samples still contain information about the crime in the testing sample.
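
A minimal sketch of the leakage, assuming the geo and taxi features are weighted sums of the other regions' crime (`f_i = sum_j W[i, j] * y[j]`). The weight matrix `W` and the helper names are hypothetical; the actual feature code in the repository may differ.

```python
import numpy as np

def lag_feature(W, y, exclude=None):
    """Weighted sum of other regions' crime: f_i = sum_j W[i, j] * y[j]."""
    W = np.array(W, dtype=float)   # copy so the caller's matrix is untouched
    np.fill_diagonal(W, 0.0)       # a region does not use its own crime
    if exclude is not None:
        W[:, exclude] = 0.0        # drop the held-out region's crime
    return W @ np.asarray(y, dtype=float)

def leaky_train_features(W, y, test_idx):
    # Features are built from all regions, then the test row is removed.
    # Training rows still include y[test_idx] through W.
    return np.delete(lag_feature(W, y), test_idx)

def clean_train_features(W, y, test_idx):
    # The test region's crime is excluded before the features are built.
    return np.delete(lag_feature(W, y, exclude=test_idx), test_idx)
```

With all-ones weights and `y = [1, 2, 4]`, holding out region 2 gives training features `[6, 5]` in the leaky version and `[2, 1]` in the clean one; the difference is exactly the held-out region's crime count.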

thekingofkings commented 7 years ago

Bugs in previous normalization by destination

Known bug in #2 and #8: the transpose is missing. The transpose is needed because we multiply the flow matrix by the crime vector.
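
A minimal sketch of why the transpose matters, assuming `F[i, j]` holds the taxi flow from origin `i` to destination `j`. That orientation and the function names are assumptions for illustration, not taken from the repository.

```python
import numpy as np

def normalize_by_destination(F):
    """Divide each column by the total inflow of that destination."""
    F = np.asarray(F, dtype=float)
    col_sums = F.sum(axis=0, keepdims=True)
    col_sums[col_sums == 0] = 1.0  # avoid division by zero for empty columns
    return F / col_sums

def taxi_lag(F, y):
    """Feature for destination j: sum_i P[i, j] * y[i], i.e. P.T @ y."""
    P = normalize_by_destination(F)
    # After column normalization the weights for destination j sit in
    # column j of P, so P must be transposed before multiplying by y.
    return P.T @ np.asarray(y, dtype=float)
```

Under this assumption, `P @ y` would instead combine weights along rows, which do not sum to one after normalizing by destination, so the resulting feature is no longer a weighted average of the origin regions' crime.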