thekingofkings / chicago-partition

Automatically partition Chicago into Community Areas (CA), while minize the CA level crime prediction error.
MIT License
1 stars 1 forks source link

Add other features as predictors #10

Open thekingofkings opened 6 years ago

thekingofkings commented 6 years ago

If we follow the prediction setting of previous papers, the point-of-interest (POI) features, spatial distance features, and taxi flow features should also be added to predict the target (crime / house price).

POI features

Use the POI features generated from foursquarePOI.py. Raw count of POI in each category is preferred over the normalized percentage.

Spatial distance features

Top 6 closest neighboring CAs should be considered. The reverse distance weight is applied on the target variable of nearby CAs.

An adjacency matrix of all CAs should be maintained. Each time a flip tract label actions may change this adjacency matrix.

Taxi flow features

Use the taxi flow features generated from taxiFlow.py

A pairwise taxi flow count matrix should be maintained.