Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.
599
stars
262
forks
source link
Project: Classify Kaggle San Francisco Crime Description
Highlights:
- This is a multi-class text classification (sentence classification) problem.
- The goal of this project is to classify Kaggle San Francisco Crime Description into 39 classes.
- This model was built with CNN, RNN (LSTM and GRU) and Word Embeddings on Tensorflow.
- Input: Descript
- Output: Category
-
Examples:
Descript |
Category |
GRAND THEFT FROM LOCKED AUTO |
LARCENY/THEFT |
POSSESSION OF NARCOTICS PARAPHERNALIA |
DRUG/NARCOTIC |
AIDED CASE, MENTAL DISTURBED |
NON-CRIMINAL |
AGGRAVATED ASSAULT WITH BODILY FORCE |
ASSAULT |
ATTEMPTED ROBBERY ON THE STREET WITH A GUN |
ROBBERY |
Train:
- Command: python3 train.py train_data.file train_parameters.json
- Example:
python3 train.py ./data/train.csv.zip ./training_config.json
Predict:
- Command: python3 predict.py ./trained_results_dir/ new_data.csv
- Example:
python3 predict.py ./trained_results_1478563595/ ./data/small_samples.csv
Reference: