miladrezazadeh / twitter_depression_detection

This app is able to detect depressive characteristics in a given tweets and the classify it into depressive and Non-depressive.
MIT License
7 stars 0 forks source link

Code Review - Round 2 #2

Closed rfazeli closed 3 years ago

rfazeli commented 3 years ago

Great work so far!

Some comments/feedback on your data_cleaning.ipynb notebook:

  1. Make sure function names are consistent for example getAdjectives() should be modified to get_adjectives()
  2. You should save your processed/cleaned data to the data/processed/ directory
  3. Then split your notebook into multiple notebooks
    1. data_cleaning.ipynb -> To clean the data and save the final processed dataset to disk
    2. modeling.ipynb -> Load the data from disk and build models.
  4. It might be good to look at the precision vs recall curve as well as the ROC curve. You might not see a huge difference since your dataset is already balanced. But there's no harm in looking at that too.
  5. You can also save the word cloud to an img/ directory
  6. Make sure you push the .py version of the code as well.
  7. See if you can find any patterns in the False Positive and False Negative examples. Are there any patterns that the model consistently gets wrong?
  8. Have you tried playing around with your model to see what it can or cannot predict correctly. For example, type a few sentences you expect the model to predict as depressive or non-depressive and see what the model predictions are. And see if you can observe any patterns.