code-312 / rescue-chicago

Repository for work related to a interactive data dashboard that can be used to analyze how different dog characteristics may correlate with average length of stay in a shelter prior to adoption.
https://code312-rescue-trends-2659be78e6b4.herokuapp.com/
1 stars 0 forks source link

Remove Outliers #11

Closed kaylarobinson077 closed 1 year ago

kaylarobinson077 commented 2 years ago

There's some crazy data that we ingest from PetFinder - like dogs that have apparently been up for adoption for 10+ years. We'll want to remove any obviously wrong data from our database, so that it doesn't lead to misleading conclusion in the dashboards.

Some ideas for analysis to help inform outlier removal:

Jared-Kunhart commented 1 year ago

Data from more cities is being pulled in and the outliers are becoming more apparent. Currently on the Breed Trends by LoS out of 20 breeds - 7 of those breeds are under a count of 5.

Jared-Kunhart commented 1 year ago

Made a branch for this issue - 11-remove-outliers

Jared-Kunhart commented 1 year ago

Dynamic slider gives filter outliers to the user. In addition, Jhen has also filtered outliers in the model preprocessing.