Open K-Ellis opened 7 years ago
compared to when the uncleaned incident data is used:
Huh, any idea what's causing it?
While there is a pattern, might not be as bad as it first looks. Can we plot them with the same y axis? Uncleaned data has one outlier messing it up
Not sure.. I'll try deleting the outlier and plotting it again
Here's the uncleaned data:
And then the cleaned data:
There's definitely something strange going on.. They look pretty similar up to day ~105, then the variability decreases massively and the last 20 days of data go missing
If we think about the data we have, anything created recently (towards the end of quarter) might not yet be closed out. Is there a filter we are applying that could be causing it?
Maybe park it and test out when we get the next dataset to see if it disappears?
Take a look at the created_on_year plot for when the cleaned incident data is used in the seasonality study:
https://github.com/K-Ellis/Predicting-Transaction-Times/blob/master/5th%20Iteration/0.%20Results/Kieron/data_understanding/seasonality_study/2017.07.24/Created_On_Year.png