bellecarrell closed 5 years ago
going to verify functionality tomorrow
@abenton for the standard histogram, what should I use for the bins? If I let it set them automatically, it creates very non-uniform bins, as shown here. To implement the solution, I need to provide explicit bins. I think I picked the bins somewhat arbitrarily in an earlier iteration.
How about every 100 users to start out? Then decide how to aggregate further based on the distribution you observe.
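A minimal sketch of what fixed bins could look like, assuming "every 100 users" means bin edges at every 100 followers. The `follower_counts` array and `BIN_WIDTH` name are hypothetical, since the real data and code are not shown in this thread.

```python
import numpy as np

# Hypothetical follower counts; stand-in for the real per-user data.
follower_counts = np.array([3, 42, 99, 100, 150, 250, 980, 1020])

BIN_WIDTH = 100  # assumption: fixed-width bins of 100 followers

# Edges run from 0 to the first multiple of BIN_WIDTH above the maximum,
# so every value falls inside some bin.
upper = (follower_counts.max() // BIN_WIDTH + 1) * BIN_WIDTH
bin_edges = np.arange(0, upper + BIN_WIDTH, BIN_WIDTH)

# Passing explicit edges keeps the bins uniform instead of auto-selected.
counts, edges = np.histogram(follower_counts, bins=bin_edges)
```

Once the counts per bin are visible, sparse neighboring bins in the tail can be merged by passing a coarser `bin_edges` array.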
[x] ensure there are no duplicate users by deduplicating on user id instead of the JSON string
[x] implement np.cumsum
[x] update log follower percentiles post duplicate fix
[ ] update follower histogram post duplicate fix
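
A rough sketch of the first two checklist items, not the actual implementation. It assumes each record is a JSON line with hypothetical `id` and `followers_count` fields. Deduplicating on the raw JSON string misses duplicates whose text differs (for example, a different key order), while keying on the user id does not.

```python
import json
import numpy as np

# Hypothetical records: user 1 appears twice with different key order,
# so the two raw strings differ even though the user is the same.
raw_lines = [
    '{"id": 1, "followers_count": 10}',
    '{"followers_count": 10, "id": 1}',
    '{"id": 2, "followers_count": 250}',
    '{"id": 3, "followers_count": 120}',
]

# Keep the first record seen for each user id.
by_user_id = {}
for line in raw_lines:
    record = json.loads(line)
    by_user_id.setdefault(record["id"], record)

unique_followers = np.array([r["followers_count"] for r in by_user_id.values()])

# Histogram with fixed-width bins, then a running total across bins.
hist, _ = np.histogram(unique_followers, bins=np.arange(0, 400, 100))
cumulative = np.cumsum(hist)
```

Here `len(set(raw_lines))` is still 4, whereas `by_user_id` holds 3 users, which is why the percentiles and histogram need to be regenerated after the fix.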