TestBetaGeo `setup_class` is too slow

pymc-labs / pymc-marketing

Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.

Apache License 2.0

614 stars 148 forks source link

I'm working on the ParetoNBD PR right now, and have added pytest fixtures for CDNOW_sample.csv and CDNOW_master.csv which could potentially resolve this issue. These CSVs contain 2,357 and 23,570 rows respectively, and any tests using them should probably be marked as @pytest.mark.slow.

My opinion on using CDNOW for testing has flip-flopped in recent weeks because even CDNOW_master is much smaller than many datasets encountered in practice, but these are real-world benchmarks used in many research papers, and will be useful for testing against lifetimes MLE convergence and reproducing research results.

pymc-labs / pymc-marketing

TestBetaGeo `setup_class` is too slow #172