rasbt / mlxtend

A library of extension and helper modules for Python's data analysis and machine learning libraries.
https://rasbt.github.io/mlxtend/

Improve Apriori efficiency #899

Open rasbt opened 2 years ago

rasbt commented 2 years ago

This is an issue to revive the awesome work on the apriori algorithm in #646. This issue is partly to bump this up before the code in master further diverges from this excellent PR.

Nachiket18 commented 2 weeks ago

Since frequent itemset mining is computationally intensive, could we use Cython to speed up the code?

rasbt commented 2 weeks ago

You could do that, but I think reviving the PR linked above would be the recommended first step; someone can always add Cython on top of it afterward.

Nachiket18 commented 1 week ago

Okay, that sounds good. Should we go through research papers on improvements to the Apriori algorithm, along with the original, as mentioned in the PR?

@UltraArceus3 @ankithn30 @RonakGSahu would be interested in participating.

rasbt commented 1 week ago

Good question (and sorry about the late response; I've been both moving and prepping a conference workshop). I would say that the improvement in the PR is the highest-impact one, because the original apriori I implemented is a bit inefficient, and the "real" one from the PR should be much faster.
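
For anyone new to the thread, here is a rough sketch of what the classic Apriori candidate step looks like: join frequent (k-1)-itemsets that share a prefix, then prune any candidate with an infrequent subset. This is not the code from the PR and not mlxtend's API; the function names `generate_candidates` and `apriori_sketch` are hypothetical, and the sketch assumes transactions are plain lists of sortable, hashable items.

```python
from itertools import combinations


def generate_candidates(frequent_prev):
    """Join (k-1)-itemsets sharing a (k-2)-prefix, then prune (hypothetical helper)."""
    prev = sorted(frequent_prev)  # each itemset is a sorted tuple of items
    prev_set = set(prev)
    candidates = []
    for i, a in enumerate(prev):
        for b in prev[i + 1:]:
            if a[:-1] != b[:-1]:
                break  # sorted order: no later itemset shares this prefix
            cand = a + (b[-1],)
            # Prune step: every (k-1)-subset of the candidate must be frequent.
            if all(sub in prev_set for sub in combinations(cand, len(cand) - 1)):
                candidates.append(cand)
    return candidates


def apriori_sketch(transactions, min_support):
    """Return {itemset tuple: support} for all itemsets with support >= min_support."""
    n = len(transactions)
    transactions = [frozenset(t) for t in transactions]
    items = sorted({item for t in transactions for item in t})
    current = [(item,) for item in items]
    result = {}
    while current:
        # Count only the surviving candidates, not every possible combination.
        counts = {c: sum(1 for t in transactions if t.issuperset(c)) for c in current}
        frequent = [c for c, cnt in counts.items() if cnt / n >= min_support]
        for c in frequent:
            result[c] = counts[c] / n
        current = generate_candidates(frequent)
    return result


baskets = [
    ["bread", "milk"],
    ["bread", "diapers", "beer"],
    ["milk", "diapers", "beer"],
    ["bread", "milk", "diapers"],
    ["bread", "milk", "beer"],
]
itemsets = apriori_sketch(baskets, min_support=0.6)
```

The point of the prune step is that support counting, the expensive part, only runs on candidates whose subsets are all known to be frequent.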