neurodata / ProgLearn

NeuroData's package for exploring and using progressive learning algorithms
https://proglearn.neurodata.io
Other
35 stars 42 forks source link

support streaming (and possibly parallel) decision tree option #34

Open jovo opened 4 years ago

jovo commented 4 years ago

i think probably the best implementation is from this: https://github.com/huawei-noah/streamDM but it is spark. it is based on this paper: https://dl.acm.org/doi/10.1145/347090.347107

also another implementation: https://github.com/soundcloud/spdt

jovo commented 4 years ago

some additional relevant papers:

finally, note that sklearn does not currently support this functionality: https://scikit-learn.org/stable/modules/computing.html#incremental-learning

PSSF23 commented 4 years ago

Interested in this issue for Sprint 1. Proposed DoD: Allow transformers to use streaming data for training.

KevinWang905 commented 3 years ago

Could I please be assigned to this issue? Thanks in advance.

nhahn7 commented 3 years ago

Could I please be assigned to this issue as well?