JamesYang007 / adelie

A fast and flexible Python package for solving group lasso and elastic net problems.
https://jamesyang007.github.io/adelie/
MIT License
15 stars 1 forks source link

scikit compatible learners #114

Open apoorvalal opened 1 month ago

apoorvalal commented 1 month ago

Hi James,

great work on this. Was wondering if you were planning to provide a scikit-learn style API for this package? Would largely involve writing init/fit/predict/score methods, which would then allow adelie to be used in a number of useful downstream tasks (e.g. what i'm interested in - nuisance function estimation for AIPW / Rlearner for ATE/CATE estimation in econml / doubleml). Celer is a closely related package that provides a scikit-style API.

Totally understand if this isn't a priority; I might take a crack at it and send you a PR in case you're open to contributions.

JamesYang007 commented 1 month ago

Hi Apoorva,

Thanks so much for your interest! Yes, that was on my list of TODOs but it is indeed a low priority right now. I'd be happy to accept any contributions. If you submit a PR with a sketch of what you're looking for, I can flesh out more of the detail if necessary.

darior commented 1 month ago

Hi James, thanks a lot for building and sharing this package! I work in genomics and found it useful for problems where other tools were prohibitively slow.

It took a bit of work to implement the training and evaluation workflow, but code examples examples were very helpful. I wanted to add a couple suggestions to improve the API and noticed this issue so I'll just comment here as they're related.

Couple suggestions below:

JamesYang007 commented 1 month ago

Hi Darek, thanks for your interest as well! I'm very glad to hear that adelie has been suitable for your purpose. Indeed, the initial motivation for my developing adelie was to run group lasso on GWAS datasets as no existing tools could support such a scale of data.

Curious to hear your thoughts!

darior commented 1 month ago

Hi James,