-
The efforts devoted to this line seem to slow down in the past several months.
(Regularized) empirical risk minimization standards squarely at the center of machine learning, and there's been extensi…
-
Is there any work on adding multi-class classification capabilities? Maybe we could start something with gbm.
-
see discussion in https://groups.google.com/d/msg/pystatsmodels/WtqcXF3KtJI/wVH4HrHtHVMJ
target: get equivalent to Stata's `teffects`
## References
mix between statistics, econometrics and Stata (…
-
EPA seems like our best threshold material to consider and see if our model is doing well or poorly.
We can consider effects of penalties on EPA and see if there's interaction by team?
We have e…
-
Hi Will,
Do you have any suggestions on how to choose the number of principal components in GLM-PCA? Is there a way to quantify the contributions of each PC similar to the proportion of varia…
-
## Doing
- [x] Clean Data Thinking Zulip chat data, located at https://github.com/onefact/datathinking.org-codespace/blob/main/data/datathinking.zulipchat.com/raw/messages-000001.json - put it in a `…
-
I am very happy to that see someone implementing adaptive Lasso in Python (#169)! It would be great if celer also implemented the more general LLA algorithm for any folded concave penalty e.g. see [On…
-
The goal of this assignment was to introduce you to 2 main concepts in Machine Learning: Data Pre-processing, and Classification. You learned how to query and clean data using the pandas library in Py…
-
It seems to me that your MiceImputer only uses complete cases to train the SingleImputers but from what I red about MICE imputation it should not be the case:
>Step 1: A simple imputation, such as …
-
First of all: thanks for this great package\! Since `RTextTools` was recently removed from CRAN I was trying to find a good solution for SML on text data in `R` and was a bit frustrated by `caret` whi…