Open exalate-issue-sync[bot] opened 1 year ago
Neeraja Madabhushi commented: We need a list of missing Rosettas for all clients.
J commented: More info for when there is time to work on this:
J commented: Preliminary list of languages:
Preliminary list of functions for Rosettas:
[~accountid:557058:3ae3c86a-e56a-4211-99d4-9a8cf5ab63f6], can you please review and update as needed? I'm sure there are other functions that should be included here, but I thought this would be a good starting point. Thanks!
Raymond Peck commented: The overarching goal is for this to be:
It should also link to the reference docs, where appropriate.
For "build model":
Add grid search.
Add data conversion (R data frame and data.table <-> H2O, Python Pandas / numpy / raw 2d arrays <-> H2O, RDD <-> H2O).
Import Files needs to cover both the single-file case and the directory case.
Split frame should cover both a random (seeded) runif split and a row slice.
Add row and column slicing by name and index.
Add a user row weights example.
Add examples of handling unbalanced datasets, using both Balance Classes and user row weights.
Add examples of using GLRM and PCA to reduce dimensionality to feed into other model builders.
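As a rough illustration of the two split styles requested above (seeded random runif versus row slice), here is a minimal plain-Python sketch. It deliberately does not use the H2O API; the function names `runif_split` and `slice_split` and the 0.75 ratio are hypothetical, chosen only to show the difference in behavior that a Rosetta example would need to demonstrate.

```python
import random


def runif_split(rows, ratio=0.75, seed=42):
    """Random split: draw one uniform number per row and compare it to ratio.

    Illustrative stand-in for a seeded runif-style split, not H2O code.
    """
    rng = random.Random(seed)  # private generator, so the seed fully fixes the result
    train, test = [], []
    for row in rows:
        (train if rng.random() < ratio else test).append(row)
    return train, test


def slice_split(rows, ratio=0.75):
    """Row-slice split: the first `ratio` share of rows is train, the rest is test."""
    cut = int(len(rows) * ratio)
    return rows[:cut], rows[cut:]


rows = list(range(100))
train_r, test_r = runif_split(rows, ratio=0.75, seed=42)
train_s, test_s = slice_split(rows, ratio=0.75)
```

The random split yields only approximately the requested ratio but is reproducible for a fixed seed, while the row slice is exact and preserves row order, which matters for time-ordered data.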
JIRA Issue Migration Info
Jira Issue: PUBDEV-1898
Assignee: Joby Joy
Reporter: Raymond Peck
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A
Tiny examples in each language for each algo, for the website.