-
### Version
1
### DataCap Applicant
Alan
### Project ID
12
### Data Owner Name
The National Oceanic and Atmospheric Administration
### Data Owner Country/Region
United States
### Data Owner …
-
I got this error:
```
The sum of train_size and test_size = 55757, should be smaller than the number of samples 55756. Reduce test_size and/or train_size.
Traceback (most recent call last):
File…
-
The current implementation of `rulefit` can sometimes produce redundant features that are then fed into the lasso. This comes from the stochastic nature of random trees and lack of rule pruning.
To…
-
- [x] rx1day (max 1day precip) annual summaries done incorrectly (sum of monthly was done but should have been max of monthly values to get 1 day max of the year)
fixed - replacement files : https://…
-
Hi there,
I am trying to load the downloaded Hubert Large with 960hr finetuning from here: https://github.com/pytorch/fairseq/tree/main/examples/hubert
I downloaded the model, stored the checkpo…
-
Dear authors,
I have to do 4 dimensions reweighting for 10M events using GBDT. With the CPU only, the process takes a full day.
My question is : Is it possible to write or is there already an in…
-
**Augmented Random Forest with Kernel Convolution**
For fast prototyping, a smooth and flexible representation of functions is essential. Traditional approaches using trees or forests for function …
-
## Background
I am trying to convert a python sklearn classifier model to onnx format to use in C#. I have made multiple different dev/test environments using windows and Mac, also used multiple diff…
-
### Describe the issue linked to the documentation
There's no where in the documentation that explains what method is used to identify which values to consider as candidate splits. For example, for r…
-
Bonjour à tous,
Après avoir discuté avec Alexandre et Olivier Goletti, nous pensons que la prochaine étape pour ce syllabus est d'homogénéiser son contenu. On vous propose donc d'établir des conven…