-
At the moment we can split the data into train/test and folds by patientId, rowId or time.
It would be nice to have an explicit splitter where you can provide the rowIds for the test/train/folds. …
-
I've noticed that using pred_contribs to generate shap values takes significantly more gpu memory in XGBoost 2.1.1 vs 1.4.2.
This can lead to having issues with generating shap values, where no issu…
-
### Problem
Part of epic https://github.com/probabl-ai/skore/issues/544
Currently, we do not plot the train score, which is important to visualize to understand what is going on between each spl…
-
See https://stackoverflow.com/questions/79163528/multistate-survival-models-with-censored-and-tidymodels
This might end up being a parsnip issue but we should start here.
```
library(tidyverse…
-
Hi, I want to know the train/test split on the TikTok and UBCFashion dataset. Could you please provide how you split the academic public dataset into training and testing?
According to https://githu…
-
Unlike test datasets, XCL seems to have a different structure that causes an error when initializing BirdSetDataModule:
```
from birdset.datamodule import DatasetConfig
from birdset.datamodule.…
-
**Describe the issue**:
API Documentation of dask train_test_split states that blockwise=False is supported for Arrays:
"For Dask Arrays, set blockwise=False to shuffle data between blocks as wel…
-
The `train_test_split` feature is not currently supported within the `load_mnist_dataset`. The user will have to call the split before calling the wrapper function.
It will be a better design to i…
-
hello,
thanks a lot for very interesting work.
I was wondering, that you have given train and val split but you haven't given the CityRefer_test.json split. can you please provide it?
thanks
-
from sklearn.model_selection import train_text_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2)
--
from sklearn,linear_model import LinearRegressin
lin_model = Li…