cnellington / Contextualized

An SKLearn-style toolbox for estimating and analyzing models, distributions, and functions with context-specific parameters.
http://contextualized.ml/
GNU General Public License v3.0
65 stars 9 forks source link

Dataset testsplit #5

Closed blengerich closed 2 years ago

blengerich commented 2 years ago

2 issues with the testsplit parameter:

1) It is difficult to set for large datasets when we can only hold a very small fraction of data in memory (<0.0001). 2) All test samples currently come from the same cells.