BaderLab / saber

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.
https://baderlab.github.io/saber/
MIT License
102 stars 17 forks source link

Better use of validation split #165

Closed JohnGiorgi closed 5 years ago

JohnGiorgi commented 5 years ago

Overview

Previously, only one of k_folds or validation_split could be used, with k_folds taking precedence over validation_split. Now, they can be used together.

Otherwise, provide the partitions yourself with the files train.*, valid.* and test.* under dataset_folder and leave k_folds and validation_split equal to 0.

E.g.

.
├── NCBI_Disease
│   └── train.tsv
│   └── valid.tsv
│   └── test.tsv

k_folds will be ignored if either a valid.* or test.* file is found under dataset_folder. Both arguments k_folds and validation_split will be ignored if a valid.* file is found under dataset_folder.

TODOs

Closes

Closes #154.