Why do we have 4 splits train.tsv, train_dev.tsv, devel.tsv, test.tsv? And is the train_dev.tsv merged by train.tsv and devel.tsv? And in the run_ner.py for example, why the train_dataset is transformed from train_dev.txt and eval_dataset is transformed from eval.txt? Thanks.
Hello, I have the same question. I am curating my own dataset for training, and I was wondering if the train_dev files are merged versions of the train and devel splits.
Why do we have 4 splits train.tsv, train_dev.tsv, devel.tsv, test.tsv? And is the train_dev.tsv merged by train.tsv and devel.tsv? And in the run_ner.py for example, why the train_dataset is transformed from train_dev.txt and eval_dataset is transformed from eval.txt? Thanks.