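I have very few negative samples (only about 150 dialogues, each no longer than 100 characters, mixing Chinese and English).
After preprocessing, running train.py fails with the error below. Could someone help? Where does the problem come from?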
Traceback (most recent call last):
  File "train.py", line 427, in <module>
    main()
  File "train.py", line 423, in main
    train(model, logger, train_dataset, validate_dataset, args)
  File "train.py", line 268, in train
    train_dataloader = DataLoader(
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 351, in __init__
    sampler = RandomSampler(dataset, generator=generator) # type: ignore[arg-type]
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/sampler.py", line 107, in __init__
    raise ValueError("num_samples should be a positive integer "
ValueError: num_samples should be a positive integer value, but got num_samples=0
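
For reference, the traceback shows RandomSampler being built from a dataset whose length is 0, so the dataset handed to DataLoader appears to be empty after preprocessing. One way to narrow this down is to check the length before the DataLoader is created. Below is a minimal sketch, assuming the dataset object supports len(); the helper name require_nonempty is hypothetical and is not part of train.py or PyTorch.

```python
def require_nonempty(dataset, name="dataset"):
    """Return len(dataset), or raise a descriptive error if it is empty.

    Hypothetical helper for debugging: it only assumes that `dataset`
    implements __len__, as map-style PyTorch datasets do.
    """
    size = len(dataset)
    if size == 0:
        raise ValueError(
            f"{name} is empty (len == 0); check the preprocessing output "
            "and any train/validation split before building a DataLoader"
        )
    return size


# Assumed usage inside train(), just before the DataLoader is built:
#     require_nonempty(train_dataset, "train_dataset")
#     require_nonempty(validate_dataset, "validate_dataset")
```

If the check fails for train_dataset, the problem is upstream of the DataLoader call: for example, the preprocessed file may be empty, or the split may leave no training samples. Both are guesses that would need to be confirmed against the actual preprocessing output.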