taishan1994 / BERT-BILSTM-CRF

使用BERT-BILSTM-CRF进行中文命名实体识别。
291 stars 32 forks source link

训练时报错 #6

Open goldeneave opened 11 months ago

goldeneave commented 11 months ago

大佬好,我在准备好自己的数据做微调时,数据报错:

` Traceback (most recent call last): File "main.py", line 190, in main(data_name) File "main.py", line 182, in main train.train() File "main.py", line 45, in train for step, batch_data in enumerate(self.train_loader): File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 363, in next data = self._next_data() File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 989, in _next_data return self._process_data(data) File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1014, in _process_data data.reraise() File "/root/miniconda3/lib/python3.8/site-packages/torch/_utils.py", line 395, in reraise raise self.exc_type(msg) RuntimeError: Caught RuntimeError in DataLoader worker process 0. Original Traceback (most recent call last): File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/_utils/worker.py", line 185, in _worker_loop data = fetcher.fetch(index) File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/_utils/fetch.py", line 47, in fetch return self.collate_fn(data) File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/_utils/collate.py", line 74, in default_collate return {key: default_collate([d[key] for d in batch]) for key in elem} File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/_utils/collate.py", line 74, in return {key: default_collate([d[key] for d in batch]) for key in elem} File "/root/miniconda3/lib/python3.8/site-packages/torch/utils/data/_utils/collate.py", line 55, in default_collate return torch.stack(batch, 0, out=out) RuntimeError: stack expects each tensor to be equal size, but got [512] at entry 0 and [511] at entry 2

` 所有的requirements都按照md文件里的版本安装,请问还有哪里的代码需要调整吗,谢谢!

taishan1994 commented 11 months ago

有一条样本的最大长度为511,和512不匹配。

goldeneave commented 11 months ago

西西嘛呦 @.***>于2023年8月3日 周四16:42写道:

有一条样本的最大长度为511,和512不匹配。

— Reply to this email directly, view it on GitHub https://github.com/taishan1994/BERT-BILSTM-CRF/issues/6#issuecomment-1663543021, or unsubscribe https://github.com/notifications/unsubscribe-auth/ANN257Y2XKRIYESPYG47PELXTNP7LANCNFSM6AAAAAA3CM4A2U . You are receiving this because you authored the thread.Message ID: @.***>

感谢大佬回复 请问有什么解决办法吗 或者在哪里修改呢

taishan1994 commented 11 months ago

data_loader.py里面进行修改