-
Hello,I reproduce this paper with the dataset in WMT. Then I got:
Training: Loss = 0.000887, Accuracy = 0.9981, Precision = 0.9998, Recall = 0.9889, F1 = 0.9943
Validing: Loss = 0.003075, Accur…
-
Can provide links of Albert's model and sentence word model ,Thks.
-
根据文档进行文本分类,https://github.com/PaddlePaddle/PaddleHub/tree/release/v2.1/demo/text_classification
环境:python3.7;win10;paddlepaddle=2.2.2;paddlehub=2.2.0
在原文档中使用的是name=ernie_tiny模型,想要替换为name=ernie,版本为2.…
-
When I use "BertForQuestionAnswering.from_pretrained" to import my pytorch_model.bin, there having a error "tarfile.ReadError: not a gzip file".So, what should I do to use my pytorch_model.bin? Thanks…
-
您好,我想问一下扩充的词表起到什么作用?
https://github.com/pengxiao-song/LaWGPT/blob/main/resources/legal_vocab.txt 存在重复token(比如`公正审判`,第968行和第4137行),与chinese-llama合并时需要先对自身去重才能正常合并。
```python
# Load custom vocabu…
-
please tell me, can i use Multilingual pretrain model from Bert to train custom data with albert code ???
-
While running train.py I encountered this error:
`Model name 'model/' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingua…
-
您好!我用resnet18提取了数据集的特征,但是不知道怎样训练,好像训练代码是针对于bottom-up-attention的。同时也不是很清楚如何用bottom-up-attention提出AI Challenger训练集的特征
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### 🐛 Describe the bug
(base) bob@test-ESC8000A-E11:~$ python…
-
想问下UER支持从断点加载模型重新训练吗? 我设置50w个step的训练, 因为服务器不稳定的原因训练有时会中断,所以我设置了断点保存,然后每次从新训练的时候,都是将pretrain.sh 中的 pretrained_model_path 路径重新设置为断点路径,然后再修改下要训练的step数目。 我的数据大概200w+,使用的是 bert-wwm, 进行mlm训练, batch_size 为 1…