-
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…
-
通过CSD过滤我获取到了较高的精准度,但召回率仅有56%,即使无过滤召回率也只有65%,请问有进一步提高召回率的思路吗,比如更换bert模型哈工大的Chinese-BERT-wwm和最新albert模型,增加更丰富的训练数据,这样可行吗?
-
📚 This guide explains how to properly use **multiple** GPUs to train a dataset with YOLOv5 🚀 on single or multiple machine(s). UPDATED 25 December 2022.
## Before You Start
Clone repo and instal…
-
Can consider the following settings:
- [ ] standad classification
- [ ] data labeled by multiple annotators
- [ ] multi-label classification
For extending `calibrate_pred_probs()` to multi-label c…
-
## 🚀 Feature
Right now, `module.load_state_dict(strict=False)` allows the following:
* loading a dict with missing parameters
* loading a dict with more parameters than needed
And it returns an …
-
**Describe the bug**
tokenizer map in `hf_decoder_model` use multi `preprocessing_num_workers` will return `TypeError: cannot pickle 'torch._C._distributed_c10d.ProcessGroup' object`
**To Reprodu…
-
**Environment:**
1. Framework: Keras
2. Framework version: 2.2.4
3. Horovod version: 0.16.4
4. MPI version: mpirun (Open MPI) 3.1.0
5. CUDA version: 9.0.176
6. NCCL version: 2.3.5
7. Python ver…
-
Hey! I'm trying to train on multiple GPUs and consistently getting the following RuntimeError. Here's the modified line in `tutorial_train.py`:
```
trainer = pl.Trainer(gpus=8, precision=32, call…
-
### 问题描述:
nvidia v100单卡32G显存 lora训练Baichuan2-7B-Base报OOM异常:torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.92 GiB (GPU 0; 31.74 GiB total capacity; 30.00 GiB already allocated; 3…