-
## 🚀 Feature
### Motivation
Implements incremental block distributed data parallelism similar to https: ieeexplore.ieee.org/document/7472805.
This can mitigate the performance loss caused …
-
Hello!
Can you, please, provide the bash script for training Transformer-XL on PTB dataset with PyTorch?
Thanks!
-
When attempting to execute the `FastChat\scripts\train_vicuna_7b.sh` script, it raises an exception with the following error message:
```
File "/usr/local/lib/python3.10/dist-packages/transformer_…
-
**Describe the bug**
I ran the official tutorial code for onnx
[(https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/python/tools/transformers/notebooks/PyTorch_Bert-Squad_OnnxRuntime…
-
generating images for - the grand canyon with snow on it. snow located on the grand canyon. a snowy grand canyon.: 0% 0/1 [00:00
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
17 prefix_state_dict = torch.load(os.path.join(CHECKPOINT_PATH, "pytorch_model.bin"))
18 n…
-
我往lprnet网络结构的顶端添加了一个stn,那个空间变换网络,但是似乎这个新结构训练难度很大,loss不下降。有人知道怎么做吗?
i added a STN network(the spacial transforming network) to the top of LPRNet, but i find training this new structure quite difficu…
-
### Description:
I encountered an error while using torch2trt for converting a PyTorch model with Transformer operator. The error message is as follows:
```
Traceback (most recent call last):
..…
Thrsu updated
10 months ago
-
### System Info
- `transformers` version: 4.41.2
- Platform: Windows-10-10.0.19045-SP0
- Python version: 3.11.1
- Huggingface_hub version: 0.23.3
- Safetensors version: 0.4.3
- Accelerate versio…
-
Your work is outstanding, and I admire the efficiency achieved in your mamba implementation.
However, I’m concerned about its accessibility and broader adoption in comparison to transformer-based …