-
**Describe the bug**
Running the most recent version of the T5 pretraining script out of the box raises a Value Error, particularly in the following line:
```
[rank0]: File "/home/miniconda3/lib/…
-
In `nmdc-schema` v9.4.0 and later, the `Migrator`s have the potential to create, delete, and rename collections.
Currently, the migration notebooks (effectively, ETL scripts) in `nmdc-runtime` do n…
-
### Describe the bug
When implementing the PixArtAlphaPipeline, one step of inference was bound to the DMD, which is inappropriate. This resulted in errors in other one-step inference codes based o…
-
### System Info
- `transformers` version: 4.44.0
- Platform: Linux-5.4.0-196-generic-x86_64-with-glibc2.31
- Python version: 3.12.0
- Huggingface_hub version: 0.23.4
- Safetensors version: 0.4.…
-
I know you call them transformers but for some reason in my mind they just seem closer to data generators than something that transforms :)
Anyway, I am working on a table that has the address brok…
-
### System Info
- `transformers` version: 4.36.2
- Platform: Linux-3.10.0-1160.59.1.el7.x86_64-x86_64-with-glibc2.17
- Python version: 3.9.19
- Huggingface_hub version: 0.24.6
- Safetensors versi…
-
Hello AnFreTh,
Thank you for your work on this project. I am currently using Mambular to process tabular data, but I am experiencing very slow training speeds. On average, each epoch is taking arou…
-
## Abstract
- Propose `Average Attention Network` module that serves as decoder for Transformer. Decoding speed improves x3~4 while preserving translation performance.
- Empirical evidence shown in …
-
Hi I downloaded 3 models from civitai, and none of them work, I don't know what am I doing wrong
Specs: M1 macbook air
Sonoma 14.5
Xcode 15.4
```
Starting python converter
scikit-learn ver…
-
Thank you for your outstanding work, which has allowed me to quickly start my fine-tuning process. However, I have the following two questions:
1. In the LoRA fine-tuning of the LLaVA series, most …