original-transformer Search Results

1000+ results
for original-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/Megatron-LM #1005

[BUG] T5 extended attention mask shape mismatch with transfo…

**Describe the bug** Running the most recent version of the T5 pretraining script out of the box raises a Value Error, particularly in the following line: ``` [rank0]: File "/home/miniconda3/lib/…

andrewvli updated 1 month ago
5
microbiomedata/nmdc-runtime #449

Migrations: Design algorithm for dumping/restoring collectio…

In `nmdc-schema` v9.4.0 and later, the `Migrator`s have the potential to create, delete, and rename collections. Currently, the migration notebooks (effectively, ETL scripts) in `nmdc-runtime` do n…

eecavanna updated 1 month ago
2
huggingface/diffusers #8689

A bug about one-step inference in PixArtAlphaPipeline

### Describe the bug When implementing the PixArtAlphaPipeline, one step of inference was bound to the DMD, which is inappropriate. This resulted in errors in other one-step inference codes based o…

Luo-Yihong updated 3 weeks ago
3
huggingface/transformers #33985

../aten/src/ATen/native/cuda/Indexing.cu:1289: indexSelectLa…

### System Info - `transformers` version: 4.44.0 - Platform: Linux-5.4.0-196-generic-x86_64-with-glibc2.31 - Python version: 3.12.0 - Huggingface_hub version: 0.23.4 - Safetensors version: 0.4.…

JHW5981 updated 3 days ago
7
GreenmaskIO/greenmask #144

Data Generator Request: RandomCity

I know you call them transformers but for some reason in my mind they just seem closer to data generators than something that transforms :) Anyway, I am working on a table that has the address brok…

jensenbox updated 1 month ago
4
huggingface/peft #2100

Questions about original_module and modules_to_save.default

### System Info - `transformers` version: 4.36.2 - Platform: Linux-3.10.0-1160.59.1.el7.x86_64-x86_64-with-glibc2.17 - Python version: 3.9.19 - Huggingface_hub version: 0.24.6 - Safetensors versi…

dengchengxifrank updated 2 weeks ago
1
basf/mamba-tabular #129

Improve Mamba Speed

Hello AnFreTh, Thank you for your work on this project. I am currently using Mambular to process tabular data, but I am experiencing very slow training speeds. On average, each epoch is taking arou…

Jasmine-ycj updated 3 days ago
7
kweonwooj/papers #108

Accelerating Neural Transformer via an Average Attention Net…

## Abstract - Propose `Average Attention Network` module that serves as decoder for Transformer. Decoding speed improves x3~4 while preserving translation performance. - Empirical evidence shown in …

kweonwooj updated 6 years ago
2
GuernikaCore/GuernikaModelConverter #6

Unable to convert safetensors

Hi I downloaded 3 models from civitai, and none of them work, I don't know what am I doing wrong Specs: M1 macbook air Sonoma 14.5 Xcode 15.4 ``` Starting python converter scikit-learn ver…

LeonSolisPedro updated 1 month ago
2
zjysteven/lmms-finetune #39

fine-tuning the mmprojector

Thank you for your outstanding work, which has allowed me to quickly start my fine-tuning process. However, I have the following two questions: 1. In the LoRA fine-tuning of the LLaVA series, most …

lxr-1204 updated 2 weeks ago
2

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for original-transformer

1000+ results
for original-transformer