transformer-str Search Results

1000+ results
for transformer-str

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/transformers #33183

Doc bug: wrong token argument name for Tokenizer.from_pretra…

### System Info N/A for doc bug. ### Who can help? @stevhliu ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officially supported task in the…

rravenel updated 1 day ago
7
LaVieEnRose365/ReLLa #12

get_semantic_embed 出错

python get_semantic_embed.py --model_path ./Llama-2-7b-hf --dataset BookCrossing --pooling average --gpu_id 1 miniconda3/envs/rella/lib/python3.10/site-packages/transformers/configuration_utils.py:9…

lightningsoon updated 1 month ago
1
pytorch/torchtune #1790

python 3.9 gpu tests occasionally fail with "No module named…

CI on PRs occasionally fails with the following message: ``` FAILED tests/recipes/test_eleuther_eval.py::TestEleutherEval::test_torchtune_checkpoint_eval_results[truthfulqa_gen-0.1-1] - RuntimeEr…

RdoubleA updated 1 week ago
2
Nerogar/OneTrainer #457

[Feat]: LoRA - Advanced Layer Filter

### Describe your use-case. Flux has layers named single_transformer_blocks.* and transformer_blocks.*. If I want to train only the **transformer_blocks.*** layers but exclude **single_transformer…

bananasss00 updated 1 week ago
3
run-llama/llama_index #16262

[Question]: lightweight colbert rerank installation

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question At this moment llama-index-postprocessor-colbert-rerank import requires torch and his nv…

schiaro98 updated 3 weeks ago
3
sayakpaul/diffusers-torchao #37

qkv_fuse_projections() fails with torchao quantized Flux2DTr…

## Summary When calling `qkv_fuse_projections()` on an instance of `Flux2DTransformerModel` that was quantized with `torchao`'s `quantize_`, it fails with the following error: ``` File "/Users/s…

ngaloppo updated 1 week ago
5
souzatharsis/podcastfy #26

Implementing a different version of youtube_transcriber.py u…

## Inspiration So there is a gradio space [https://huggingface.co/spaces/hf-audio/whisper-large-v3](url) that uses whisper, from the hugging face api : ```python import spaces import torch …

240db updated 6 days ago
6
daskol/lotr #2

Please publish end-to-end application example

Dear all, It would be great to see an end-to-end practical example of LoTR. By "practical" I mean that one takes, for example some existing LLM weights file, compresses it into a smaller weights fi…

dmikushin updated 1 week ago
3
huggingface/diffusers #9343

FLUX error when loading with low_cpu_mem_usage=False and ign…

### Describe the bug I'd like to change the input layers of FLUX for training some img2img stuff, but got: `TypeError: expected str, bytes or os.PathLike object, not NoneType` when loading `FluxTra…

primecai updated 1 week ago
8
THUDM/GLM-4 #605

TypeError: ChatGLM4Tokenizer._pad() got an unexpected keywor…

### System Info / 系統信息 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.129.06 Driver Version: 470.129.06 CUDA Version: 12.4 | |-------------…

LolerPanda updated 4 hours ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for transformer-str

1000+ results
for transformer-str