-
### Describe the bug
```py
hidden_states = F.scaled_dot_product_attention(
    query, key, value, dropout_p=0.0, scale=attn.scale, is_causal=False
)
```
### Reproduction
…
-
I want to quantize a model from [open-flamingo](https://github.com/mlfoundations/open_flamingo) or https://github.com/open-mmlab/Multimodal-GPT (open-flamingo v1) before LoRA training,
https://github…
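What I have in mind is roughly the sketch below. It assumes the checkpoint can be loaded through `transformers` with `bitsandbytes` 4-bit quantization and then wrapped with `peft` LoRA; the model id and `target_modules` are placeholders, not verified against the open-flamingo code.
```py
# Hypothetical sketch: 4-bit quantization with bitsandbytes, then LoRA adapters via peft.
# The model id and target_modules below are placeholders, not verified for open-flamingo.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-flamingo-checkpoint",  # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)

# Cast norms/embeddings and enable gradient checkpointing for k-bit training.
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # placeholder: depends on the model's module names
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```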
-
1. I use the transformer model in models/official/nlp/transformer/transformer.py to train a seq2seq model, replacing the built-in Keras attention with Performer fast-attention (the TensorFlow version), fo…
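For context, Performer fast-attention replaces the softmax attention matrix with a random-feature approximation so attention runs in linear time in the sequence length. Below is a minimal PyTorch sketch of that idea (positive random features, non-causal); it is only an illustration of the mechanism, not the TensorFlow `fast_attention` code actually used here.
```py
# Minimal sketch of Performer-style FAVOR+ attention (non-causal), written in PyTorch
# purely to illustrate the idea; the real implementation is the Performer repo's
# TensorFlow fast_attention module.
import torch

def positive_random_features(x, projection):
    # x: (batch, seq, dim); projection: (dim, num_features)
    # phi(x) = exp(x @ w - ||x||^2 / 2) / sqrt(m), the positive feature map from the paper.
    m = projection.shape[1]
    x = x / x.shape[-1] ** 0.25           # fold the 1/sqrt(d) softmax scaling into q and k
    proj = x @ projection                  # (batch, seq, num_features)
    sq_norm = (x ** 2).sum(dim=-1, keepdim=True) / 2
    return torch.exp(proj - sq_norm) / m ** 0.5

def performer_attention(q, k, v, num_features=256):
    dim = q.shape[-1]
    projection = torch.randn(dim, num_features, device=q.device, dtype=q.dtype)
    q_prime = positive_random_features(q, projection)        # (b, n, m)
    k_prime = positive_random_features(k, projection)        # (b, n, m)
    kv = torch.einsum("bnm,bnd->bmd", k_prime, v)             # (b, m, d): aggregate keys/values once
    normalizer = q_prime @ k_prime.sum(dim=1).unsqueeze(-1)   # (b, n, 1)
    return torch.einsum("bnm,bmd->bnd", q_prime, kv) / normalizer.clamp(min=1e-6)

# Example: linear-time attention over a longer sequence.
q = torch.randn(2, 1024, 64)
k = torch.randn(2, 1024, 64)
v = torch.randn(2, 1024, 64)
out = performer_attention(q, k, v)   # (2, 1024, 64)
```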
-
-
In the latest commit, https://huggingface.co/mosaicml/mpt-7b/commit/67cf22a4e6809edb7308dd0a2ae2c1ffb86f4984, BigDL throws the error below when generating text.
INFO 2024-02-20 06:41:05,962 proxy 172.17.0.2 …
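A possible workaround (my assumption, not something from the BigDL docs) is to pin the checkpoint to a revision from before that commit when loading it:
```py
# Hypothetical workaround: pin mpt-7b to a revision before the breaking commit when loading.
# The revision value below is a placeholder for whichever commit last worked; I have not
# verified which one that is.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mosaicml/mpt-7b"
revision = "<last-known-good-commit>"  # placeholder, not a real commit hash

tokenizer = AutoTokenizer.from_pretrained(model_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, revision=revision, trust_remote_code=True)
```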
-
## Context
We have an MPT MoD prefix-lm trained on llm-foundry and then exported to HuggingFace (via your scripts).
For some fine-tuning experiments with the HF model, I tried to set up dropout.
…
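A minimal sketch of what I mean by setting up dropout, assuming the exported config exposes `attn_config["attn_pdrop"]`, `resid_pdrop`, and `emb_pdrop` (names taken from the mosaicml/mpt-7b remote code, so an assumption for this particular export):
```py
# Sketch of enabling dropout on the exported HF MPT checkpoint; the config field names
# (attn_config["attn_pdrop"], resid_pdrop, emb_pdrop) are assumptions based on the
# mosaicml/mpt-7b remote code and may differ for this export.
from transformers import AutoConfig, AutoModelForCausalLM

config = AutoConfig.from_pretrained("path/to/exported-mpt", trust_remote_code=True)
config.attn_config["attn_pdrop"] = 0.1
config.resid_pdrop = 0.1
config.emb_pdrop = 0.1

model = AutoModelForCausalLM.from_pretrained(
    "path/to/exported-mpt", config=config, trust_remote_code=True
)
model.train()  # dropout is only active in training mode
```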
-
### System Info
Transformers version: 4.41.2
Platform: Ubuntu 22.04.4 LTS
Python: 3.10.14
### Who can help?
@younesbelkada @ArthurZucker
### Information
- [ ] The official example s…
-
- [x] Usage in the [decoder](https://github.com/emma-mens/transformers/blob/main/src/transformers/models/opt/modeling_opt.py#L316) layer and the corresponding `past_key_values` [usage](https://github.…
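For reference, the pattern in question is the standard incremental-decoding use of `past_key_values`: cache each layer's key/value states from one forward pass and feed only the new token on the next. A minimal sketch with an OPT checkpoint (the model id is just an example):
```py
# Minimal sketch of past_key_values reuse during incremental decoding with OPT;
# the checkpoint name is just an example.
import torch
from transformers import AutoTokenizer, OPTForCausalLM

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-125m")
model = OPTForCausalLM.from_pretrained("facebook/opt-125m").eval()

inputs = tokenizer("The capital of France is", return_tensors="pt")

with torch.no_grad():
    # First pass: full prompt, caching the per-layer key/value states.
    out = model(**inputs, use_cache=True)
    past_key_values = out.past_key_values
    next_token = out.logits[:, -1].argmax(dim=-1, keepdim=True)

    # Subsequent passes: feed only the new token plus the cache.
    for _ in range(5):
        out = model(input_ids=next_token, past_key_values=past_key_values, use_cache=True)
        past_key_values = out.past_key_values
        next_token = out.logits[:, -1].argmax(dim=-1, keepdim=True)
```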
-
Epoch [1/3]
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
File :21, in _fwd_kernel(Q, K, V,…
-
I am having a go at running inference and evaluation for this model and am running into a TypeError in `GPTLMHeadModel`:
```
In [1]: import torch
...: from transformers import AutoTokenizer
…