-
### Expected Behavior
-
### Actual Behavior
![image](https://github.com/user-attachments/assets/1f9608dc-4631-41c3-bd2a-bfe506d39104)
SD1.5 and Flux work fine; the problem occurs only with SDXL.
Co…
-
### System Info
- `transformers` version: 4.46.1
- Platform: macOS-15.1-arm64-arm-64bit
- Python version: 3.11.10
- Huggingface_hub version: 0.26.2
- Safetensors version: 0.4.5
- Accelerate ve…
-
### 🐛 Describe the bug
Getting `ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)` when running inference with a model loaded via HF `from_pretrained()` with `device_map="auto"`.
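For context, a minimal sketch of the loading path in question (the checkpoint id and prompt are placeholders, not taken from the report): printing `hf_device_map` after loading shows whether Accelerate offloaded some modules to CPU or disk, which is one common way a Triton kernel ends up receiving a CPU tensor pointer.

```python
# Minimal sketch: load with device_map="auto" and inspect where Accelerate
# actually placed each submodule. Entries mapped to "cpu" or "disk" would
# explain a Triton kernel being handed a CPU tensor.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",          # lets Accelerate shard/offload across GPU, CPU, disk
)

print(model.hf_device_map)       # shows which submodules ended up on cpu vs cuda

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=8)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```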
### …
-
### Your question
Error while loading checkpoint: !!! Exception during processing !!! 'tokenizers.AddedToken' object has no attribute 'special'
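As a side note, a small hedged check that would be consistent with this AttributeError coming from a stale `tokenizers` install (the token string below is a placeholder):

```python
# Hedged diagnostic: report the installed tokenizers version and whether
# AddedToken exposes a `special` attribute on this install.
import tokenizers
from tokenizers import AddedToken

print("tokenizers version:", tokenizers.__version__)
tok = AddedToken("<my_token>")               # placeholder token content
print("has .special attribute:", hasattr(tok, "special"))
```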
### Logs
```powershell
got prompt
model weight dtype…
-
With the latest torch (2.4) and iree-turbine, we are seeing this MLIR verification failure for many of our models during the export stage (`aot.export`).
Instructions to reproduce this error…
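For readers unfamiliar with the export stage mentioned above, here is a hedged sketch of a typical `aot.export` call; the import path and the toy module are assumptions for illustration, not one of the failing models.

```python
# Toy aot.export sketch (import path assumed from recent iree-turbine releases).
# The MLIR verification error surfaces when the exported program is built and
# verified, before any compilation step.
import torch
import iree.turbine.aot as aot

class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(16, 4)

    def forward(self, x):
        return torch.relu(self.linear(x))

model = TinyModel().eval()
example_input = torch.randn(1, 16)

exported = aot.export(model, example_input)  # AOT export -> MLIR module
exported.print_readable()                    # dump the generated MLIR
```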
-
When calculating the total steps, shouldn't we use `number of batches * epoch size`? In this case, it would be `self.total_steps = (len(train_loader.dataset) // tb_size) * ab_size` instead of `self.t…
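To make the quantities concrete, here is a hedged worked example with illustrative numbers; the `tb_size`/`ab_size` definitions below are assumptions mirroring common Lightning fine-tuning snippets, not necessarily the exact code being discussed.

```python
# Illustrative numbers only.
# Assumed definitions:
#   tb_size = per-device batch size * number of devices   (effective batch size)
#   ab_size = accumulate_grad_batches * max_epochs
dataset_len = 10_000              # len(train_loader.dataset)
train_batch_size = 32
num_devices = 2
accumulate_grad_batches = 4
max_epochs = 3

tb_size = train_batch_size * max(1, num_devices)          # 64
ab_size = accumulate_grad_batches * float(max_epochs)     # 12.0

batches_per_epoch = dataset_len // tb_size                # 156
proposed_total_steps = batches_per_epoch * ab_size        # 1872.0 (the formula from the question)

# For comparison: the number of optimizer updates actually taken, if one update
# happens every `accumulate_grad_batches` batches in every epoch.
optimizer_updates = batches_per_epoch // accumulate_grad_batches * max_epochs  # 117

print(tb_size, batches_per_epoch, proposed_total_steps, optimizer_updates)
```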
-
``` python
# sagemaker config
instance_type = "ml.g4dn.xlarge"
number_of_gpu = 1
health_check_timeout = 300
config = {
"HF_MODEL_ID": "unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit", # mode…
-
Distributed environment: FSDP Backend: nccl
Num processes: 4
Process index: 3
Local process index: 3
Device: cuda:0
Mixed precision type: bf16
Distributed environment: FSDP Backend: nccl
…
-
### System Info
The current Transformers framework doesn't support GGUF-quantized model files from deepseek2. Can you please advise when this support might be added? @SunMarc @MekkCyber
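For reference, this is how GGUF checkpoints are loaded for architectures `transformers` already supports; the repo id and filename below are placeholders, not a deepseek2 file.

```python
# GGUF loading in transformers: the GGUF weights are dequantized into a
# regular torch model (placeholder repo id and filename).
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF"   # placeholder GGUF repo
gguf_file = "tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf"    # placeholder quantized file

tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=gguf_file)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=gguf_file)
```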
###…
-
### Problem Description
Hi,
When doing text generation with Mistral 7B using Hugging Face transformers on an MI100 GPU, I can see in the collected torch trace that a lot of time is wasted due to a hipMem…
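For reference, a hedged sketch of how such a trace is typically collected (placeholder checkpoint and prompt; on ROCm builds of PyTorch, `ProfilerActivity.CUDA` also covers HIP runtime activity):

```python
# Collect a generation trace and inspect it in chrome://tracing / Perfetto.
import torch
from torch.profiler import profile, ProfilerActivity
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"   # placeholder model id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16).to("cuda")

inputs = tokenizer("Hello, my name is", return_tensors="pt").to("cuda")

with profile(activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA]) as prof:
    model.generate(**inputs, max_new_tokens=32)

prof.export_chrome_trace("mistral_generate_trace.json")
print(prof.key_averages().table(sort_by="self_cuda_time_total", row_limit=20))
```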