-
### System Info
- **Hardware**: AWS g6.12xlarge (us-east-2) / 4x NVIDIA L4 GPU
- **OS**: Ubuntu 24.04 LTS (Noble Numbat)
- **NVIDIA Driver**: nvidia-open 560.28.03
- **CUDA**: 12.6
- **Docker**: …
-
### Model/Pipeline/Scheduler description
Currently, most existing camera motion control methods for video generation with denoising diffusion models rely on training a temporal camera module, and nec…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
llama_model_loader: loaded meta data with 32 key-value pairs and 219 tensors from /data/huggingface/hub/models--city96--t5-v1_1-xxl-encoder-gguf/snapshots/005a6ea51a7d0b84d677b3e633bb52a8c85a83d9/./t5…
-
This is a little bit of a plug, so I'll keep it short! I'm trying to nail down _**exactly** what's going on here_.
https://riprompt.com
https://riprompt.com/riprompt.txt
https://chatgpt.com/g/g-9…
-
https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/mingpt/model.py#L42
Why do we need an additional linear transformation after the MHA and before the MLP when the dim…
-
Thanks for sharing work for LLM quantization & onnx export.
I follow the script in '[Convert to onnx model](https://github.com/wejoncy/QLLM?tab=readme-ov-file#convert-to-onnx-model)' section, and g…
-
Thank you for the code! I've been using it as a reference for my own implementation. Have you replicated the results in the original blogpost..? Based on your update in the readme, it seems like you h…
-
Hi, I have trained a new model but meet errors when testing, I did it as:
1. train a model with:
```
accelerate launch --num_processes 2 --multi_gpu --mixed_precision "fp16" \
tutorial_train.py …
-
I noticed that CLIP is already present in the Hailo Model Zoo, which suggests that conversion is possible. [link](https://github.com/hailo-ai/hailo_model_zoo/blob/833ae6175c06dbd6c3fc8faeb23659c9efaa2…