-
Collecting pygemma==0.1.3
Using cached pygemma-0.1.3.tar.gz (4.0 kB)
Installing build dependencies ... done
Getting requirements to build wheel ... done
Preparing metadata (pyproject.toml)…
-
### Bug description
For some reason, the tensor parallel implementation generates non-sensical outputs
```
⚡ python-api-tensor-parallel ~/litgpt litgpt generate_tp checkpoints/microsoft/phi-2
…
rasbt updated
2 months ago
-
**Authors:** @keijibranshi @rosbo @mrisdal @neshdev [at.bflynn](mailto:bflynn@kaggle.com)
# Summary
This RFC proposes extending torchtune to support loading pre-trained and fine-tuned model weig…
-
File "torch/_dynamo/variables/constant.py", line 176, in call_method
return ConstantVariable.create(op(self.value, add_target))
torch._dynamo.exc.InternalTorchDynamoError: 'bool' object is n…
-
I checked [https://huggingface.co/fwtan/llama-1.1b-mobilequant-w4a8-s1024-e60-sym-hf](https://huggingface.co/fwtan/llama-1.1b-mobilequant-w4a8-s1024-e60-sym-hf), and found the config.json in this repo…
-
Bumps the pip group in /python with 3 updates: [transformers](https://github.com/huggingface/transformers), [pillow](https://github.com/python-pillow/Pillow) and [pymongo](https://github.com/mongodb/m…
-
### Your current environment
- vLLM CPU : v0.6.0
- Hardware: Intel(R) Xeon(R) Platinum 8480+ CPU
- Model: google/gemma-2-2b
### 🐛 Describe the bug
vLLM v0.6.0 (cpu) is throwing below erro…
-
## 🚀 Feature
Add support for arm64 Windows/Linux for the Qualcomm Snapdragon XElite PCs
## Motivation
These Copilot+ PCs are now available widely all over the world. It has the potential of …
-
### Summary
- Provide k-quant models
- Maintain existing gguf models
- Embedding models
- [x] [second-state/Nomic-embed-text-v1.5-Embedding-GGUF](https://huggingface.co/second-state/Nomic-…
-
**Describe the bug**
Hi
I am trying sampler example here https://keras.io/examples/generative/text_generation_gpt/ in Gemma
the preprocessor in Gemma return dictionary of token_ids and padding…