-
Hey!
Thanks for the great project and for sharing it with the community.
I am trying to inference with the HF [neural-chat](https://huggingface.co/Intel/neural-chat-7b-v3-1) model.
## What I t…
dopc updated
9 months ago
-
**Is your feature request related to a problem? Please describe.**
The chat title seems to be generated withe the same model that was used in the a chat, which can be unnecessarily slow when chatting…
-
### System Info
I have downloaded the setup from "https://gpt4all.io/installers/gpt4all-installer-win64.exe" and installed it as requested. However, after running the GPT4ALL from start menu, nothing…
-
I was trying to run this demo code:
```
from transformers import AutoTokenizer, TextStreamer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM
model_name = "Intel/neur…
-
This is how I use the Dynamic Generator as a class object and the server wrapped in the flask.
But the responses get mixed when I send 2 requests simultaneously (mimicking client).
Note: The m…
-
### Issue Summary
While playing on monifactory 0.9.6 with fluxnetworks game crashed while shift clicking copper upgrades from functional storage to AE2 network and corrupted world
not enough crashes…
-
Here is the crash report
[https://pastebin.com/VTD7wrm1](url)
-
- [ ] Create philosophical shorts for why LLM may actually "understand"
- [ ] Create a weekly target
- [ ] Reflect on how I would trickle from year to daily vision
- [ ] Create gigs on fastwork
- [ ] …
-
### System Info
- `transformers` version: 4.39.0.dev0
- Platform: Linux-5.15.0-89-generic-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.20.1
- Safetensors version…
-
This RFC is to propose a Hugging Face-compatible yet flexible Weight Only Quantization (WOQ) format in INC, and then the model quantized by INC can be loaded by IPEX for further inference optimization…