-
I'm trying to run the example in `inference/notebooks/01_single_image_understanding.ipynb`, but I get this error:
```
text = processor.apply_chat_template(messages, add_generation_prompt=True)
…
```
-
### System Info
- `transformers` version: 4.45.0.dev0
- Platform: macOS-14.6.1-arm64-arm-64bit
- Python version: 3.12.4
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4.5
- Acceler…
-
- [ ] OpenAI
- [ ] Anthropic
- [ ] Groq
- [ ] Cohere
- [ ] Llama somehow (Ollama & Groq are fine)
-
### 📚 The doc issue
I am new to TorchServe and was looking for some features I need before I can consider using TorchServe for LLM text generation.
Today, there are a couple of inference servin…
-
### System Info
```shell
AWS EC2 instance: trn1.32xlarge
OS: Ubuntu 22.04.4 LTS
Platform:
- Platform: Linux-6.5.0-1023-aws-x86_64-with-glibc2.35
- Python version: 3.10.12
Python packages:
…
```
-
I have been trying to fix this error for a while now, and the existing threads are of NO help.
I have checked these (and ALL issues on the HF community page for this model):
* https://github.com/Qwe…
-
When using Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4, I hit the error: RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::BFloat16
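This error means a matmul somewhere in the model receives a float32 tensor on one side and a bfloat16 tensor on the other. A minimal, model-free sketch of the underlying failure and the usual fix (casting one operand to a common dtype), using plain torch tensors rather than the actual Qwen2-VL weights:

```python
import torch

# Mixed-dtype matmul: this reproduces the "expected mat1 and mat2 to have
# the same dtype" RuntimeError in isolation (no model weights needed).
a = torch.randn(2, 3)                        # float32 activations
b = torch.randn(3, 4, dtype=torch.bfloat16)  # bfloat16 weights

try:
    a @ b
except RuntimeError as e:
    print(f"mismatch: {e}")

# The usual workaround: bring both operands to one dtype before the matmul.
# For a full model, the equivalent is loading every component with a
# consistent torch_dtype (e.g. torch.float16 or torch.bfloat16) instead of
# letting one layer fall back to float32.
out = a @ b.to(a.dtype)
print(out.shape)  # torch.Size([2, 4])
```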
-
Hi there,
First, thank you for Unsloth, it's great!
I've finetuned a llama-3-8b-Instruct-bnb-4bit model and pushed it to the HF Hub. When I try to deploy it using [hf Inference Endpoints](https://huggingfa…
-
### Describe the bug
Intel Mac.
```
tts = TTS(model_name='multi-dataset/xtts_v2/en', progress_bar=True).to('mps')
tts.tts_to_file(text='testing', file_path='out.wav', speaker='Craig Gutsy', language=…
```
-
### System Info
Command Causing Issue:
```
model=microsoft/Phi-3-mini-4k-instruct
volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
docker run -…
```
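For context, the `docker run` line above is truncated; the full invocation from the TGI quickstart looks roughly like the following. The image tag, port mapping, and `--shm-size` value here are assumptions taken from the docs and should be adjusted to the version actually installed:

```shell
model=microsoft/Phi-3-mini-4k-instruct
volume=$PWD/data  # share a volume with the Docker container to avoid downloading weights every run

# Flags follow the TGI quickstart; the image tag (2.0) is an assumption.
docker run --gpus all --shm-size 1g -p 8080:80 \
    -v $volume:/data \
    ghcr.io/huggingface/text-generation-inference:2.0 \
    --model-id $model
```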