-
### Priority
P1-Stopper
### OS type
Ubuntu
### Hardware type
Xeon-other (Please let us know in description)
### Installation method
- [ ] Pull docker images from hub.docker.com
- [X] Build dock…
-
### System Info
Hi there, I hit a bug when using TGI Gaudi 2.0.5 with both meta-llama/Meta-Llama-3-8B-Instruct and Intel/neural-chat-7b-v3-3: when I set the default frequency/repetition/presen…
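The report is cut off above, but for context, a minimal sketch of the kind of request involved, assuming a TGI server already listening on localhost:8080; the prompt and parameter values are placeholders, not the reporter's actual reproduction:
```python
import requests

# Hypothetical sketch: query a running TGI endpoint with explicit
# penalty parameters (values here are placeholders).
resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "What is deep learning?",
        "parameters": {
            "repetition_penalty": 1.0,
            "frequency_penalty": 0.0,
            "max_new_tokens": 64,
        },
    },
    timeout=60,
)
print(resp.json())
```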
-
In the inference script, the default weights of RS-LLaVA are as follows:
```
model_path = 'BigData-KSU/RS-llava-v1.5-7b-LoRA'
model_base = 'Intel/neural-chat-7b-v3-3'
```
However, these two models do not…
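RS-LLaVA ships its own loader, but as a general illustration of the base-plus-adapter relationship those two paths imply, here is a minimal peft sketch; it loads only the language model, not the vision tower, and is an assumption rather than the project's actual loading code:
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base LLM, then attach the LoRA adapter weights on top of it.
base = AutoModelForCausalLM.from_pretrained("Intel/neural-chat-7b-v3-3")
model = PeftModel.from_pretrained(base, "BigData-KSU/RS-llava-v1.5-7b-LoRA")
```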
-
Today, the Workers AI types rely heavily on function overloads to specify the arguments for different models. Unfortunately, this results in types that are very difficult to debug and a poor developer experience (DX).
As an example wit…
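The example itself is cut off above, but the overload-per-model pattern being criticized can be sketched language-neutrally; the actual Workers AI types are TypeScript, and the names below are hypothetical, not the real API:
```python
from typing import Literal, overload

# Hypothetical illustration of the pattern: one overload per model name.
# A mismatched call produces an error that enumerates every overload,
# which is what makes this style hard to debug at scale.
@overload
def run(model: Literal["@cf/text-generation-model"], prompt: str) -> str: ...
@overload
def run(model: Literal["@cf/embedding-model"], text: str) -> list[float]: ...
def run(model: str, *args, **kwargs):
    raise NotImplementedError
```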
-
### Summary
- Provide k-quant models
- Maintain existing gguf models
- Embedding models
- [x] [second-state/Nomic-embed-text-v1.5-Embedding-GGUF](https://huggingface.co/second-state/Nomic-…
-
The Intel GPU Flex 140 has two GPUs per card, with a total memory capacity of 12 GB (6 GB per GPU). Currently, I can run inference on only one GPU device, with limited memory. Could you please guide me on how to run…
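Not an answer from the maintainers, but a minimal device-enumeration sketch, assuming intel_extension_for_pytorch is installed so the torch.xpu backend is available; the tensors are placeholders for real model inputs:
```python
import torch
import intel_extension_for_pytorch as ipex  # noqa: F401  (registers torch.xpu)

# A Flex 140 card should enumerate as two separate 6 GB devices.
for i in range(torch.xpu.device_count()):
    print(i, torch.xpu.get_device_name(i))

# One simple pattern: keep an independent workload per device.
a = torch.randn(4, 4, device="xpu:0")
b = torch.randn(4, 4, device="xpu:1")
print((a @ a).device, (b @ b).device)
```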
-
I'm not well versed in Python; where do I put the downloaded llama-2-7b-chat.Q4_0.gguf file?
I can get llama.cpp working easily on my laptop, but I can't seem to get this to work.
I did git c…
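The repo's own loading step is cut off above, but if the project wraps llama.cpp the way llama-cpp-python does, the file can live anywhere as long as its path is passed explicitly; a minimal sketch (the models/ directory is just a convention, not a requirement):
```python
from llama_cpp import Llama

# The .gguf file can sit in any directory; pass its path explicitly.
llm = Llama(model_path="./models/llama-2-7b-chat.Q4_0.gguf")
out = llm("Q: What is the capital of France? A:", max_tokens=32)
print(out["choices"][0]["text"])
```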
-
Loading the saved model runs into the following error. It also takes a very long time to run and save quantized models.
```
2024-03-21 08:48:58 [INFO] loading weights file models/4_bit_llama2-rtn/model.sa…
```
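For context, a minimal save-then-reload sketch of the flow the log suggests, assuming the RtnConfig weight-only quantization API available in recent intel_extension_for_transformers releases; the model name is illustrative, and only the output path is taken from the log:
```python
from intel_extension_for_transformers.transformers import (
    AutoModelForCausalLM,
    RtnConfig,
)

model_name = "meta-llama/Llama-2-7b-hf"  # illustrative

# Quantize with round-to-nearest (RTN) 4-bit weights and save the result.
model = AutoModelForCausalLM.from_pretrained(
    model_name, quantization_config=RtnConfig(bits=4)
)
model.save_pretrained("models/4_bit_llama2-rtn")

# Reload the already-quantized checkpoint from disk.
model = AutoModelForCausalLM.from_pretrained("models/4_bit_llama2-rtn")
```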
-
I changed the model by modifying the GitHub code: the code was updated to Llama 3.1, written in a format that matches both the Cloudflare model naming and the project's model format. After redeploying, the new model cannot be used.
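To isolate whether the deployment or the model ID is at fault, one quick check is calling the model outside the Worker via the REST endpoint; a sketch assuming `@cf/meta/llama-3.1-8b-instruct` is the intended catalog ID (the account ID and token are placeholders):
```python
import requests

ACCOUNT_ID = "your_account_id"  # placeholder
API_TOKEN = "your_api_token"    # placeholder
MODEL = "@cf/meta/llama-3.1-8b-instruct"  # assumed catalog ID for Llama 3.1

resp = requests.post(
    f"https://api.cloudflare.com/client/v4/accounts/{ACCOUNT_ID}/ai/run/{MODEL}",
    headers={"Authorization": f"Bearer {API_TOKEN}"},
    json={"messages": [{"role": "user", "content": "Hello"}]},
    timeout=60,
)
print(resp.json())
```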
-
This happens using the example code only:
```
from transformers import AutoTokenizer, TextStreamer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM
model_name = "Intel/neur…