-
Is there a specific version of the `openai` package that is aligned with the OpenAI-compatible interfaces offered by NeuralChat? I am currently testing with the current **1.12.0** but encountering a **422 Unprocessable Entit…
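A 422 from an OpenAI-compatible endpoint usually means the request body failed the server's schema validation (for example, a field added or renamed in a newer `openai` client). As a sanity check, one can bypass the client and post a minimal payload with only the stdlib; the base URL and model name below are placeholders, not confirmed NeuralChat values:

```python
import json
from urllib.request import Request, urlopen

# Minimal chat-completions payload; if this succeeds where the openai 1.12.0
# client gets a 422, the mismatch is in extra fields the client sends.
payload = {
    "model": "Intel/neural-chat-7b-v3-1",  # placeholder model name
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 32,
}

def post_chat(base_url, body):
    """POST the payload directly (server URL is an assumption)."""
    req = Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urlopen(req) as resp:
        return json.loads(resp.read())

# post_chat("http://localhost:8000", payload)  # uncomment against a live server
```

Comparing this hand-built body against what the client actually sends (e.g. via a logging proxy) usually pinpoints the rejected field.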
-
```python
from transformers import AutoTokenizer, TextStreamer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM, WeightOnlyQuantConfig
model_name = "Intel/neural-chat-…
```
-
Hi all,
I'm attempting to follow the SmoothQuant tutorial for the LLAMA2-7b model: [https://github.com/intel/neural-compressor/tree/master/examples/onnxrt/nlp/huggingface_model/text_generation/llam…
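For context, the core idea the tutorial applies is SmoothQuant's scale migration: per-channel activation outliers are shifted into the weights via a smoothing factor s_j = max|X_j|**alpha / max|W_j|**(1 - alpha), so that (X / s) · diag(s) · W is mathematically unchanged but easier to quantize. A toy pure-Python sketch (the real tutorial uses neural-compressor; the numbers below are hypothetical):

```python
def smooth(x_absmax, w_absmax, alpha=0.5):
    """Per-channel SmoothQuant smoothing factors from activation maxima
    x_absmax and weight maxima w_absmax (lists of equal length)."""
    return [(xa ** alpha) / (wa ** (1 - alpha))
            for xa, wa in zip(x_absmax, w_absmax)]

x_absmax = [8.0, 0.5, 2.0]   # hypothetical per-channel activation outliers
w_absmax = [0.5, 2.0, 2.0]   # hypothetical per-channel weight maxima
s = smooth(x_absmax, w_absmax)

# Migrate the difficulty: divide activations by s, multiply weights by s.
scaled_x = [xa / sj for xa, sj in zip(x_absmax, s)]
scaled_w = [wa * sj for wa, sj in zip(w_absmax, s)]
# The per-channel product x*w is preserved, but the activation range shrinks.
```

With alpha=0.5 the outlier channel's activation max drops from 8.0 to 2.0 while its weight max rises correspondingly, which is what makes 8-bit activation quantization feasible.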
-
We have seen a significant performance drop with the environment created from the latest repo for vLLM serving of the neural-chat model, compared to the old environment built from the previous repo. With …
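To make such a regression concrete, a minimal throughput harness helps compare the two environments on identical inputs; `generate` below is a hypothetical stand-in for the actual vLLM / neural-chat call, and the token count per reply is assumed fixed for simplicity:

```python
import time

def tokens_per_second(generate, prompts, tokens_per_reply):
    """Time N requests against a backend and report tokens/second,
    so old-env vs new-env throughput can be compared numerically."""
    start = time.perf_counter()
    total_tokens = 0
    for p in prompts:
        generate(p)                      # stand-in for the serving call
        total_tokens += tokens_per_reply
    elapsed = time.perf_counter() - start
    return total_tokens / elapsed

# Example with a dummy backend that just sleeps briefly:
rate = tokens_per_second(lambda p: time.sleep(0.001), ["hi"] * 10,
                         tokens_per_reply=32)
```

Running the same harness (same prompts, same sampling settings) in both environments turns "significant difference" into a reproducible number.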
-
Congrats on Flash Attention in the latest version, or more precisely, on having your storage limit increased on PyPI.org so you could upload the release that was ready weeks ago. Here are some benchmarks fo…
-
Hello everyone, we are seeing slower-than-expected inference times on one of our CPU nodes with an Intel(R) Xeon(R) Platinum 8362 CPU @ 2.80GHz and the following instruction sets:
```
fpu vme de pse tsc…
```
-
### Priority
P3-Medium
### OS type
Ubuntu
### Hardware type
AI-PC (Please let us know in description)
### Running nodes
Single Node
### Description
As AI PC or OPEA developer, I want to deplo…
-
### Describe the bug
When attempting to run "interpreter --local" and choosing jan.ai as the LLM provider, the model-choice function crashes the interpreter.
LM Studio runs as expected. (I'm assumi…
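Before blaming the model-choice code, it can help to confirm the local server actually answers. The sketch below assumes Jan exposes an OpenAI-compatible API at `http://localhost:1337/v1` (its commonly documented default); adjust the port if your install differs. It only builds the request; uncomment the last line to probe a running server:

```python
from urllib.request import Request, urlopen

def list_models_request(base_url="http://localhost:1337/v1"):
    """Build (but do not send) a GET /models request for the local server,
    the same endpoint an OpenAI-compatible client queries for model choices."""
    return Request(f"{base_url}/models",
                   headers={"Accept": "application/json"})

req = list_models_request()
# urlopen(req)  # uncomment to actually probe the running Jan server
```

If this request fails or returns an empty model list, the crash likely originates in the server's response rather than in the interpreter itself.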