-
After upgrading to version 0.1.27, performance has improved noticeably. Generation is still not very fast, but the program runs without significant lag. However, …
-
What are the system requirements to run the following sample code?
from transformers import AutoTokenizer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM, WeightOnlyQ…
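Exact requirements depend on the model being loaded, but a rough rule of thumb for weight-only quantized inference is parameter count × bits-per-weight ÷ 8, plus some overhead for activations and the KV cache. A minimal sketch (the 1.2× overhead factor is an assumption, not a measured value):

```python
def quantized_model_ram_gb(n_params_billion: float, bits_per_weight: int,
                           overhead: float = 1.2) -> float:
    """Rough RAM estimate for weight-only quantized inference.

    n_params_billion: parameter count in billions (e.g. 7 for a 7B model).
    bits_per_weight: quantized width (4 for INT4, 8 for INT8).
    overhead: assumed fudge factor for activations, KV cache, and buffers.
    """
    weight_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9

# A 7B model at 4-bit needs very roughly 4-5 GB of RAM:
print(round(quantized_model_ram_gb(7, 4), 1))  # → ~4.2
```

By this estimate, a 7B model quantized to 4-bit fits comfortably in 8 GB of system RAM, while the same model at 8-bit would want roughly twice that.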
-
### What is the issue?
I sometimes find that Ollama runs a model on the CPU when it should be on the GPU. I just upgraded to v0.1.32 and am still trying to find out how to reproduce the issue. I don't …
-
### System Info
Hi,
When testing on `Google Colab (Free Tier T4 GPU)`, this code crashes with RAM OOM [(Notebook)](https://colab.research.google.com/drive/1zAzdcH_KRQuc_0zWBEzYuaV1h4ERgzPy?usp=s…
-
I've attached a screen capture of responses being truncated, and also an image of my Settings, just in case. I am trying prometheus-13b-v1.0.Q5_K_M.gguf, which seems similar to GPT-4 (sort …
-
I'm using Fedora 39 and the latest git version of llama.cpp [96e80da].
llama.cpp is built with CLBlast enabled (Intel Iris Xe GPU on a laptop).
I wanted to test the grammar feature of llama.cpp with the fol…
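The excerpt is cut off before the grammar itself, but llama.cpp grammars are written in GBNF. A minimal illustrative sketch (a hypothetical grammar, not the one from this report), which constrains the model's output to a single yes/no answer:

```
# GBNF: restrict generation to "yes" or "no" followed by a newline
root   ::= answer
answer ::= ("yes" | "no") "\n"
```

A grammar file like this is passed to the llama.cpp CLI via its grammar options, after which sampling can only produce tokens that match the rules.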
-
Thanks for the project! I have managed to run it on CPU at a decent speed (**6.2 - 6.8 tokens per second**); however, the model only generates a small piece of content, and the response…
-
Hi. I am trying to understand issues with a conversion of NeuralBeagle14, which does not correctly use stop words when prompted with ChatML.
It seems that the generated special_tokens_map.json, t…
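For ChatML prompts, generation should stop at `<|im_end|>`. A hedged sketch of what a ChatML-aware special_tokens_map.json typically sets (illustrative values, not the actual NeuralBeagle14 file):

```json
{
  "bos_token": "<s>",
  "eos_token": "<|im_end|>",
  "unk_token": "<unk>",
  "pad_token": "<|im_end|>"
}
```

If the conversion leaves `eos_token` at the base model's default (e.g. `</s>`), the runtime never treats `<|im_end|>` as a stop token, which would produce exactly the stop-word behavior described above.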
-
I've noticed that after running a few models, sometimes the models don't behave normally. This is a session where that was occurring. I had first tried with bakllava but it wasn't being helpful eithe…
-
When asked a strictly math question, it does fine. However, when asked "what is your knowledge", the answer is:
The answer is: Good.
The answer is: Good.
].join(',')
].join(','.split(…