-
Currently there is no way to use large models: there is no support for 8-bit quantization and, more importantly, no support for device mapping.
As you can see, the first GPU is filled but s…
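For context, this is the kind of loading path being asked for; a minimal sketch assuming a Hugging Face transformers stack with accelerate and bitsandbytes installed (not this project's current API, and the model name is just a stand-in):
```
from transformers import AutoModelForCausalLM

# Shard layers across all visible GPUs and quantize weights to 8 bits.
# device_map="auto" requires accelerate; load_in_8bit=True requires bitsandbytes.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-13b",   # stand-in for any large model
    device_map="auto",
    load_in_8bit=True,
)
```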
-
I would like a script to pass a single instruction and receive an answer.
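A minimal shape for such a script, sketched with Hugging Face transformers purely as a stand-in for whatever loading/generation API this project exposes:
```
import sys
from transformers import pipeline

# One-shot: read a single instruction from argv, print a single answer.
generator = pipeline("text-generation", model="gpt2")
instruction = sys.argv[1]
result = generator(instruction, max_new_tokens=128)
print(result[0]["generated_text"])
```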
-
I have noticed that models give the same answers with the same prompt. It seems as if the seed is not randomized.
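If the cause is a fixed default seed, the usual fix is to draw a fresh seed per generation; a sketch assuming a PyTorch backend (the affected project's internals may differ):
```
import random
import torch

# Pick a new seed for every request unless the user supplies one explicitly.
seed = random.randrange(2**32)
torch.manual_seed(seed)
print(f"Using seed {seed}")  # log it so a run can still be reproduced on demand
```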
-
I want to start experimenting more with Retrieval Augmented Generation. As part of that, I want to be able to calculate embeddings against different models.
I want `llm` to grow a `llm embed` comma…
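A hypothetical shape for such a command (the model name and flag are illustrative assumptions, not an existing interface):
```
llm embed -m ada-002 "my text to embed"
```
printing the embedding as a JSON array of floats so it can be piped into other tools.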
-
Hey all, great work on integrating CUDA support for the prompt tokens. How much work would it be to support GPU decoding? Currently on llama.cpp I can reach about 35 tokens per second on LLaMA 7B on a…
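For a sense of the knob being requested, a sketch in ctransformers style; `gpu_layers` is my assumption for how layer offloading might be exposed, mirroring llama.cpp's `--n-gpu-layers` option:
```
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained(
    "/path/to/llama-7b-ggml-q4_0.bin",  # illustrative path
    model_type="llama",
    gpu_layers=32,  # hypothetical: offload 32 layers to the GPU for decoding
)
print(llm("The quick brown fox"))
```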
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [X] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
Trying a simple example on an M1 Mac:
```
from ctransformers import AutoModelForCausalLM
llm = AutoModelForCausalLM.from_pretrained(
"/path/to/starcoderbase-GGML/starcoderbase-ggml-q4_0.bin",
…
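For reference, a minimal completed sketch of that call, assuming ctransformers' local-file API where `model_type` must be given explicitly (`"starcoder"` being my assumption for this model family):
```
from ctransformers import AutoModelForCausalLM

# Local GGML files need an explicit model_type; "starcoder" is assumed here.
llm = AutoModelForCausalLM.from_pretrained(
    "/path/to/starcoderbase-GGML/starcoderbase-ggml-q4_0.bin",
    model_type="starcoder",
)
print(llm("def fib(n):"))
```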
-
**Description**
Support for loading 4-bit quantized MPT models
**Additional Context**
Occam released it, and added support for loading it to his GPTQ fork and his KoboldAI fork, which may be u…
-
### Describe the bug
Yesterday, this was working perfectly fine. However, I decided to update it using the "update_windows.bat" file, and now I can't get any model to run. The main model I am trying …
-
I would like to be able to decode a sequence of token ids incrementally in a decoder-agnostic manner. I haven't found a straightforward way to do this with the current API - the first token is treated…
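A common decoder-agnostic workaround is to decode the growing prefix and emit only the text diff; a minimal sketch with a Hugging Face tokenizer (the model name is illustrative):
```
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")

def incremental_decode(token_ids):
    """Yield the text each new token contributes, by decoding the growing
    prefix and diffing against the previously decoded string."""
    prev_text = ""
    for i in range(1, len(token_ids) + 1):
        text = tokenizer.decode(token_ids[:i])
        yield text[len(prev_text):]
        prev_text = text

ids = tokenizer.encode("Hello world, how are you?")
print(list(incremental_decode(ids)))
```
This is quadratic in sequence length; re-decoding only a sliding window of recent tokens is the usual optimization, but the prefix-diff version shows the idea.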