-
I used the macOS one-click installer to install and run h2oGPT. The web GUI at port 7860 works and I got the interface.
1. First, when trying to load TheBloke/Mistral-7B-Instruct-v0.1-GGUF, I get the error "File "transformers/tokeniza…
-
The ability to select between meta-llama/Llama-2-70b-chat-hf and OpenAssistant/oasst-sft-6-llama-30b would be nice.
-
I'm not sure how easy it is to add models, but this one is proving to be the best so far, and it is available on HuggingChat.
Model: [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mix…
-
**LocalAI version:**
I am on commit: **574fa67bdcafd618859fcda4d239f10f326182a6**
**Environment, CPU architecture, OS, and Version:**
I am on Windows and using WSL2 with Ubuntu 22.04:
…
-
After running:
> docker run --gpus all --shm-size 1g -p 8080:80 -v $PWD/data:/data ghcr.io/huggingface/text-generation-inference:0.9 --model-id google/flan-t5-small --num-shard 1
I receive:
> Run…
-
On Hugging Face there are many files called ggml-model-f16.bin or similar. Once downloaded, the user can rename them, but the information about their origin gets lost. Updating the file becomes difficult whe…
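One way to keep the origin traceable is to store the weights under a path derived from the repo id and record the download URL in a sidecar file instead of renaming the bare `ggml-model-f16.bin`. A minimal sketch (the repo id and file name here are illustrative, not from the issue):

```shell
# Illustrative repo id and file name; substitute the actual model.
repo="ggerganov/whisper.cpp"
file="ggml-model-f16.bin"

# Keep the Hugging Face repo id in the local directory layout.
mkdir -p "models/${repo}"

# Record the origin next to the weights so later updates stay traceable.
echo "https://huggingface.co/${repo}/resolve/main/${file}" \
  > "models/${repo}/${file}.source"

cat "models/${repo}/${file}.source"
```

With this layout, re-downloading or updating the file is a matter of re-reading the `.source` sidecar rather than guessing where a renamed file came from.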
-
requires the image: `huggingface_text-generation-inference_1.1.0.sqsh` (see https://github.com/huggingface/text-generation-inference/releases/tag/v1.1.0)
**Note that all of the commands require to…
-
I am trying to run CodeLlama with the following setup:
Model size: 34B
GPUs: 2x A6000 (sm_86)
I'd like to run the model tensor-parallel across the two GPUs. Correct me if I'm wrong, but the …
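For reference, text-generation-inference (used elsewhere in this thread) shards a model across GPUs via `--num-shard`. A hedged sketch of such a launch; the image tag, model id, and the NCCL workaround are assumptions, not a verified setup for this hardware:

```shell
# Sketch: shard the 34B model across both A6000s with --num-shard 2.
# Model id and image tag are illustrative.
# If sharding hangs, disabling NCCL peer-to-peer (NCCL_P2P_DISABLE=1)
# is a common workaround to try; whether it is needed on sm_86 is
# an assumption here.
docker run --gpus all --shm-size 1g -p 8080:80 -v $PWD/data:/data \
  -e NCCL_P2P_DISABLE=1 \
  ghcr.io/huggingface/text-generation-inference:1.1.0 \
  --model-id codellama/CodeLlama-34b-hf \
  --num-shard 2
```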
-
- [ ] Create philosophical shorts for why LLMs may actually "understand"
- [ ] Create a weekly target
- [ ] Reflect on how I would trickle from year to daily vision
- [ ] Create gigs on fastwork
- [ ] …
-
Background of this PoC:
1. [GGML](https://github.com/ggerganov/ggml) is a very compact, highly optimized pure C/C++ machine learning library. GGML is also the solid cornerstone of the amazing [whi…