-
Using the instructions here (https://github.com/ray-project/ray-llm#how-do-i-deploy-multiple-models-at-once), I'm trying to host two models on a single A100 80GB.
Two bundles are generated for the pla…
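For reference, the behavior I expected maps onto plain Ray Serve's fractional GPU scheduling rather than anything ray-llm-specific; here is a minimal sketch (the deployment names and the 0.5/0.5 split are my assumptions, and this is not ray-llm's own config format):

```python
# Minimal sketch: two Ray Serve deployments sharing one GPU via fractional
# num_gpus. Deployment names and the 0.5/0.5 split are assumptions.
from ray import serve

@serve.deployment(ray_actor_options={"num_gpus": 0.5})
class ModelA:
    async def __call__(self, request):
        return "response from model A"

@serve.deployment(ray_actor_options={"num_gpus": 0.5})
class ModelB:
    async def __call__(self, request):
        return "response from model B"

# Run both apps side by side; each worker reserves half of the A100.
serve.run(ModelA.bind(), name="model_a", route_prefix="/a")
serve.run(ModelB.bind(), name="model_b", route_prefix="/b")
```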
-
I tried to add the LM_STUDIO internal server as a model option, and I only tried it with a 2000-token context using the Google Gemma 7B model. I didn't get any results, even after upping the number of tok…
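For context, LM Studio's local server exposes an OpenAI-compatible API, so the quickest sanity check is to query it directly, outside the integration; a minimal sketch (the default port 1234 and the model identifier are assumptions; check what the LM Studio UI shows):

```python
# Minimal sketch: query LM Studio's OpenAI-compatible local server directly.
# The port (1234) and the model id are assumptions; check the LM Studio UI.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")
resp = client.chat.completions.create(
    model="google/gemma-7b",  # hypothetical id; use the one LM Studio lists
    messages=[{"role": "user", "content": "Say hello."}],
    max_tokens=64,
)
print(resp.choices[0].message.content)
```

If this also returns nothing, the problem is with the server or the model rather than the integration.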
-
Can someone please help me understand what I'm doing wrong here?
(llama_env) C:\Users\afull>torchrun --nproc_per_node 1 example_completion.py \
NOTE: Redirects are currently not supported in Windo…
-
# Problem
* Until we solve the 403 access problem (#676), there is no way to pull models from the Ollama server
* At the time I'm writing this, I don't think the Ollama registry (Docker …
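For context, pulling from a self-hosted Ollama server goes through its REST API, which is presumably where the 403 surfaces; a rough sketch of the call (Ollama's default port, placeholder model name):

```python
# Rough sketch: ask a running Ollama server to pull a model through its
# REST API. Default port; the model name is a placeholder.
import json
import requests

resp = requests.post(
    "http://localhost:11434/api/pull",
    json={"name": "gemma:7b", "stream": True},
    stream=True,
)
for line in resp.iter_lines():
    if line:
        print(json.loads(line).get("status"))  # e.g. "pulling manifest"
```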
-
In the Gemma 7B notebook, when rsLoRA and DoRA are active and the 4-bit and 8-bit settings are off, with r=8 and alpha=16, I encounter the error described below. I have targeted all linear layer…
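For reproduction, the settings above correspond roughly to this PEFT configuration (a sketch; the target-module list is my reading of "all linear layers" for Gemma):

```python
# Sketch of the configuration described above: rsLoRA + DoRA, r=8, alpha=16,
# no 4-bit/8-bit quantization. The target-module list is an assumption
# covering Gemma's linear projection layers.
from peft import LoraConfig

lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    use_rslora=True,   # rank-stabilized LoRA scaling
    use_dora=True,     # weight-decomposed low-rank adaptation
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
```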
-
### Description
Instead of downloading the models from HF, the services should fetch the weights via torrent.
### Dependencies
- This [implementation](https://github.com/premAI-io/from-hf-t…
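To make the fetch step concrete, a rough sketch using the libtorrent Python bindings (the magnet URI and save path are placeholders; verification, seeding policy, and resume handling are all ignored):

```python
# Rough sketch: fetch weights via BitTorrent instead of HF. The magnet URI
# and save path are placeholders; error handling is omitted.
import time
import libtorrent as lt

magnet = "magnet:?xt=urn:btih:..."  # placeholder for the weight archive
ses = lt.session()
params = lt.parse_magnet_uri(magnet)
params.save_path = "./weights"
handle = ses.add_torrent(params)

while not handle.status().is_seeding:  # download not finished yet
    s = handle.status()
    print(f"{s.progress * 100:.1f}% done, {s.download_rate / 1024:.0f} kB/s")
    time.sleep(5)
print("weights downloaded to ./weights")
```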
-
I've downloaded Ollama, and I'm not sure what I'm expecting to happen. I've pulled the model locally, but there is no guidance on what is expected to happen or how to use it.
Is it supposed to run on save? Is t…
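For context, after a pull, the model can at least be exercised by calling the local Ollama server directly; a minimal sketch (default port, placeholder model name):

```python
# Minimal sketch: exercise a pulled model via the local Ollama server's
# REST API. Default port; the model name is a placeholder.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "gemma:7b", "prompt": "Why is the sky blue?", "stream": False},
)
print(resp.json()["response"])
```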
-
First, load the model with the internet connection ON:
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/gemma-7b-bnb-4bit",
    max_seq_length = max_seq_length,
    dtype = dt…
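Then, a sketch of the offline re-load I'm aiming for, assuming the first run has cached the weights (HF_HUB_OFFLINE is standard Hugging Face Hub behavior, not anything Unsloth-specific):

```python
# Sketch: after the first online run cached the weights, force cache-only
# loading so no network access is attempted. Set the env var before loading.
import os
os.environ["HF_HUB_OFFLINE"] = "1"

from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/gemma-7b-bnb-4bit",
    max_seq_length = 2048,  # assumption; match the first run
    dtype = None,           # auto-detect, as in the standard Unsloth examples
    load_in_4bit = True,
)
```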
-
Instead of using ChatGPT, I would like to try using a local LLM. I am sure this would take some modifications, but I think we could make this work, and it would be an awesome addition to t…
-
The command `python3 torchchat.py where llama3` fails quietly, presumably because I don't have the HF token configured.
I assumed the code was broken, though, because I got a backtrace of the pr…
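For what it's worth, a minimal way to configure the token before retrying (a sketch; `huggingface-cli login` from a shell does the same thing interactively, and the HF_TOKEN variable name is just a convention here):

```python
# Minimal sketch: register a Hugging Face token so gated models such as
# llama3 can be downloaded. The HF_TOKEN env var is a convention; the
# token value itself is a placeholder you must supply.
import os
from huggingface_hub import login

login(token=os.environ["HF_TOKEN"])  # or login() for an interactive prompt
```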