-
As the title says, it would be nice to have that information so we can filter out embedding models if we want to allow model switching on a frontend
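As a sketch of what a frontend could do if the model listing exposed such a field (the `capabilities` key and the helper below are assumptions, not part of the current API):

```python
# Hypothetical sketch: drop embedding-only models from a model list,
# assuming each entry carries a "capabilities" field (not in the current API).
def selectable_models(models):
    """Keep only models usable for chat, excluding embedding-only ones."""
    return [m["name"] for m in models if "embedding" not in m.get("capabilities", [])]

models = [
    {"name": "llama2", "capabilities": ["chat"]},
    {"name": "nomic-embed-text", "capabilities": ["embedding"]},
]
print(selectable_models(models))  # only "llama2" remains
```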
-
### Description
After an ongoing conversation (more than 4K tokens) with multiple models (llama2, codellama, tinyllama) via ollama, I compressed the conversation to the 'detailed' level. It worked fi…
-
According to [this Refact blog post](https://refact.ai/blog/2023/self-hosted-15b-code-model/):
> Check out the [docs on self-hosting](https://github.com/smallcloudai/refact-self-hosting) to get you…
-
There is repetitiveness: when the Orchestrator, Subagent, and Refiner are all in sync, the program should terminate and return the final output. Example: Based on the conversation, it appears that we have re…
-
Let's turn this Tracehub application into a configurable GitHub Action. First of all, it will be more stable (a hosted service can experience downtime and so on), there is no need to host it, and us…
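A minimal sketch of how such an action could be wired into a workflow (the action name, inputs, and workflow path are all assumptions; the real interface is still to be designed):

```yaml
# .github/workflows/tracehub.yml — hypothetical usage sketch
name: Tracehub
on: [push]
jobs:
  sync:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # Hypothetical action name and inputs, shown only to illustrate the shape.
      - uses: tracehub/tracehub-action@v1
        with:
          github-token: ${{ secrets.GITHUB_TOKEN }}
```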
-
**Env:**
- Container: nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3
- TensorRT-LLM release: 0.7.1
- TRT-LLM backend repo tag: v0.7.1
- Model: Llama-2-70b
- tritonserver deployed on 2 A10…
-
Hi @karthink
Thanks for the great package, first of all.
I noticed that it can be tricky to feed GPT a message that contains parts of a previously generated response.
How to reproduce:
1. O…
-
Expected status codes:
* `200` OK
* `404` not found, or auth failed
Expected JSON response:
```json
[
{
"url": "https://api.github.com/repos/octocat/Hello-World/pulls/1347",
"i…
-
### System Info
I'm trying to apply PEFT to quantized LLMs. When I use prompt tuning, LoRA, or IA3, it works. However, when I use prefix tuning on 8-bit codellama-7b-hf, it reports the following erro…
-
### Version
Command-line (Python) version
### Operating System
Windows 11
### What happened?
When using LM Studio I get the following error:
There was a problem with request to openai API:
LL…