-
1. How many LLMs are needed for `setting`? Your paper [PaperQA: Retrieval-Augmented Generative Agent for Scientific Research](https://arxiv.org/pdf/2312.07559.pdf) seems to have employ…
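For reference, a minimal sketch of how separate LLMs can be configured through `Settings` in the paper-qa package; the `llm` and `summary_llm` fields reflect my reading of the current API, and the agent-LLM routing shown here is an assumption that may differ in your installed version:

```python
from paperqa import Settings
from paperqa.settings import AgentSettings  # assumed import path for the agent settings model

# Sketch only: the answering LLM, the evidence-summarization LLM, and the
# agent LLM can each point at a different model.
settings = Settings(
    llm="openai/mixtral:8x7b",          # main answer-generation model
    summary_llm="openai/mixtral:8x7b",  # model used to summarize retrieved evidence
    agent=AgentSettings(agent_llm="openai/mixtral:8x7b"),  # assumed: agent/tool-selection model
)
```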
-
Hello,
I changed the batch size from 1 (default) to 8 and then 32, and saw no change in PaperQA's behaviour (answer quality and speed), as follows:
```
settings = Settings(
    llm="openai/mixtral:8x7b", …
```
-
Hi,
Could you please add an option for code autocompletion, similar to the recently added GitHub Copilot support, but based on a local Ollama LLM?
Currently VS Code and JetBrains have such an option with Continue a…
-
### Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
### Describe the bug and reproduction steps
When I create a local LLM service with llama.cpp, I have verif…
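(For context, a minimal way such a llama.cpp service is usually verified before pointing the app at it, assuming the llama.cpp server is running with its OpenAI-compatible endpoint on the default port 8080; the model name below is a placeholder.)

```python
from openai import OpenAI

# llama.cpp's server exposes an OpenAI-compatible /v1 API, so the standard
# OpenAI client can be pointed at it directly.
client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="local-model",  # placeholder; the server answers with whatever model it was started with
    messages=[{"role": "user", "content": "Say hello."}],
)
print(resp.choices[0].message.content)
```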
-
- see also https://github.com/ObrienlabsDev/blog/issues/47
- see https://github.com/ObrienlabsDev/rag/issues/4
-
Environment
- Docker Image: nvcr.io/nvidia/tritonserver:24.10-trtllm-python-py3
- TensorRT-LLM Version: 0.14.0
- Run Command:
```
python3 ../run.py \
    --input_text "你好,请问你叫什么?" \
    --max_output_len=…
```
-
# 🐞 Describe the Bug
Facing an `OutOfResources` error with 64 fine-grained experts and dropless MoE enabled, even though there is sufficient GPU memory.
# 🔄 Steps to Reproduce
Steps to reprod…
-
### Describe the bug
```
interpreter --local
Open Interpreter supports multiple local model providers.
[?] Select a provider:
 > Ollama
   Llamafile
   LM Studio
   Jan
…
```
-
System config:
- CPU arch x86_64
- GPU: H200
- TensorRT-LLM: v0.14.0
- OS: Ubuntu 22.04
- runtime-env: Docker container built from source via the official [build script](https://techcommunity.microsoft.c…
-
System Info
GPU: NVIDIA RTX 4090
TensorRT-LLM 0.13
Question 1: How can I use the OpenAI-compatible API to perform inference on a TensorRT engine model?
```
root@docker-desktop:/llm/tensorrt-llm-0.13.0/examples/apps# pyt…
```
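A sketch of the client side, assuming an OpenAI-compatible example server from `examples/apps` (e.g. `openai_server.py`, if your version ships it) is already running locally on port 8000 on top of the TensorRT engine; the port and model id below are assumptions, so match them to however the server was launched:

```python
from openai import OpenAI

# Query a locally running OpenAI-compatible server that wraps the TensorRT engine.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="dummy")

resp = client.completions.create(
    model="tensorrt_llm",  # placeholder model id
    prompt="Hello, what is your name?",
    max_tokens=64,
)
print(resp.choices[0].text)
```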