llm-framework Search Results

1000+ results
for llm-framework

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/onnxruntime-genai #1037

Cuda / DirectML question

Hi there, you made fantastic framework for llms. But what I find very confusing is how to run this on cuda and direct ml. I simply don't know how to do it in C#.. I there any example? Second questio…

janjanusek updated 2 days ago
5
PygmalionAI/aphrodite-engine #792

[New Method]: VPTQ, Vector Post-Training Quantization

### The quantization format Hi all, We have recently designed and open-sourced a new method for Vector Quantization called Vector Post-Training Quantization (VPTQ). Our work is available at [VPTQ…

YangWang92 updated 3 weeks ago
2
langchain-ai/langchain #25772

OpenLLM cant connect to local server with error: "module 'op…

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a sim…

SmolPandaDev updated 3 weeks ago
2
vercel/ai #3521

Custom Tool Parser for Open Source Models

### Feature Description When using LLM serving frameworks such as [vLLM](https://github.com/vllm-project/vllm) or [MLC-LLM](https://github.com/mlc-ai/mlc-llm) , or services that host open-source mod…

ShervK updated 6 days ago
18
OvidijusParsiunas/deep-chat #262

Adding ollama api connection

Hi, i want to do my own chat AI interface for study and personnal project. I find the library interesting, the only problem is that I use Ollama as LLM provider (local server for llms). So Ithink that…

ZaMeR12 updated 1 month ago
6
vllm-project/vllm #10294

[Feature]: Quark quantization format upstream to VLLM

Quark is a comprehensive cross-platform toolkit designed to simplify and enhance the quantization of deep learning models. Supporting both PyTorch and ONNX models, Quark empowers developers to optimiz…

kewang-xlnx updated 3 days ago
3
coleam00/bolt.new-any-llm #401

APICallError [AI_APICallError]: prompt is too long: 202609 t…

### Describe the bug APICallError [AI_APICallError]: prompt is too long: 202609 tokens > 200000 maximum at file:///C:/Bolt/bolt.new-any-llm/node_modules/.pnpm/@ai-sdk+provider-utils@1.0.9_zod@3.…

veerababumanyam updated 7 hours ago
1
lmstudio-ai/lmstudio-bug-tracker #126

ToolCall issue in LM Studio - Model : Llama 3.1 #75

ToolCall is not generating from the response of llama 3.1 model from LM Studio, when using langchain framework connecting through ChatOpenAI , Same Tool call is working fine with ollama for the same …

Vikneshkumarmohan updated 2 weeks ago
1
i-am-bee/bee-agent-framework #182

[RFC] Data Analysis Toolkit

**Is your feature request related to a problem? Please describe.** This feature proposal introduces a toolkit for SQL and data analysis, by enhancing current SQL tool and introducing few other conc…

tonxxd updated 3 days ago
1
QwenLM/Qwen2.5 #1018

[Bug]: vLLM got different results with PeftModelForCausalLM

### Model Series Qwen2.5 ### What are the models used? Qwen2.5-0.5B-Instruct ### What is the scenario where the problem happened? inference with transformers, deployment with vllm/PeftModelForCau…

chansonzhang updated 13 hours ago
4

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for llm-framework

1000+ results
for llm-framework