-
There is a fairly new GitHub Marketplace Models offering that provides a free API key for GPT-4o. Are you planning to add support for it in this project?
-
### System Info
```shell
When I use the k8s sample example for LoRA fine-tuning of the Llama 3 8B model, it works fine, but the 70B model fails with OOM.
Total number of GPUs: 8 x Gaudi3 GPUs
Dataset: databr…
-
## Describe the solution you'd like
Create model configuration file for UI (React).
## Why the solution needed
After conducting a Proof of Concept (PoC) and determining that a specific model …
-
My goals are:
Split Qwen2_VL into two independent models, Vit_model and LLM_model, and deploy each to a different Triton server.
Use Triton's Ensemble mode to chain the two models together so that they reproduce the behavior of the original Qwen2_VL model.
During inference, first process the image with Vit_model, then pass the resulting visual features to LLM_model to generate the final text output.
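For reference, a chained pipeline like this is what Triton's ensemble scheduler expresses in `config.pbtxt`. The sketch below is illustrative only: the model names (`vit_model`, `llm_model`) and tensor names (`IMAGE`, `VISUAL_FEATURES`, `PROMPT_IDS`, `TEXT_OUT`) are placeholders, not names from the actual setup. Note one caveat: an ensemble assumes both models live in the same Triton model repository on one server; chaining models deployed on *different* Triton servers requires a client-side pipeline or a BLS model instead.

```
# config.pbtxt for a hypothetical "qwen2_vl_ensemble" (illustrative sketch)
name: "qwen2_vl_ensemble"
platform: "ensemble"
input [
  { name: "IMAGE", data_type: TYPE_FP32, dims: [ 3, -1, -1 ] },
  { name: "PROMPT_IDS", data_type: TYPE_INT64, dims: [ -1 ] }
]
output [
  { name: "TEXT_OUT", data_type: TYPE_INT64, dims: [ -1 ] }
]
ensemble_scheduling {
  step [
    {
      model_name: "vit_model"
      model_version: -1
      # key = the step model's own tensor name, value = ensemble tensor name
      input_map { key: "IMAGE" value: "IMAGE" }
      output_map { key: "VISUAL_FEATURES" value: "features" }
    },
    {
      model_name: "llm_model"
      model_version: -1
      input_map { key: "VISUAL_FEATURES" value: "features" }
      input_map { key: "PROMPT_IDS" value: "PROMPT_IDS" }
      output_map { key: "TEXT_OUT" value: "TEXT_OUT" }
    }
  ]
}
```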
-
On macOS Sonoma I have one model running. This appears not to be running as a container but is working well, with Apple Silicon metal/gpu support. `podman ps` shows a variety of other containers (of m…
-
It would be cool if the advanced prompt enhancer could include the response_format parameter in order to enable JSON mode with LLMs. This could be just [simple JSON mode](https://platform.openai.com/d…
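As a rough illustration of what the feature would look like, the OpenAI Chat Completions API enables JSON mode by passing `response_format={"type": "json_object"}` alongside the messages (and the prompt must mention JSON). The helper below is a hypothetical sketch of how the enhancer might build such a request body; `build_chat_request` and the model name are assumptions for illustration, not part of any existing project API.

```python
import json

def build_chat_request(prompt: str, json_mode: bool = True) -> dict:
    """Build a Chat Completions request body (hypothetical helper)."""
    body = {
        "model": "gpt-4o",  # assumed model name, for illustration only
        "messages": [
            # JSON mode requires the word "JSON" to appear in the prompt.
            {"role": "system", "content": "Reply only with valid JSON."},
            {"role": "user", "content": prompt},
        ],
    }
    if json_mode:
        # This is the parameter that enables simple JSON mode.
        body["response_format"] = {"type": "json_object"}
    return body

request = build_chat_request("List three colors as a JSON object.")
print(json.dumps(request["response_format"]))
```

Structured-output variants (a JSON schema instead of plain JSON mode) use the same parameter with a different `type`, so the enhancer could expose both through one setting.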
-
My code:
```python
import typing as t
import asyncio
from typing import List
from datasets import load_dataset, load_from_disk
from ragas.metrics import faithfulness, context_recall, context_precisi…
-
### Software
Desktop Application
### Operating System / Platform
Linux
### Your Pieces OS Version
3.1.6
### Early Access Program
- [ ] Yes, this is related to an Early Access Program feature.
…
-
### Your current environment
```text
The output of `python collect_env.py`
```
CODE:
from fastapi import FastAPI  # missing in the original snippet; FastAPI() is used below
from langchain.llms import VLLM
import time
import uvicorn
app = FastAPI()
llm = VLLM(model="tiiua…
-
Do you have an equivalent simple C implementation of an LLM, but for inference of LLaMA models?
I am trying to build an FPGA accelerator for LLMs, and a simple reference C implementation would be very helpful.
Thank…