-
After successfully adding our Azure-deployed `gpt4o_mini` and `text-embedding-ada-002` instances (many thanks for this plugin, by the way! [I've made a small PR to update the README](https://github.com/fabg…
-
The backend code generation keeps looping back and re-running after it finishes:
log:complete: node:op:PROJECT::STATE:UPDATE {"context":{"streams":{},"project":"todos"},"response":{"success":false},"data":{"operation"…
-
```
task_manager = TaskManager(self.agent_config.get("agent_name", self.agent_config.get("assistant_name")),
bolna-app-1 | File "/app/bolna/agent_manager/task_manager.py", line 58, in __init__
…
```
-
https://developer.nvidia.com/zh-cn/blog/nvidia-tensorrt-llm-revs-up-inference-for-google-gemma/
This post says Gemma supports quantization; does RecurrentGemma support quantization as well?
-
@Pty72
Hello! Wouldn't using SCC for LLM knowledge distillation (KD) be quite slow? When I first proposed SCC for KD, I did consider replicating it in NLP and multimodal settings, but once LLMs took off I felt I didn't have the resources to try it. A friend at Baidu working on autonomous driving told me their cross-modal KD was already too slow. I see that your code computes the soft rank twice; wouldn't that be even slower?
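To make the cost concern concrete: here is a minimal NumPy sketch (not the actual SCC implementation; `soft_rank`, `soft_spearman`, and the sigmoid/temperature formulation are illustrative assumptions) of why a differentiable soft rank is O(n²) per call, and why an SCC-style loss that soft-ranks both the student and the teacher logits pays that cost twice per batch:

```python
import numpy as np

def soft_rank(x, tau=0.1):
    # Differentiable surrogate for ranks: rank_i ~ 1 + sum_j sigmoid((x_i - x_j) / tau).
    # The pairwise difference matrix makes each call O(n^2) in the number of items,
    # which is the slowness concern raised above.
    diff = np.clip((x[:, None] - x[None, :]) / tau, -50.0, 50.0)
    s = 1.0 / (1.0 + np.exp(-diff))
    np.fill_diagonal(s, 0.0)
    return 1.0 + s.sum(axis=1)

def soft_spearman(student, teacher, tau=0.1):
    # SCC-style similarity: Pearson correlation of the two soft-rank vectors.
    # Note soft_rank runs TWICE here (once per model), doubling the O(n^2) work.
    rs, rt = soft_rank(student, tau), soft_rank(teacher, tau)
    rs, rt = rs - rs.mean(), rt - rt.mean()
    return (rs @ rt) / (np.linalg.norm(rs) * np.linalg.norm(rt) + 1e-12)

student = np.array([0.1, 0.9, 0.4, 0.6])
teacher = np.array([0.2, 0.8, 0.3, 0.7])
print(soft_spearman(student, teacher, tau=0.01))  # near 1.0: the two orderings agree
```

One way to cut the cost in a real KD loop would be to soft-rank only a sampled subset of logits per step, since the O(n²) term dominates for large vocabularies.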
-
**Is your feature request related to a problem? Please describe:**
Congrats on the launch! Very cool stuff, but one immediate limitation I noticed is that you don't have real-time info about packages. li…
-
This is so good. It would be perfect if it also worked with local LLMs such as Phind, and not only with the OpenAI API.
-
The existing template for text submitted to the LLM is too large in terms of wasted tokens. For example, this comes to ~580 tokens:
```
Context: Title: s3://gendox.organization.documents.dev/9228b56c-1058-4b…
```
-
### What happened?
I am trying to run Qwen2-57B-A14B-Instruct, and I used llama-gguf-split to merge the GGUF files from [Qwen/Qwen2-57B-A14B-Instruct-GGUF](https://huggingface.co/Qwen/Qwen2-57B-A14B-…
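For anyone reproducing this, a sketch of the merge step, assuming llama.cpp's `llama-gguf-split` tool in `--merge` mode; the shard file names below are placeholders, not the exact downloaded file names:

```shell
# Merge sharded GGUF files into a single file by pointing --merge at the first shard.
llama-gguf-split --merge \
    qwen2-57b-a14b-instruct-00001-of-00002.gguf \
    qwen2-57b-a14b-instruct-merged.gguf

# Note: recent llama.cpp builds can also load split GGUFs directly, so merging
# may not be necessary; passing the first shard to the runtime is often enough.
llama-cli -m qwen2-57b-a14b-instruct-00001-of-00002.gguf -p "Hello"
```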
-
Hi TensorRT-LLM team, your work is incredible.
By following the README file for [multimodal models](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/multimodal/README.md), we were able to successfully run…