-
Add a section about testing LLMs; this is crucial.
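A testing section could lead with a pattern like the following sketch: exercising an LLM wrapper against a deterministic stub instead of a live model. All class names here (`FakeLLM`, `SummarizerClient`) are hypothetical stand-ins, not from any library.

```python
class FakeLLM:
    """Stub that returns canned completions, so tests are fast and deterministic."""
    def __init__(self, canned):
        self.canned = canned
        self.calls = []  # record prompts for later assertions

    def complete(self, prompt):
        self.calls.append(prompt)
        return self.canned


class SummarizerClient:
    """Thin wrapper under test: builds the prompt and post-processes the reply."""
    def __init__(self, llm):
        self.llm = llm

    def summarize(self, text):
        reply = self.llm.complete(f"Summarize in one line:\n{text}")
        return reply.strip()


def test_summarize_strips_whitespace():
    llm = FakeLLM("  a short summary \n")
    client = SummarizerClient(llm)
    assert client.summarize("long document...") == "a short summary"
    # The wrapper passed the document through to the model exactly once.
    assert len(llm.calls) == 1 and "long document" in llm.calls[0]
```

Testing the wrapper rather than the model keeps the suite deterministic; live-model checks belong in a separate, clearly marked integration tier.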
-
### Description
If you download a GGUF model and update the LLM URL settings to the port where kotaemon is serving the model, testing against the "ollama" LLM works.
However, the Embeddin…
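A minimal sketch of the configuration split this report points at: the LLM and the embedding model are configured independently, so fixing the LLM URL does not automatically fix embeddings. All key and model names below are illustrative, not kotaemon's real settings schema.

```python
LOCAL_SERVER = "http://localhost:11434"  # assumed port of the local model server

settings = {
    "llm": {
        "provider": "ollama",
        "base_url": f"{LOCAL_SERVER}/v1",  # OpenAI-compatible chat endpoint
        "model": "my-gguf-model",          # placeholder name
    },
    "embeddings": {
        "provider": "ollama",
        "base_url": f"{LOCAL_SERVER}/v1",  # must be pointed at the server separately
        "model": "my-embedding-model",     # placeholder name
    },
}

# Both sections need the local URL; updating only settings["llm"] leaves
# embeddings pointing at whatever default they had before.
assert settings["llm"]["base_url"] == settings["embeddings"]["base_url"]
```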
-
Hugging Face Hub login successful
Used the gemma-2-27b LLM for testing:
```
cargo run --release -- -m "google/gemma-2-27b-it" -c
Finished release [optimized] target(s) in 0.03s
Running `target/re…
```
-
### **Is your feature request related to a problem? Please describe.**
PyRIT currently lacks built-in support for easily using and comparing multiple LLM providers. This makes it challenging for user…
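One shape such support could take is a small provider abstraction that runs the same prompt against every configured backend. This is a hypothetical sketch, not PyRIT's actual target API; `EchoProvider` and `UpperProvider` are toy stand-ins for real providers.

```python
from abc import ABC, abstractmethod


class ChatProvider(ABC):
    """Common interface every backend adapter would implement."""
    name: str

    @abstractmethod
    def send(self, prompt: str) -> str: ...


class EchoProvider(ChatProvider):
    name = "echo"
    def send(self, prompt):
        return prompt


class UpperProvider(ChatProvider):
    name = "upper"
    def send(self, prompt):
        return prompt.upper()


def compare(providers, prompt):
    """Run one prompt against every provider and collect replies side by side."""
    return {p.name: p.send(prompt) for p in providers}


results = compare([EchoProvider(), UpperProvider()], "hello")
# results == {"echo": "hello", "upper": "HELLO"}
```

With real adapters behind the same interface, side-by-side comparison reduces to one `compare` call per prompt.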
-
details here: https://docs.arize.com/phoenix
RedSAIA project integration: https://gitlab.consulting.redhat.com/redprojectai/infrastructure/appdeploy/-/tree/main/phoenix?ref_type=heads
-
I have developed a new KV cache quantization scheme. I am now interested in testing its performance within TensorRT-LLM.
I'm new to this project, so I am trying to understand the current implementa…
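For orientation, the arithmetic at the core of the simplest KV-cache scheme, per-tensor absmax int8 quantization, can be sketched in a few lines. This is a toy illustration only; TensorRT-LLM's real quantization lives in C++/CUDA kernels with very different layouts.

```python
def quantize_int8(values):
    """Map floats to int8 with a single absmax scale; returns (ints, scale)."""
    scale = max(abs(v) for v in values) / 127.0  # assumes not all values are zero
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize_int8(q, scale):
    return [x * scale for x in q]


kv = [0.5, -1.27, 0.03, 1.0]  # stand-in for one KV-cache tensor
q, scale = quantize_int8(kv)
recovered = dequantize_int8(q, scale)
# Round-to-nearest bounds the error by half a quantization step (scale / 2).
assert all(abs(a - b) <= scale / 2 + 1e-9 for a, b in zip(kv, recovered))
```

A new scheme would replace `quantize_int8`/`dequantize_int8` with its own mapping while keeping the same cache read/write points.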
-
### What happened?
I encountered an issue while loading a custom model in llama.cpp after converting it from PyTorch to GGUF format. Although the model was able to run inference successfully in PyTor…
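When a model runs in PyTorch but not after conversion, one useful first check is diffing the tensor inventory of the source checkpoint against what the converter emitted. The sketch below is a hypothetical debugging aid; the dicts stand in for a real `state_dict` and GGUF metadata.

```python
def diff_tensors(src, converted):
    """Report tensors missing after conversion or with mismatched shapes."""
    missing = sorted(set(src) - set(converted))
    mismatched = sorted(
        name for name in src if name in converted and src[name] != converted[name]
    )
    return missing, mismatched


# Toy stand-ins: name -> shape, as a real tool would read from both files.
src = {"tok_embeddings.weight": (32000, 4096), "output.weight": (32000, 4096)}
converted = {"tok_embeddings.weight": (32000, 4096)}

missing, mismatched = diff_tensors(src, converted)
# missing == ["output.weight"], mismatched == []
```

A non-empty `missing` or `mismatched` list points at the conversion step rather than at llama.cpp's inference code.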
-
### Affected component
llms/ShuttleAIToolModel.py
### Motivation
Our testing indicates that recent changes to ShuttleAIModel have surfaced JSON-related errors:
--
FAILED tests/llms/ShuttleAIModel_t…
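One common source of such failures, shown here as an illustrative sketch rather than ShuttleAIToolModel's actual code, is that tool-calling models return JSON wrapped in prose or code fences, so a bare `json.loads` on the raw reply raises `JSONDecodeError`.

```python
import json
import re


def extract_json(reply):
    """Parse a JSON object from a model reply, tolerating a ```json fence."""
    # Grab the outermost {...} span; DOTALL lets it cross newlines.
    match = re.search(r"\{.*\}", reply, re.DOTALL)
    if match is None:
        raise ValueError(f"no JSON object in reply: {reply!r}")
    return json.loads(match.group(0))


assert extract_json('```json\n{"tool": "search", "query": "llms"}\n```') == {
    "tool": "search",
    "query": "llms",
}
```

A regression test along these lines, fed with fenced, bare, and malformed replies, would pin down whether the new errors come from parsing or from the model output itself.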
-
### System Info
- NVIDIA A100 80G * 2
- Libraries
- TensorRT-LLM: 0.11.0.dev2024052800
- Driver Version: 525.105.17
- CUDA Version: 12.4
### Who can help?
@byshiue @schetlur-nv
##…
-
This reports mistral.rs as being faster than llama.cpp: https://github.com/EricLBuehler/mistral.rs/discussions/612
But I'm seeing much slower speeds for the same prompt/settings.
Mistral.rs
``…
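Before concluding one backend is slower, it is worth checking that both numbers are tokens/second over the same phase, since prompt processing and decode are often reported separately. A minimal sketch of the decode-only calculation (function name and figures are illustrative):

```python
def decode_tps(total_tokens, prompt_tokens, total_s, prompt_s):
    """Tokens/second for the decode phase only, excluding prompt processing."""
    return (total_tokens - prompt_tokens) / (total_s - prompt_s)


# Example: 612 total tokens, 100 of them prompt, 2 s spent on the prompt
# out of 10 s total -> 512 generated tokens in 8 s of decode time.
rate = decode_tps(612, 100, 10.0, 2.0)
# rate == 64.0 tokens/s
```

Comparing a decode-only figure from one backend against an end-to-end figure from the other would make the faster backend look slow.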