llm-application Search Results

1000+ results
for llm-application

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

1b5d/llm-api #15

Illegal instruction (core dumped)

I presume there is a minimum CPU requirement like needing AVX2, AVX-512, FP16C or something? Could you document the minimum instruction set and extensions required. root@1d1c4289f303:/llm-api# p…

dbzoo updated 10 months ago
3
opea-project/GenAIExamples #97

Involvement with the CNCF Cloud Native AI Working Group

Hello, I'm one of the leads for the CNCF Cloud Native AI Working Group. It would be great if we can get some of the folks working on this initiative to help create a Cloud Native AI reference archi…

raravena80 updated 1 week ago
4
mlflow/mlflow #12798

[FR] Tracing for Langchain's Runnable.astream_events() and L…

### Willingness to contribute Yes. I would be willing to contribute this feature with guidance from the MLflow community. ### Proposal Summary At the moment, using MLServer autologging for Langchai…

lragnarsson updated 1 month ago
3
run-llama/llama-hub #242

Discuss: Support Llamaindex connector for MeltanoHub, which …

A generic interface into hub.meltano.com would be great. In that paradigm, the source connectors are called "extractors" or "taps". There are a few different ways we could create generic connectio…

aaronsteers updated 11 months ago
1
vllm-project/vllm #5162

[Bug]: Unable to Use Prefix Caching in AsyncLLMEngine

### Your current environment ```text The output of `python collect_env.py` Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: …

kezouke updated 3 months ago
13
explodinggradients/ragas #662

Testset Generation: Is going into continuous loop

**Question** I am not sure, what's happening. I see that the testset data isn't generating and just going into a continuous loop, exhausting the tokens of openAI **My Code** from ragas.testset.g…

Vtej98 updated 2 weeks ago
19
triton-inference-server/tensorrtllm_backend #285

Input tensor 'host_sink_token_length' not found when launch …

I installed tensorrtllm_backend in the follow way: 1. `docker pull nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3` 2. `docker run -v /data2/share/:/data/ -v /mnt/sdb/benchmark/xiangrui:/root…

xxyux updated 3 months ago
15
triton-inference-server/tensorrtllm_backend #333

Batching not working : QPS remains same on increasing batch …

### System Info - DGX-A100 - Triton Image : v0.7.2 ### Who can help? @kaiyux _No response_ ### Information - [X] The official example scripts - [ ] My own modified scripts ### Ta…

RahulnKumar updated 4 months ago
6
vanna-ai/vanna #317

SQL generated by LLM be limited to the selected training dat…

Due to data security control requirements, can the SQL generated by LLM be limited to the selected training data instead of all the trained data?

tzh5477 updated 5 months ago
6
run-llama/llama_index #15935

[Bug]: draw_all_possible_flow returns a blank html

### Bug Description I am trying out the example specified in https://docs.llamaindex.ai/en/stable/examples/workflow/rag/ page. Please find my code below ``` from llama_index.core.workflow import E…

plaban1981 updated 3 days ago
26

上一页 1...91 92 93 94 95 96 97...100 下一页

1000+ results for llm-application

1000+ results
for llm-application