-
I presume there is a minimum CPU requirement, such as AVX2, AVX-512, or F16C?
Could you document the minimum instruction set and extensions required?
root@1d1c4289f303:/llm-api# p…
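For anyone comparing machines while the requirement is undocumented, here is a minimal sketch of how the available extensions could be listed on x86 Linux by reading `/proc/cpuinfo`. The flag list (`avx`, `avx2`, `avx512f`, `f16c`, `fma`) is only an assumption about what might matter; it is not the project's confirmed requirement, and the helper names are hypothetical.

```python
from pathlib import Path

# Flag names as spelled in /proc/cpuinfo on x86 Linux.
# ASSUMPTION: this list is illustrative, not the project's documented minimum.
CANDIDATE_FLAGS = ("avx", "avx2", "avx512f", "f16c", "fma")


def parse_cpu_flags(cpuinfo_text: str) -> set[str]:
    """Return the feature flags from the first 'flags' line of cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split())
    return set()  # no 'flags' line found (e.g. non-x86 or empty input)


def report_flags(cpuinfo_text: str, wanted=CANDIDATE_FLAGS) -> dict[str, bool]:
    """Map each wanted flag name to whether the CPU reports it."""
    flags = parse_cpu_flags(cpuinfo_text)
    return {name: name in flags for name in wanted}


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        for name, present in report_flags(cpuinfo.read_text()).items():
            print(f"{name}: {'yes' if present else 'no'}")
    else:
        print("/proc/cpuinfo not found; this sketch only covers Linux.")
```

Running this inside the container shows which of the candidate extensions the host CPU exposes, which can then be compared against a machine where the binary works.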
dbzoo updated 10 months ago
-
Hello,
I'm one of the leads for the CNCF Cloud Native AI Working Group. It would be great if we can get some of the folks working on this initiative to help create a Cloud Native AI reference archi…
-
### Willingness to contribute
Yes. I would be willing to contribute this feature with guidance from the MLflow community.
### Proposal Summary
At the moment, using MLServer autologging for Langchai…
-
A generic interface into hub.meltano.com would be great. In that paradigm, the source connectors are called "extractors" or "taps".
There are a few different ways we could create generic connectio…
-
### Your current environment
```text
The output of `python collect_env.py`
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: …
```
-
**Question**
I am not sure what's happening. The testset data isn't being generated; the process just goes into a continuous loop, exhausting my OpenAI tokens.
**My Code**
from ragas.testset.g…
-
I installed tensorrtllm_backend in the following way:
1. `docker pull nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3`
2. `docker run -v /data2/share/:/data/ -v /mnt/sdb/benchmark/xiangrui:/root…
xxyux updated 3 months ago
-
### System Info
- DGX-A100
- Triton Image : v0.7.2
### Who can help?
@kaiyux
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Ta…
-
Due to data security control requirements, can the SQL generated by the LLM be limited to selected training data instead of all of the trained data?
-
### Bug Description
I am trying out the example from the https://docs.llamaindex.ai/en/stable/examples/workflow/rag/ page.
Please find my code below:
```
from llama_index.core.workflow import E…
```