-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
**Describe the solution you'd like**
Th…
-
![image](https://github.com/user-attachments/assets/6b593263-f76c-4512-a697-69ad737134e3)
Error: Error applying IAM policy for cloudrun service "v1/projects/genai-ctimoteo/locations/us-central1/…
-
Reproducible on version `24.10` when run against a NIM server serving `meta/llama-3.1-8b-instruct`.
# Steps to reproduce:
## Perform two profiling runs:
```
genai-perf profile \
-m meta/ll…
-
It looks to me that while the `profile` command supports the `--tokenizer` argument, the `compare` command has no such option. This leads to a case where you can have a series of e…
-
I have built a small example using the Python binding here: https://github.com/tarekziade/onnxruntime-test/blob/main/run.py
to measure the inference speed on my Apple M1 and on a Windows 11 box, using …
-
Followed the instructions in [Integrating the Spring AI Chatbot](https://github.com/odedia/spring-petclinic-microservices/?tab=readme-ov-file#integrating-the-spring-ai-chatbot), but the application fails t…
-
During my experiments with the Llama3_2-3B model (and its built-in tokenizer) I have noticed the following issue: even if I set `--synthetic-input-tokens-mean 256 --synthetic-input-tokens-stddev 0`, n…
-
### Willingness to contribute
Yes. I can contribute this feature independently.
### Proposal Summary
The ability to trace the execution of the GenAI metrics when using `mlflow.evaluate()`
### Mo…
-
Generative language models learn about patterns in language through training data.
Then, given some text, they predict **what comes next.**
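
As a rough illustration of "predict what comes next", here is a toy sketch that counts which word follows which in a tiny training text and then returns the most frequent follower. It is a deliberate simplification: real generative language models use neural networks over tokens, not word counts. The names `train_bigram` and `predict_next` and the sample corpus are made up for this example.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count, for each word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen with one."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical training data, chosen only for illustration.
corpus = "the cat sat on the mat and the cat slept"
model = train_bigram(corpus)

print(predict_next(model, "the"))
```

In this corpus "the" is followed by "cat" twice and "mat" once, so the sketch picks "cat"; a word with no recorded follower yields `None`.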
-
**Expected Behavior**
Support Oracle Cloud Infrastructure (OCI) GenAI Embedding models
**Current Behavior**
No implementation exists
**Context**
Spring AI has support for Oracle vecto…