-
I'm attempting to convert the quantized model [Meta-Llama-3.1-8B-Instruct-quantized.w4a16](https://huggingface.co/neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16) to ONNX using the ONNXRuntim…
-
Hi
first of all, thanks for this "awsome" repo that makes the integration between NVIDIA NIM and AWS EKS much easier. Based on [this blog post](https://aws.amazon.com/blogs/hpc/deploying-generative-a…
-
Hey @bghira
Just stumbled into an issue trying to deploy SimpleTuner as a dockerised serverless image on Runpod.
Everything in the code works fine locally and on a pod using the same hardware (L…
-
I the [documentation](https://developer.android.com/ai/gemini-nano#supported-functionality) we can see that the Edge sdk is supported by other devices like s24, pixel 8a etc. But in the readme of this…
-
Issue to track information related to Knowledge Graphs & GraphRAG implementation efforts
GraphRAG
Articles
- https://emergentmethods.medium.com/outperforming-claude-3-5-sonnet-with-phi-3-mini-4k-…
-
I have the following script (also in samples extract-quotes.genai.js). Am I misunderstanding how to make the
rest of the prompt (including the FILE def interact with the AICI.gen call?
```js
scr…
bzorn updated
6 months ago
-
### Describe the documentation issue
Current information limited to Cuda 11.*
**Need Update** and How to make it works with **Cuda 12.***
https://onnxruntime.ai/docs/tutorials/csharp/csharp-gpu.h…
-
MSVC generates compile error (newline in string literal) on the unrecongnized question mark character in this line (https://github.com/openvinotoolkit/openvino.genai/blob/master/src/cpp/src/text_callb…
-
### Description of the bug:
Hi everyone,
is this normal that response is terminating while defined stop_sequence shows in response?
I followed gemini API guide an set generation config as prop…
-
### TL;DR
This is to rebrand the AMI DevX eBook assets that are still relevant and used in PG and demand generation campaigns. It requires that I edit the copy to remove any old Compuware references a…