genai Search Results - Githubissues

1000+ results
for genai

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #136592

[ONNX] Single model export for HF models prompt and token ph…

### 🚀 The feature, motivation and pitch The kv_cache is used only during the token phase, not during the prompt phase. As a result, the exported model currently works only with one of these phases, d…

BowenBao updated 3 days ago
2
vanna-ai/vanna #523

Authentication via VertexAI

**Is your feature request related to a problem? Please describe.** When we try to use Gemini LLM we only have option to pass it as an `API KEY`, what if we have a json file through which we want to c…

BassCoder2808 updated 5 days ago
1
cg-dot/vertexai-cf-workers #18

"code": 429, "message": "Quota exceeded for aiplatfo…

"code": 429, "message": "Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase re…

jarlin8 updated 2 weeks ago
5
triton-inference-server/server #7182

Is onnxruntime-genai supported?

Hey all, I have a quick question, is onnxruntime-genai ([https://onnxruntime.ai/docs/genai/api/python.html](https://onnxruntime.ai/docs/genai/api/python.html)) supported in Triton Inference Server's O…

jackylu0124 updated 4 months ago
2
docker/genai-stack #95

Unable to Find libnvidia-ml.so.1 When Using "docker compose …

Here is the result of my command. Is this error inside the container or outside? The weird part to me is: **genai-stack-pull-model-1 | pulling ollama model llama2 using http://llm-gpu:11434** T…

medined updated 1 month ago
5
microsoft/onnxruntime-genai #526

ONNXRuntime-genai doesn't release GPU memory after first inf…

I'm not sure if my issue is related to the issue [446](https://github.com/microsoft/onnxruntime-genai/issues/446) but here is what I experienced. The first time I load an ONNXRuntime-genai model into …

Positronx updated 2 days ago
1
google/generative-ai-go #118

streaming requests with tools may return a 500 error under c…

### Description of the bug: ```go package main import ( "context" "fmt" "log" "github.com/google/generative-ai-go/genai" "google.golang.org/api/…

douglarek updated 3 months ago
3
microsoft/onnxruntime-genai #823

Some answers in phi3-vision just return </s>

In [phi-3 vision directml](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct-onnx-directml) using either python or c# certain questions just return `` For example "Why is the sky blue?" r…

elephantpanda updated 3 weeks ago
7
google-gemini/generative-ai-python #446

Function Calling Does Not Work With Stop Sequences and Strea…

### Description of the bug: Function calling does not work when providing `stop_sequences` and `stream=True`. ### Actual vs expected behavior: Actual: ```python import google.generativeai as ge…

collindutter updated 1 month ago
3
Azure-Samples/apim-genai-gateway-toolkit #52

Remove secrets from bicep deployment outputs

### This issue is for a: (mark with an `x`) ``` - [X] bug report -> please search issues before submitting - [ ] feature request - [ ] documentation issue or request - [ ] regression (a behavio…

stuartleeks updated 1 month ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for genai

1000+ results
for genai