-
http://127.0.0.1:8000/tmp/benchmark/QuickBenchmarking/
QAIC
-
The onnx file in the examples is work:
https://github.com/microsoft/onnxruntime-inference-examples/tree/main/mobile/examples/object_detection/android
but I use oridinal yolov8n.pt to transfor yolov8…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### 🐛 Describe the bug
This issue is introduced by `block_softmax` kernel(part of `flat_pa`, see #169 )
For some …
-
This is a pre-requisite for the unified knowledge base.
Being able to use semantic search within a cluster has some prerequisites: it requires an inference endpoint deployed on the cluster, and deplo…
-
## Description
I am following this doc: https://awslabs.github.io/data-on-eks/docs/gen-ai/inference/GPUs/vLLM-rayserve
Once I run
```
cd data-on-eks/gen-ai/inference/vllm-rayserve-gpu
en…
-
### 🥰 Feature Description
Please consider adding the ability to display the inference speed for each interaction with the AI model.
### 🧐 Proposed Solution
This could be presented in a f…
-
**What would you like to be added/modified**:
Sedna is an edge-cloud synergy AI project incubated in KubeEdge SIG AI. Benefiting from the edge-cloud synergy capabilities provided by KubeEdge, Sed…
-
## タイトル: サンプルの因果関係の交絡を解消し、自己注意を適応的に抑制することによる顔動作ユニット検出。
## リンク: https://arxiv.org/abs/2410.01251
## 概要:
顔動作ユニット (AU) 検出は、AU の微妙さ、動性、多様性のために、依然として困難な課題です。近年、自己注意と因果推論という有力な技術が AU 検出に導入されました。しかし、既存の方…
-
## ❓ General Questions
I have surely installed tvm in my device which has an arm64 on it and I want to run mlc_llm on my device to do model inference. But when I installed mlc_llm on my device li…
-
### What behavior of the library made you think about the improvement?
I need to install torch, transformers, accelerate etc. even if I want to use outlines only with llamacpp backend.
Are these d…