-
### Issue type
Documentation Feature Request
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
source
### TensorFlow version
2.10.1
### Custom code
Yes
### OS platform …
-
## Bug Report
Does TensorFlow Serving support XLA-compiled SavedModels, or am I doing something wrong?
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: [D…
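A minimal export sketch of what the question seems to describe: a SavedModel whose serving function requests XLA compilation via `jit_compile=True`. The module class, tensor shapes, and output path here are made-up placeholders, not taken from the report:

```python
import tensorflow as tf

class AddOne(tf.Module):
    @tf.function(jit_compile=True)  # request XLA compilation for this function
    def __call__(self, x):
        return x + 1.0

module = AddOne()
# Trace a concrete signature so the SavedModel has a serving entry point.
concrete = module.__call__.get_concrete_function(
    tf.TensorSpec([None], tf.float32))
tf.saved_model.save(module, "/tmp/add_one_xla", signatures=concrete)

# Reload to confirm the exported signature still runs in-process;
# whether TF Serving honors the jit_compile attribute is the open question.
reloaded = tf.saved_model.load("/tmp/add_one_xla")
out = reloaded.signatures["serving_default"](tf.constant([1.0, 2.0]))
```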
-
### Feature description
This is opened as a feature request just to keep track of things.
The recently-released ControlNet Omni model may require a specific Diffusers pipeline (or similar approaches) f…
-
### OS Platform and Distribution
Ubuntu 22.04, Android 14
### Compiler version
Build failure with Clang 9.0.0
### Programming Language and version
C++, Java, Python 3.10
### Installed using virt…
-
### Describe the issue
After quantization, the output ONNX model had faster inference speed and smaller model size, but why are the input and output tensors still float32?
I thought it should be u…
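This float32 interface is expected for most quantization recipes: the tools insert QuantizeLinear/DequantizeLinear nodes at the graph boundaries, so the int8 math stays internal while inputs and outputs remain float32. A pure-NumPy sketch of what those boundary nodes do (the scale and zero point below are illustrative values, not from the issue's model):

```python
import numpy as np

def quantize_linear(x, scale, zero_point):
    # Mirrors ONNX QuantizeLinear: float32 in, uint8 out.
    q = np.round(x / scale) + zero_point
    return np.clip(q, 0, 255).astype(np.uint8)

def dequantize_linear(q, scale, zero_point):
    # Mirrors ONNX DequantizeLinear: uint8 in, float32 out.
    return ((q.astype(np.int32) - zero_point) * scale).astype(np.float32)

# The model's public interface stays float32; the uint8 tensor is internal.
x = np.array([-1.0, 0.0, 0.5, 1.0], dtype=np.float32)
scale, zero_point = 1.0 / 127, 128
q = quantize_linear(x, scale, zero_point)    # internal quantized tensor
y = dequantize_linear(q, scale, zero_point)  # float32 again at the output
```

Getting genuinely uint8 inputs/outputs usually requires an explicit option or graph surgery to strip the boundary Q/DQ nodes.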
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
I have a few questions about the inference efficiency of DeepSeek-V2
1.
> In order to efficiently deploy DeepSeek-V2 for service, we first convert its parameters into the precision of FP8.
Ar…
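For context on the quoted FP8 conversion step: a common recipe is per-tensor symmetric scaling of the weights onto the FP8 E4M3 representable range (max finite value 448). A NumPy sketch of just the scaling half; the actual lossy cast to an fp8 dtype happens on hardware and is not simulated here, and nothing below is specific to DeepSeek-V2's implementation:

```python
import numpy as np

E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3

def fp8_scale(weights):
    # Per-tensor symmetric scale mapping the weight range onto [-448, 448].
    return np.max(np.abs(weights)) / E4M3_MAX

def to_fp8_domain(weights, scale):
    # Scale into the FP8 range; real deployments then cast this tensor to
    # an fp8 dtype, which is the lossy mantissa-rounding step (omitted).
    return np.clip(weights / scale, -E4M3_MAX, E4M3_MAX)

def from_fp8_domain(q, scale):
    # Dequantize back to the original value range.
    return q * scale

w = np.array([-2.0, -0.5, 0.25, 1.0], dtype=np.float32)
s = fp8_scale(w)
w_rt = from_fp8_domain(to_fp8_domain(w, s), s)
# Round-trips (nearly) exactly here only because the fp8 cast is omitted.
```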
-
### Describe the issue
I exported my medium Whisper model successfully, and inference returned the correct answer. After that, I optimized the model by running the command line: `python -m onnxrunti…
-
### Describe the issue
I have an OCR model with the following architecture: ResNet-BiLSTM-CTC
OS environment:
+ cuda:11.6.2
+ python 3.7
+ onnxruntime-gpu==1.14.1
+ torch 1.10.0 cpu
cuda_pro…
-
### 🐛 Describe the bug
Calling `torch.autograd.functional.jacobian` inside inference mode silently returns all zeros. I'm not sure whether this is the intended behavior; the documentation states:
'E…
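For reference, a minimal sketch of the expected result when the same call is made outside inference mode (the function `f` below is a made-up example, not from the report; per the report, wrapping the call in `torch.inference_mode()` may instead yield an all-zero Jacobian, which can be version-dependent):

```python
import torch

def f(x):
    return x ** 2  # elementwise, so the Jacobian is diag(2 * x)

x = torch.tensor([1.0, 2.0, 3.0])

# Computed outside inference mode, autograd tracks the graph and the
# Jacobian comes out as expected.
jac = torch.autograd.functional.jacobian(f, x)
expected = torch.diag(2 * x)
```

One workaround consistent with the docs is to keep autograd-dependent calls like this outside `torch.inference_mode()` blocks, since inference tensors are excluded from autograd tracking.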