inference-server Search Results

1000+ results
for inference-server

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kubeedge/sedna #289

bug from image "sedna-example-joint-inference-helmet-detecti…

I am deploying Example1：[Using Joint Inference Service in Helmet Detection Scenario](https://github.com/kubeedge/sedna/blob/main/examples/joint_inference/helmet_detection_inference/README.md). edge…

zz952332446 updated 2 years ago
9
arenasys/qDiffusion #88

Colab error

Appears when generating links > Traceback (most recent call last): > File "/content/sd-inference-server/remote.py", line 8, in > import storage > File "/content/sd-inference-server/stor…

eyetest42 updated 2 days ago
5
chroma-core/chroma #1367

[Feature Request]: Hugging Face text embedding inference cu…

### Describe the problem Feature request for https://github.com/huggingface/text-embeddings-inference, it new and cool good to have it. ### Describe the proposed solution create a custom Embe…

Crispae updated 1 year ago
1
vllm-project/vllm #4194

[RFC]: Multi-modality Support Refactoring

[[Open issues - help wanted!]](https://github.com/vllm-project/vllm/issues/4194#issuecomment-2102487467) **Update [9/8] - We have finished majority of the refactoring and made extensive progress fo…

ywang96 updated 1 day ago
90
bigscience-workshop/petals #12

Roadmap (tentative)

__Current tasks:__ - [ ] prototype bloom points system @borzunov (#6 ) - [x] local tensor parallelism ( #143 , using [BlackSamorez/tesnor_parallel](https://github.com/BlackSamorez/tensor_parall…

justheuristic updated 1 year ago
4
triton-inference-server/server #7386

Triton Rust Crate as In-Process Inference Engine

**Is your feature request related to a problem? Please describe.** Rust API for Triton Server to integrate Triton in-process with a Rust Server Rust is now a universally recommended language to deve…

asamadiya updated 3 months ago
2
triton-inference-server/tensorrtllm_backend #226

llama docs

https://github.com/triton-inference-server/tensorrtllm_backend/blob/main/docs/llama.md if possible add speculvative decoding example in llama docs.

MrD005 updated 10 months ago
3
Azure/azureml-examples #2192

The Forecast TCN model deployment through the UI does not wo…

### Operating System Windows ### Version Information Recently we have discovered a problem due to the error in the DNN scoring script file. **Please see the workaround in Additional information sec…

nick863 updated 1 year ago
1
cyan2k/molmo-7b-bnb-4bit #3

Assertion `-sizes[i] <= index && index < sizes[i] && "index …

Thanks for sharing your script to run the 4-bit quantized molmo-7b. Unfortunately, I am unable to run it on my server (Ubuntu 22.04 with 2x RTX A5000 48 GB VRAM) - the error trace is below. I wonde…

maxruby updated 1 month ago
4
DeepMReye/DeepMReye #81

Deepmreye_example_usage_pretrained_weights fails at train mo…

I am having some issues with the DeepMreye demo using the exemplary data from the 2 first participants from the sample dataset as instructed in the notebook "deepmreye_example_usage_pretrained_model_w…

angusolav updated 1 month ago
13

上一页 1...68 69 70 71 72 73 74...100 下一页

1000+ results for inference-server

1000+ results
for inference-server