-
### System information
Currently, all new ONNX ops must be defined through the onnx/defs/ C++ API. If people want to reuse this mechanism to define their own domain and custom ops, it is not convenient. They code co…
-
### Search before asking
- [X] I have searched the HUB [issues](https://github.com/ultralytics/hub/issues) and found no similar bug report.
### HUB Component
Training
### Bug
![image]…
-
## Description
What is the solution for concurrent AI model inference in DJL? Can multiple threads access the same model at the same time? Is Nvidia Triton supported?
Will this change the current API? How…
-
Hello! First of all, great job with this inference engine! Thanks a lot for your work!
Here's my issue: I have run vllm with both a Mistral instruct model and its AWQ quantized version. I've quant…
-
I have a Tier 2 subscription for Anthropic with these limits:

| Model | Requests per Minute | Tokens per Minute | Tokens per Day |
| --- | --- | --- | --- |
| Claude 3.5 Sonnet 2024-10-22 | 1,000 | …
-
Collection could be carried out in parallel to inference.
Collection Queue:
1 - collect the comments by subreddit (can be multithreaded by subreddit) -> inference queue
2 - carry out edge disco…
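The parallel design above (collector threads feeding an inference queue while inference runs concurrently) can be sketched with Python's standard library. This is only a sketch: the subreddit names and the `collect_comments` helper are hypothetical placeholders, and `.upper()` stands in for real model inference.

```python
import queue
import threading

inference_queue = queue.Queue()
results = []
results_lock = threading.Lock()

def collect_comments(subreddit):
    # Hypothetical collector; in practice this would call the Reddit API.
    return [f"{subreddit}-comment-{i}" for i in range(3)]

def collector(subreddit):
    # Stage 1: collect comments for one subreddit and hand them off.
    # One thread per subreddit gives the "multithreaded by subreddit" fan-out.
    for comment in collect_comments(subreddit):
        inference_queue.put(comment)

def inference_worker():
    # Stage 2: consume comments as they arrive, in parallel with collection.
    while True:
        comment = inference_queue.get()
        if comment is None:  # sentinel: collection is finished
            inference_queue.task_done()
            break
        with results_lock:
            results.append(comment.upper())  # placeholder for model inference
        inference_queue.task_done()

subreddits = ["python", "machinelearning"]  # hypothetical subreddit list
collectors = [threading.Thread(target=collector, args=(s,)) for s in subreddits]

worker = threading.Thread(target=inference_worker)
worker.start()
for t in collectors:
    t.start()
for t in collectors:
    t.join()
inference_queue.put(None)  # all collectors done; tell the worker to stop
worker.join()
```

Because the worker pulls from the queue while collectors are still producing, inference overlaps collection instead of waiting for it to finish.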
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits of both this extension and the webui
### Have you updated WebUI and this exte…
-
### Self Checks
- [X] This template is only for bug reports. For questions, please visit [Discussions](https://github.com/fishaudio/fish-speech/discussions).
- [X] I have thoroughly reviewed the proj…
-
It has two endpoints, `/v1/completions` and `/v1/chat/completions`. The former doesn't work even with a custom channel; how should I add a channel?
```
curl --request POST \
--url https://api.fireworks.ai/inference/v1/completions \
-H 'Accept: application/json' \
-H…
-