-
mentioned in #108. Currently we don't have an inference API like the `pipeline` from Hugging Face Transformers. Right now you need to manually load the model/tokenizer, apply them on the input data, a…
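For context, the convenience a `pipeline`-style API adds is bundling the load/encode/forward/decode steps that users currently wire up by hand behind one callable. A framework-agnostic sketch of that pattern (all class and method names here are illustrative, not an existing API):

```python
class TextPipeline:
    """Minimal pipeline pattern: wrap a tokenizer and a model behind one call."""

    def __init__(self, tokenizer, model):
        self.tokenizer = tokenizer
        self.model = model

    def __call__(self, text):
        # The three steps users otherwise perform manually:
        # encode the input, run the model, decode the output.
        ids = self.tokenizer.encode(text)
        output_ids = self.model.generate(ids)
        return self.tokenizer.decode(output_ids)
```

A real implementation would also handle batching and device placement, but the value is the same: one object owns the tokenizer/model pair and hides the glue code.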
-
### Motivation
Hello. I see from the docs that input logprobs are supported in offline inference mode. Is this also supported when deploying via the API server? If not, is there a plan to add it soon?
### Related resources
#2041
### Additional context
_No response_
-
In case it's helpful for others using these tests, I solved an error in the Serverless client with our [Inference](https://github.com/elastic/elasticsearch-clients-tests/blob/main/tests/inference/10_b…
-
Hi all!
Here are instructions for integrating the Groq API with Verba.
Obtain an API key at https://console.groq.com/login
1. `pip install groq`
2. Create "GroqGenerator.py" at goldenverba/compon…
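As a rough sketch of what such a generator ultimately sends: Groq exposes an OpenAI-compatible chat-completions endpoint, so the payload mirrors the OpenAI chat format. The helper below only assembles the request, it does not send it, and `build_groq_request` is an illustrative name, not part of Verba or the `groq` package:

```python
import json

# Groq's OpenAI-compatible chat completions endpoint.
GROQ_URL = "https://api.groq.com/openai/v1/chat/completions"

def build_groq_request(api_key, model, messages):
    """Assemble the URL, headers, and JSON body for a Groq chat completion.

    messages follows the OpenAI chat format: a list of
    {"role": ..., "content": ...} dicts.
    """
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "messages": messages})
    return GROQ_URL, headers, body
```

In practice the `groq` client from step 1 handles this for you; the point is that any generator class only needs the model name, the message list, and the API key.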
-
### Description
Large documents need to be chunked; otherwise, tokens beyond the model's limit won't be used.
MVP
- Use a sliding window approach
- Chunk into 200 words
- Try splitting on wh…
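The sliding-window MVP above could be sketched roughly as follows (a minimal sketch only: the 200-word chunk size comes from the list, the overlap value and the `chunk_words` name are illustrative, and whitespace splitting stands in for the smarter splitting the last bullet asks for):

```python
def chunk_words(text, size=200, overlap=50):
    """Split text into overlapping word windows (sliding-window chunking).

    size: words per chunk; overlap: words shared by consecutive chunks,
    so each window starts (size - overlap) words after the previous one.
    """
    words = text.split()  # naive whitespace split; stands in for smarter splitting
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the final window already covers the tail of the document
    return chunks
```

The overlap is what distinguishes a sliding window from plain fixed-size chunking: sentences that straddle a chunk boundary still appear intact in at least one window.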
-
### Description
The inference API doesn't expose some of the query parameters that the ML trained models API provides for managing asynchronous tasks. It would be helpful for users if the A…
-
### Discussed in https://github.com/danielmiessler/fabric/discussions/544
Originally posted by **xJohnWhite** June 4, 2024
# Summary
Using an inside-my-network OpenAI API-compatible inferenc…
-
Attempts to call `YggdrasilModel.predict` in a Node.js REPL yield this exception: `[ERR_INVALID_REPL_INPUT]: Listeners for uncaughtException cannot be used in the REPL`. We're using version `0.0.2` of…
-
File "/home/huyi/anaconda3/envs/tts/lib/python3.11/site-packages/gradio/queueing.py", line 532, in process_events
response = await route_utils.call_process_api(
^^^^^^^^^^^^^^^^…
-
Are there docs on best practices for using vLLM-hosted models?
I create a model using
`python -m vllm.entrypoints.openai.api_server --model model_path`
and try running it as
`lm_eval --model lo…`