-
### Description
I’m experiencing issues when trying to connect CrewAI with Azure OpenAI. After some investigation, I found that only version 0.11 works with Azure OpenAI, but unfortunately, this v…
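For anyone hitting this, a minimal sketch of how Azure OpenAI is typically wired into CrewAI (0.11-era) through LangChain's `AzureChatOpenAI`; the API version, deployment name, and agent fields below are placeholders:
```
import os
from langchain_openai import AzureChatOpenAI
from crewai import Agent

# Point the LLM at an Azure OpenAI deployment (placeholder values).
llm = AzureChatOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-15-preview",      # placeholder API version
    azure_deployment="gpt-4o-deployment",  # placeholder deployment name
)

# Pass the LLM explicitly so the agent does not fall back to the default OpenAI client.
researcher = Agent(
    role="Researcher",
    goal="Answer questions",
    backstory="Test agent for the Azure OpenAI connection.",
    llm=llm,
)
```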
-
### Description
The Cohere rerank implementation allows configuring fields that probably don't apply. The implementation leverages the common settings here: https://github.com/elastic/elasticsearch/b…
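For context, a rerank endpoint for this service is created roughly like this (a sketch using `requests` against a local cluster; host, credentials, and model name are placeholders, and the exact set of accepted `service_settings` fields is the point under discussion):
```
import requests

# Create a Cohere rerank inference endpoint (placeholder host and credentials).
resp = requests.put(
    "http://localhost:9200/_inference/rerank/cohere-rerank-demo",
    json={
        "service": "cohere",
        "service_settings": {
            "api_key": "<cohere-api-key>",
            "model_id": "rerank-english-v3.0",
        },
    },
    auth=("elastic", "<password>"),
    timeout=30,
)
print(resp.status_code, resp.json())
```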
-
Hi,
When I run the exo command on macOS and start inference using the completion REST API endpoint, the Python process seems to use more and more memory.
I have put a delay…
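A sketch of the kind of loop that shows the growth: repeatedly call the (ChatGPT-compatible) completion endpoint with a delay between requests and log the resident memory of the exo Python processes. The port, model id, and process matching below are assumptions/placeholders:
```
import time
import psutil
import requests

EXO_URL = "http://localhost:52415/v1/chat/completions"  # placeholder port

def exo_rss_mib():
    # Sum resident memory over Python processes whose command line mentions "exo".
    total = 0
    for p in psutil.process_iter(["name", "cmdline", "memory_info"]):
        cmdline = " ".join(p.info["cmdline"] or [])
        if "exo" in cmdline and "python" in (p.info["name"] or "").lower():
            total += p.info["memory_info"].rss
    return total / (1024 * 1024)

for i in range(100):
    requests.post(
        EXO_URL,
        json={
            "model": "llama-3.2-3b",  # placeholder model id
            "messages": [{"role": "user", "content": f"Request {i}: say hi"}],
        },
        timeout=120,
    )
    print(f"after request {i}: {exo_rss_mib():.1f} MiB")
    time.sleep(5)  # delay between requests, as described above
```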
-
Test out various APIs, figure out which one is accurate for English with minimal latency, and integrate it.
A separate issue will be created for in-house English ASR models hosted with in…
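A rough harness for that comparison could look like the sketch below: measure wall-clock latency per request and word error rate against references, with `transcribe_with_api` standing in for whichever provider SDK or endpoint is being evaluated (all names and clips are placeholders):
```
import time
from statistics import mean

import jiwer  # word error rate

def transcribe_with_api(api_name: str, audio_path: str) -> str:
    # Placeholder adapter: call the provider's SDK or REST endpoint here
    # and return the transcript. Returns an empty string until wired up.
    return ""

# (audio clip, reference transcript) pairs -- placeholders.
clips = [("sample1.wav", "hello world"), ("sample2.wav", "good morning")]

for api in ["provider_a", "provider_b"]:  # placeholder API names
    latencies, wers = [], []
    for path, reference in clips:
        start = time.perf_counter()
        hypothesis = transcribe_with_api(api, path)
        latencies.append(time.perf_counter() - start)
        wers.append(jiwer.wer(reference, hypothesis))
    print(f"{api}: mean latency {mean(latencies):.2f}s, mean WER {mean(wers):.3f}")
```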
-
### Elasticsearch Version
serverless
### Installed Plugins
_No response_
### Java Version
_bundled_
### OS Version
N/A
### Problem Description
When trying to create an inference endpoint usin…
-
So far I've ported the following models to Java:
Llama 3 & 3.1, Mistral/Codestral/Mathstral/Nemostral (+ Tekken tokenizer), Qwen2, Phi3 and Gemma 1 & 2 ...
All models are bundled as a single ~2K li…
-
Hi guys,
If I simply install the library with `pip install timesfm` and try the example code described at https://huggingface.co/google/timesfm-1.0-200m:
```
import timesfm
tfm = timesfm.TimesFm…
```
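For context, the model-card example continues roughly as below (reconstructed from memory of the timesfm-1.0 API, so constructor arguments and the forecast signature may differ from the actual card):
```
import numpy as np
import timesfm

# Reconstruction of the model-card example (parameter values may differ).
tfm = timesfm.TimesFm(
    context_len=128,
    horizon_len=96,
    input_patch_len=32,
    output_patch_len=128,
    num_layers=20,
    model_dims=1280,
    backend="cpu",
)
tfm.load_from_checkpoint(repo_id="google/timesfm-1.0-200m")

forecast_input = [np.sin(np.linspace(0, 20, 128))]  # one toy series
point_forecast, quantile_forecast = tfm.forecast(forecast_input, freq=[0])
print(point_forecast.shape)
```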
-
### Describe the issue
According to [TensorRT EP docs](https://onnxruntime.ai/docs/execution-providers/TensorRT-ExecutionProvider.html) one should do symbolic shape inference before executing the mod…
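For reference, that offline shape inference step can be run either via the bundled symbolic_shape_infer.py script mentioned in the docs or from Python; a minimal sketch (paths are placeholders):
```
import onnx
from onnxruntime.tools.symbolic_shape_infer import SymbolicShapeInference

# Run symbolic shape inference before handing the model to the TensorRT EP.
model = onnx.load("model.onnx")
inferred = SymbolicShapeInference.infer_shapes(model, auto_merge=True)
onnx.save(inferred, "model.shape_inferred.onnx")
```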
-
**Describe the bug**
I set up a proxy in the container where Flowise is deployed, but Flowise still times out when accessing the Hugging Face Inference API. How can this be solved?
**To Reproduce**
…
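One quick check (run from the container's network, or from the host with the same proxy settings) is whether the proxy reaches Hugging Face at all; the env var names below are the conventional HTTP(S)_PROXY ones, and whether Flowise itself picks them up is a separate question:
```
import os
import requests

# Verify the proxy can reach Hugging Face (the Hub API is used here only as a reachability probe).
proxies = {
    "http": os.environ.get("HTTP_PROXY", ""),
    "https": os.environ.get("HTTPS_PROXY", ""),
}
resp = requests.get(
    "https://huggingface.co/api/models?limit=1",
    proxies=proxies,
    timeout=30,
)
print(resp.status_code)
```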
-
Running the Jupyter notebook for the LLaVA model:
https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/llava-multimodal-chatbot/llava-multimodal-chatbot-genai.ipynb
- Device: Arc 770 d…