-
Currently, in this R version of `bayesbench`, inference engines are defined like this:
```R
stan_vb
```
-
Hi,
Are you planning to make textgrad LLM calls asynchronous?
I tried to start adding asynchronous methods to make at least the evaluation calls and inference (everything that is forward) asynchrono…
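A minimal sketch of the pattern I have in mind, assuming an async wrapper around a single LLM call (the names here are hypothetical, not textgrad's actual API):
```python
import asyncio

async def acall_llm(prompt: str) -> str:
    # Hypothetical async wrapper around one LLM call; in practice this would
    # await an async client (e.g. an OpenAI-style async SDK).
    await asyncio.sleep(0.1)  # stand-in for network latency
    return f"response to: {prompt}"

async def evaluate_batch(prompts: list[str]) -> list[str]:
    # Evaluation/forward calls are independent of each other,
    # so they can be issued concurrently instead of one by one.
    return await asyncio.gather(*(acall_llm(p) for p in prompts))

if __name__ == "__main__":
    print(asyncio.run(evaluate_batch(["a", "b", "c"])))
```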
-
There are a number of issues with the current TRT acceleration path in MONAI:
- For some networks it's only practical/possible to trace/export certain sub-module, like image_encoder. Current solution r…
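For illustration, a minimal sketch of exporting only a sub-module to ONNX, which is the kind of partial export described above (the toy network and shapes below are my own placeholders, not MONAI code):
```python
import torch
import torch.nn as nn

class ToyNet(nn.Module):
    """Stand-in for a larger network whose image_encoder is the only part we export."""
    def __init__(self):
        super().__init__()
        self.image_encoder = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1)
        )
        self.head = nn.Linear(16, 10)

    def forward(self, x):
        return self.head(self.image_encoder(x).flatten(1))

model = ToyNet().eval()
example_input = torch.randn(1, 3, 224, 224)

# Export just the sub-module; a TRT engine can then be built from this ONNX file.
torch.onnx.export(
    model.image_encoder,
    example_input,
    "image_encoder.onnx",
    input_names=["image"],
    output_names=["features"],
    dynamic_axes={"image": {0: "batch"}},
)
```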
-
### System Info
NVIDIA GeForce 4090 GPU
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported …
-
### Jan version
0.5.4 AppImage
### Describe the Bug
I can't start any local models on my machine after the latest update. The previous version worked fine with various models.
### Steps to Reprodu…
-
Hello,
I am running some latency benchmarks using TensorRT-LLM on a Mistral 7B Instruct v0.3 model. My hope was that at small batch sizes the overall inference latency would not be impacted as much,…
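As a rough sketch of the kind of measurement (the `generate` call below is a placeholder standing in for the actual TensorRT-LLM engine, not its real API):
```python
import time

def generate(prompts):
    """Placeholder for the real engine call; toy latency model for illustration only."""
    time.sleep(0.05 + 0.005 * len(prompts))
    return ["output"] * len(prompts)

for batch_size in (1, 2, 4, 8):
    prompts = ["Explain KV caching."] * batch_size
    start = time.perf_counter()
    generate(prompts)
    elapsed = time.perf_counter() - start
    # If batching is efficient, total latency at small batch sizes should stay
    # close to the batch-of-1 latency.
    print(f"batch={batch_size:2d}  total={elapsed * 1000:6.1f} ms  "
          f"per-request={elapsed * 1000 / batch_size:6.1f} ms")
```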
-
**Is your feature request related to a problem? Please describe.**
From a tooling standpoint, we need the ability to discover all running LLM endpoints, so we can pick one and use it as an AI assista…
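A minimal sketch of the kind of discovery we have in mind, assuming OpenAI-compatible servers that expose a `/v1/models` listing (the ports below are just common local defaults, not an exhaustive registry):
```python
import requests

CANDIDATE_BASE_URLS = [
    "http://localhost:8000/v1",   # e.g. a vLLM server on its default port
    "http://localhost:11434/v1",  # e.g. Ollama's OpenAI-compatible endpoint
    "http://localhost:1234/v1",   # e.g. LM Studio's default local server
]

def discover_endpoints():
    """Probe candidate base URLs and return those answering the /models listing."""
    found = []
    for base in CANDIDATE_BASE_URLS:
        try:
            resp = requests.get(f"{base}/models", timeout=1)
            if resp.ok:
                models = [m.get("id") for m in resp.json().get("data", [])]
                found.append((base, models))
        except requests.RequestException:
            continue
    return found

if __name__ == "__main__":
    for base, models in discover_endpoints():
        print(base, models)
```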
-
I'm not quite familiar with the Transformer model. There are more steps involved than in other models because of the Encoder and Decoder; for example, the last encoder block's output needs to be used as the input for the nex…
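For what it's worth, a minimal PyTorch sketch of that wiring (my own illustration, not from any particular repo): the last encoder block's output, `memory`, is fed into the cross-attention of every decoder layer.
```python
import torch
import torch.nn as nn

d_model = 512
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True), num_layers=6
)
decoder = nn.TransformerDecoder(
    nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True), num_layers=6
)

src = torch.randn(2, 10, d_model)  # (batch, source length, d_model), already embedded
tgt = torch.randn(2, 7, d_model)   # (batch, target length, d_model), already embedded

memory = encoder(src)       # output of the final encoder block
out = decoder(tgt, memory)  # each decoder layer cross-attends to the same memory
print(out.shape)            # torch.Size([2, 7, 512])
```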
-
### System Info
py3.10
infinity-emb 0.0.55
Running with the optimum engine fails:
```
INFO 2024-09-13 15:17:02,874 datasets INFO: PyTorch version 2.4.0 available. …
```
-
## Use case
Following up on #6805
It would be useful to be able to create custom suggestions in the platform, rather than asking the OpenCTI developers to include new ones on a case by case basi…