latency-constrained Search Results

1000+ results for latency-constrained


vllm-project/vllm #1574

Support for sparsity?

Is it possible to do semi-structured sparsity for lower inference latency? Thanks!

BDHU updated 3 months ago
13
predibase/lorax #84

Does lorax currently support GPT2 finetuned adapters?

### System Info lorax:latest ### Information - [X] Docker - [ ] The CLI directly ### Tasks - [X] An officially supported command - [ ] My own modifications ### Reproduction @tga…

abhijithnair1 updated 9 months ago
19
arangodb/arangodb #9737

Feature Request: Low-overhead API

REST calls are okay for transactional requests, but due to the overhead associated with them, they're not ideal for high volume or low latency applications. For example, a single-read `return docu…

natejgardner updated 4 years ago
6
w3c/machine-learning-workshop #97

Action-Response Cycle bottlenecks in interactive music apps

The [Interactive ML - Powered Music Applications on the Web](https://www.w3.org/2020/06/machine-learning-workshop/talks/interactive_ml_powered_music_applications_on_the_web.html) talk by @teropa expla…

anssiko updated 4 years ago
1
Gl0dny/hexapod #35

Issue 26: KWS

- [ ] Train or download a KWS model for your hexapod's onboard computer. - [ ] Respond to keywords using pre-programmed responses or integrate with an AI like ChatGPT for dynamic conversation. - […

Gl0dny updated 4 days ago
5
ozwillo/ozwillo-datacore #30

Authentication - scalable auth

To be scalable, Datacore should not call Kernel for each HTTP request (ex. introspection endpoint to validate Bearer / access token header and get groups). Expiry time for this behaviour should be ex.…

mdutoo updated 9 years ago
1
redis/lettuce #2302

Feature request: Periodic flushing / auto-batching

## Feature Request ([branch](https://github.com/mtheos/lettuce-core/tree/auto-batch)) Hi team, have you considered periodic-flushing/auto-batching as a middle ground between auto and manual flushin…

mtheos updated 2 months ago
5
aws/containers-roadmap #1997

[EKS]: Add support for running WASM containers on EKS using …

### Community Note * Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…

bryantbiggs updated 1 year ago
4
retz/retz #157

only oldest jobs are passed to task planner

In my custom planner I want to sort jobs by priority, but it seems as if I don't get all of the jobs in the queue passed to `public Plan plan(Map offers, List jobs)`, but only the first *k* ones, wher…

tgpfeiffer updated 7 years ago
4
Azure/azure-sdk-for-js #4807

Direct mode / tcp support

As already raised in the "old" repository (https://github.com/Azure/azure-documentdb-node/issues/78), it would be a big improvement if the new library supported direct mode / TCP connections. The …

janis91 updated 2 months ago
25
