-
Is it possible to do semi-structured sparsity for lower inference latency? Thanks!
BDHU updated
3 months ago
-
### System Info
lorax:latest
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications
### Reproduction
@tga…
-
REST calls are okay for transactional requests, but due to the overhead associated with them, they're not ideal for high volume or low latency applications.
For example, a single-read `return docu…
-
The [Interactive ML - Powered Music Applications on the Web](https://www.w3.org/2020/06/machine-learning-workshop/talks/interactive_ml_powered_music_applications_on_the_web.html) talk by @teropa expla…
-
- [ ] Train or download a KWS model for your hexapod's onboard computer.
- [ ] Respond to keywords using pre-programmed responses or integrate with an AI like ChatGPT for dynamic conversation.
- […
-
To be scalable, Datacore should not call Kernel for HTTP each request (ex. introspection endpoint to validate Bearer / access token header and get groups). Expiry time for this behaviour should be ex.…
-
## Feature Request ([branch](https://github.com/mtheos/lettuce-core/tree/auto-batch))
Hi team, have you considered periodic-flushing/auto-batching as a middle ground between auto and manual flushin…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
In my custom planner I want to sort jobs by priority, but it seems as if I don't get all of the jobs in the queue passed to `public Plan plan(Map offers, List jobs)`, but only the first *k* ones, wher…
-
Like already issued in the "old" repository (https://github.com/Azure/azure-documentdb-node/issues/78) it would be a big improvement if the new library would support direct mode / tcp connection. The …