offloading Search Results

1000+ results
for offloading

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

VectorSpaceLab/OmniGen #118

Mac M3 Metal Performance Shaders (MPS) support?

I followed this guide: https://developer.apple.com/metal/pytorch/ I can use comfyui with hardware offloading using MPS. OmniGen errors out with: ``` No avaliable GPU, offload_kv_cache wiil be se…

apiening updated 11 hours ago
2
vllm-project/vllm #2394

[Feature Request] Mixtral Offloading

There's a new cache technique mentioned in the paper https://arxiv.org/abs/2312.17238. (github: https://github.com/dvmazur/mixtral-offloading) They introduced LRU cache to cache experts based on patt…

shixianc updated 1 week ago
2
dandxy89/dotfiles #12

Offloading

- https://www.ralfj.de/blog/ - https://noidea.dog/glue - https://xlinux.nist.gov/dads/ - https://blog.sulami.xyz/posts/what-is-in-a-rust-allocator/ - https://quickwit.io/blog/performance-investiga…

dandxy89 updated 4 months ago
1
huggingface/diffusers #8989

Sequential offloading bug with Stable Audio

### Describe the bug Sequential offloading doesn't work when using `pytest`, but does seem to work outside of tests. This is an issue, because we can't properly test sequential offloading on Stabl…

ylacombe updated 1 week ago
2
pytorch/PiPPy #1126

CPU offloading?

It seems like pipelining could possibly greatly simplify the implementation of a feature such as fairscale's OffloadModel https://fairscale.readthedocs.io/en/latest/deep_dive/offload.html Is this s…

Xynonners updated 5 months ago
2
janhq/cortex.cpp #462

idea: Add GPU offloading for larger/MOE models (e.g. mixtral…

**Problem** Jan is great, but I'm limited o the number of models I can run on my 16GB GPU. I saw there is a project called [mixtral-offloading](https://github.com/dvmazur/mixtral-offloading) that cou…

poldon updated 2 months ago
1
bridgecrewio/checkov #6754

CKV_AWS_378 is triggering for SSL offloading of ECS services

**Describe the issue** CKV_AWS_378 triggers on configurations which have HTTP targets. But in a lot of cases SSL is offloaded on the load balancer level, and further targets use HTTP protocol to inte…

Shanjohn updated 13 hours ago
1
containers/podman-desktop-extension-ai-lab #1442

Estimating model offloading capabilities

### Is your feature request related to a problem? Please describe We removed the misleading indicator `CPU` on the Model tables. But it would be interesting for the user to have some indication if th…

axel7083 updated 3 months ago
4
zephyrproject-rtos/zephyr #76003

Feature request: offloading capability of the W5500 driver.

Hello, I realize, this is probably a significant feature request as it would need pretty big modification of the W5500 driver. **Is your enhancement proposal related to a problem? Please describe…

flabou updated 3 days ago
1
turboderp/exllamav2 #578

Will it support CPU offloading?

Hi, thanks for the great library! I have heard some people saying EXL2 being very fast, but I would like to try the 70B llama model on a 24GB 4090 card, so it cannot be fit into the GPU using e.g. 4bi…

fzyzcjy updated 3 months ago
5

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for offloading

1000+ results
for offloading