-
### Your current environment
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 20.04.6 LTS (x86_64)
GCC ve…
```
-
It might be useful to be able to use the CPU cache sizes (L1/L2/L3) in the benchmark,
e.g. when deciding which maximal range to use.
I'm not sure how that should be exposed though.
It should be reachable wi…
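On Linux, the sizes are at least readable from sysfs; a minimal sketch of a hypothetical helper (Linux-only paths, not an existing API of the project):

```python
import os

def cpu_cache_sizes(base="/sys/devices/system/cpu/cpu0/cache"):
    """Return e.g. {"L1": "32K", "L2": "1024K", "L3": "36864K"} from Linux sysfs."""
    sizes = {}
    for entry in sorted(os.listdir(base)):
        if not entry.startswith("index"):
            continue
        path = os.path.join(base, entry)
        with open(os.path.join(path, "level")) as f:
            level = f.read().strip()
        with open(os.path.join(path, "size")) as f:
            size = f.read().strip()  # human-readable, e.g. "32K"
        # index0/index1 are both level 1 (data/instruction caches); keep the first.
        sizes.setdefault(f"L{level}", size)
    return sizes

print(cpu_cache_sizes())
```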
-
### 🐛 Describe the bug
The case comes from xpu [triton-benchmark](https://github.com/intel/intel-xpu-backend-for-triton/blob/llvm-target/benchmarks/triton_kernels_benchmark/flash_attention_fwd_benc…
-
### 🐛 Describe the bug
When running the code below, the Python interpreter hangs:
```python
from multiprocessing import get_context
from torch import randn

def worker(i):
    result = randn(1)
p…
```
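For comparison, a spawn-based variant that usually avoids this class of hang; a hypothetical completion, since the original snippet is truncated:

```python
from multiprocessing import get_context
from torch import randn

def worker(i):
    return randn(1)

if __name__ == "__main__":
    # "spawn" starts workers in fresh interpreters instead of forking the
    # parent, so torch's thread state is not inherited mid-flight.
    with get_context("spawn").Pool(2) as pool:
        print(pool.map(worker, range(2)))
```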
-
In opensora/serve/gradio_web_server.py, the following is referenced:
`text_encoder = MT5EncoderModel.from_pretrained("/storage/ongoing/new/Open-Sora-Plan/cache_dir/mt5-xxl", cache_dir=args.cache_dir,
…
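A hedged sketch of the less machine-specific alternative, assuming the local directory mirrors the `google/mt5-xxl` hub checkpoint (with `args.cache_dir` replaced by a literal for self-containment):

```python
from transformers import MT5EncoderModel

# Hypothetical: load by hub id rather than an absolute path that only
# exists on the original author's machine.
text_encoder = MT5EncoderModel.from_pretrained(
    "google/mt5-xxl",         # assumption: hub counterpart of the local dir
    cache_dir="./cache_dir",  # stand-in for args.cache_dir
)
```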
-
### 🐛 Describe the bug
When compiling flex_attention, any backward operation crashes with the error `BypassFxGraphCache: Can't cache HigherOrderOperators.`.
Without compile it's fine, but slow. I …
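A minimal repro sketch of what the report describes (hypothetical shapes; assumes a torch build that ships `torch.nn.attention.flex_attention` and a CUDA device):

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

q, k, v = (
    torch.randn(1, 4, 128, 64, device="cuda", requires_grad=True)
    for _ in range(3)
)

compiled = torch.compile(flex_attention)
out = compiled(q, k, v)
# Per the report, the backward pass is what triggers
# `BypassFxGraphCache: Can't cache HigherOrderOperators.`
out.sum().backward()
```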
-
### What is the issue?
I offloaded 47 out of 127 layers of Llama 3.1 405b q2 on an M3 Max with 64 GB of RAM.
When I run inference, the memory usage shows only about 8 GB, while the cached memory…
-
### Description
In https://github.com/dotnet/runtime/issues/48937 it was found that my gen0 budget was 32 MiB. Investigating this further, I believe it may even be as high as 64 MiB, which causes the …
-
### 🐛 Describe the bug
Running torch.divide with 0 as the denominator does not throw ZeroDivisionError on GPU, nor does it result in `inf`. Executing on CPU throws ZeroDivisionError as expected.
…
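A repro sketch consistent with that description (assumes integer dtypes, for which CPU division by zero raises; the truncated report does not show the exact dtypes):

```python
import torch

num = torch.tensor([1], dtype=torch.int64)
den = torch.tensor([0], dtype=torch.int64)

# CPU: raises RuntimeError: ZeroDivisionError
try:
    torch.divide(num, den, rounding_mode="trunc")
except RuntimeError as e:
    print("CPU:", e)

# GPU: per the report, completes silently with neither an error nor inf
if torch.cuda.is_available():
    print("GPU:", torch.divide(num.cuda(), den.cuda(), rounding_mode="trunc"))
```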
-
CI: https://buildkite.com/bazel/bazel-at-head-plus-downstream/builds/4081#01918cdc-edef-40fc-9a36-c1ec173e5a63
Platform: multiple
Logs:
```
ERROR: Traceback (most recent call last):
Er…
```