sxm Search Results - Githubissues

808 results
for sxm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/nvtrust #67

Does NVIDIA H100-SXM 0x2330 support cc?

I launched the confidential VM, however the QEMU complained with ``` [ 349.818805] NVRM: failed to initialize module. [ 349.902869] NVRM: The NVIDIA GPU 0000:01:00.0 (PCI ID: 10de:2330) [ 349.9028…

jianlinjiang updated 20 hours ago
9
parker-stephens/siriusxm-activator #7

sxm-server thread

@Richardk80 I'm trying to get the updated requirements for latest version and get this error.... `pip install -r requirements.txt Collecting certifi==2022.12.7 Using cached certifi-2022.12.…

apexiptv updated 2 months ago
50
360CVGroup/FancyVideo #14

Running time on H100

Hi, we're running the demo script for 768x768 input image and it takes 22seconds to generate a 2 second clip, however we're running on an H100 SXM GPU. I was wondering if this generation time is norma…

harveymilk updated 1 week ago
3
bytedance/flux #34

[QUESTION] Why flux gemm_rs is not faster than torch?

**Your question** Ask a clear and concise question about Flux. ``` $./scripts/launch.sh test/test_gemm_rs.py 4096 12288 49152 --dtype=bfloat16 --iters=10 torchrun --node_rank=0 --nproc_per_node=…

hxdtest updated 1 week ago
5
NVIDIA/nvtrust #55

Pass-through cc-disabled H100 to a non-confidential VM

We are testing with SEV-SNP+H100. The cc mode with a single GPU works fine by following the deployment guide. Now we want to test non-cc mode with a regular VM. First we `--set-cc-mode=off`. ``` …

gzs715 updated 1 day ago
2
PaddlePaddle/Paddle #65994

Docker paddle 无法通过run_check()测试

### 请提出你的问题 Please ask your question 报错如下 [2024-07-12 08:34:51,881] [ WARNING] install_check.py:289 - PaddlePaddle meets some problem with 8 GPUs. This may be caused by: 1. There is not enough GPU…

yidu0924 updated 1 month ago
8
NVIDIA/TensorRT-LLM #1807

cluster key option not working?

Hi, I tried the `--cluster-key` option with trtllm-build. I did the conversion with A100-80gb-sxm, then tried to deploy it on L4 after converting using the L4 option and it failed when starting up t…

tonylek updated 1 month ago
5
skypilot-org/skypilot #3540

[Catalog/show-gpus] Combine RunPod's A100 and A100-SXM

Another issue, it seems we have A100 and A100-SXM separated for RunPod while combined for lambda labs. We probably need to separate them for all the clouds. _Originally posted by @Michaelvll in htt…

concretevitamin updated 4 months ago
1
pytorch/pytorch #132964

ROCm MI300X sum() way slower than H100

### 🐛 Describe the bug even tho on `Tensor.copy_` we see major improvements on BW on MI300X compared to H100. On a similar memory BW bound op like `sum()`, we were able to achieve a read bandwidth …

OrenLeung updated 19 hours ago
2
PaddlePaddle/PaddleNLP #8612

[Question]: 求助，chatglm2 单卡sft内存溢出

### 请提出你的问题报错如下 Error Message Summary: ---------------------- ResourceExhaustedError: Out of memory error on GPU 0. Cannot allocate 428.000000MB memory on GPU 0, 79.153320GB memory has been a…

yidu0924 updated 1 month ago
6

上一页 1...1 2 3 4 5 6 7...81 下一页

808 results for sxm

808 results
for sxm