dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.
http://dgl.ai
Apache License 2.0
13.37k stars 3k forks source link

[GraphBolt] Runtime warning for CUDA availability mismatch. #7513

Closed mfbalin closed 2 months ago

mfbalin commented 2 months ago

Description

Should warn users against errors like in #7512 and #7494.

When there is no visible GPU to the process, torch.cuda.is_available() returns False. So we need to check discrepancy only if torch.cuda.is_available() is True.

Checklist

Please feel free to remove inapplicable items for your PR.

Changes

dgl-bot commented 2 months ago

To trigger regression tests:

dgl-bot commented 2 months ago

Commit ID: 773bf56aa1bfb3889cb6f474d777dd4672a6b67a

Build ID: 1

Status: ❌ CI test failed in Stage [Lint Check].

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: 7f8924a3485f56429bb409c17478ac14a1264a59

Build ID: 2

Status: ❌ CI test failed in Stage [Lint Check].

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: 3199650f8d2e946ee124e27d4986a590a6f8855b

Build ID: 3

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: 754fdd980163053eb248720c0f933388bbdb0a18

Build ID: 4

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: ee19a72cdc9d198a3289d74ca4827313cdd98f02

Build ID: 5

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: e8bc17fa77ff4a6a745ae68083514dd1cdac7715

Build ID: 6

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

dgl-bot commented 2 months ago

Commit ID: 826d8d066e45137a9c37034fac31976652653845

Build ID: 7

Status: ✅ CI test succeeded.

Report path: link

Full logs path: link