-
Based on my understanding, current strategy:
1. All ranks currently read and load the checkpoint.
2. All ranks also save and write the checkpoint.
I have a question regarding the HSDP case:
If…
-
### 🐛 Describe the bug
The loaded state dict is silently empty if the parameter still has grads. (Testcase modified from some dcp async_save tutorial.)
```
import os
import torch
import torch…
-
Kidney bionetwork asked us to provide all the Tier 1 fields that we already have, to help contributors fill the Tier 1 fields.
We decided to try to demo that on Krishna et al (#996) to see how long t…
-
### Is there an existing issue for this?
- [x] I have searched the existing issues
### Describe the bug
Visiting the console logs page causes Aspire to load logs. For DCP resources, the host calls …
-
Running with podman on Fedora 40:
```csharp
var minio = builder
.AddContainer("minio", "minio/minio")
.WithEnvironment("MINIO_ROOT_USER", "blah")
.WithEnvironment("MINIO_ROOT_PASSWO…
-
I would like to suggest something for co-distribution pacts or international sub-distribution, **JUST** in case it hasn't happened already. If the studio is not placed under the same (partial) owners…
-
launchSettings.json
``` json
{
"$schema": "http://json.schemastore.org/launchsettings.json",
"profiles": {
"http": {
"commandName": "Project",
"dotnetRunMessages": true,
…
-
Tracks TODO in code.
-
### Is there an existing issue for this?
- [x] I have searched the existing issues
### Describe the bug
This issue occurs when I do `dotnet run` in the AppHost project, It does not affect the runni…
-
### 🚀 The feature, motivation and pitch
DCP lacks support for objects which are lazy initialized. E.g.:
```
import copy
import torch.distributed.checkpoint as dcp
dl_state = {"dl_state": {"data…