Open willyawan16 opened 1 week ago
Can you please double check that the path in data.data_dir=./data/ahmed_body
is correct? Maybe try using absolute path just to check?
Also, try limiting number of dataset pre-fetching workers: data.train.num_workers=1
Finally, see if the example works with reduced dataset, for example, to use only 2 train samples: data.train.num_samples=2
HYDRA_FULL_ERROR=1 python train.py +experiment=ahmed/mgn data.data_dir=/home/willy/modulus/modulus/examples/cfd/aero_graph_net/data/ahmed_body data.train.num_workers=1 data.val.num_workers=1 data.test.num_workers=1 data.train.num_samples=10 data.val.num_samples=5 data.test.num_samples=5
I changed my command as the above, and it passed the dataset loading problem. But why when I try to change the num_samples higher than that, it returns the same error?
So anything greater than 10 in data.train.num_samples
causes that error to appear?
From the error itself, it looks like something happens during dataset pre-loading in one of the graph loading processes.
Unfortunately, I could not reproduce the issue on my side.
You can try adding some simple print
s to create_graph function to see if there is a particular file or place where the error occurs (and keep num_workers=1
to simplify the debugging).
Also, which environment does this issue happen in?
Version
0.8.0
On which installation method(s) does this occur?
No response
Describe the issue
Failed to load dataset when trying to train aero_graph_net. Is there any way to fix this?
it stuck in the hydra instantiation as shown in the error log.
Minimum reproducible example
Relevant log output
Environment details