Open yaqlee opened 1 year ago
By the way, when I run training on my local machine, the dataset is very small, because I use 'scenario_builder=nuplan_mini' and 'scenario_filter.limit_total_scenarios=0.001'.
Hi @yaqlee,
Do you have the same requirements and dependencies installed on your remote machines?
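One quick way to check this is to print the installed versions on both machines and diff the output. The snippet below is only a minimal sketch that uses the Python standard library. The package names in the list are assumptions about what your setup depends on, so swap in whatever is in your own requirements file. The helper names `collect_versions` and `environment_report` are made up for illustration.

```python
import platform
from importlib import metadata


def collect_versions(packages):
    """Map each package name to its installed version, or None if it is not installed."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def environment_report(packages):
    """Combine interpreter and OS details with the package versions."""
    report = {
        "python": platform.python_version(),
        "platform": platform.platform(),
    }
    report.update(collect_versions(packages))
    return report


if __name__ == "__main__":
    # Assumed package list; replace with the entries from your requirements file.
    packages = ["torch", "pytorch-lightning", "hydra-core", "numpy"]
    for key, value in environment_report(packages).items():
        print(f"{key}: {value}")
```

Run it once locally and once on the cluster, then compare the two outputs line by line. A mismatch in the Python or torch version would be the first thing to rule out.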
P.S. It is admittedly very difficult for us to help you with your custom code and custom cloud setup. We'll try our best to answer what we can.
I wrote a feature builder and a target builder, and created a model class that inherits from TorchModuleWrapper. I also wrote a new objective class for my model. These components run without errors on my local machine, but I encountered the following error when running run_training on a cloud GPU cluster:
I'm wondering what might be causing this error. Could it be because my model code doesn't follow the libtorch convention of specifying data types for each function's parameters? Any insights or suggestions on how to resolve this issue would be greatly appreciated.
Thank you!