Closed walterchenchn closed 3 years ago
Dear authors, thank you for sharing your code. I downloaded the dataset from Argoverse and now want to preprocess the data. When I run 'python preprocess_data.py -m lanegcn', it takes 5 hours and outputs nothing; the CPU usage is high but the GPU usage is low. Thank you very much!
It is a problem but not an error.
The first thing to clear up is that preprocessing should not consume GPU resources: it runs on the CPU, not the GPU, because it involves little matrix computation. That is why your CPU usage is high and your GPU usage is low.
In my opinion, the data preprocessing strategy the authors provide is not very elegant. I preprocessed the data with the authors' script and it did work, but as you said, it took a long time (for the evaluation set alone) and consumed a lot of memory. The original strategy keeps all the preprocessed samples in memory and dumps one very large output file at the end, which causes 3 problems:
Later I improved the data preprocessing script myself. If you really want to preprocess the data yourself, I can give you some advice:
I hope the authors can update the preprocessing part.
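For illustration, here is a minimal sketch of one alternative to the "keep everything in memory, dump once" strategy described above: write each sample to its own file as soon as it is processed. This is an assumption about what an improved script could look like, not the repository's code and not necessarily what was done in this comment. `process_sample`, `preprocess_to_files`, and `load_sample` are hypothetical names, and the dummy sample contents stand in for the real per-sample preprocessing.

```python
import os
import pickle


def process_sample(idx):
    """Hypothetical stand-in for the real per-sample preprocessing."""
    return {"idx": idx, "feats": [idx * 0.5, idx * 2.0]}


def preprocess_to_files(num_samples, out_dir, log_every=100):
    """Process samples one at a time and write each to its own pickle file.

    Only one sample is held in memory at a time, output appears on disk
    as the run progresses, and an interrupted run can be resumed because
    samples that already exist on disk are skipped.
    Returns the number of files written by this call.
    """
    os.makedirs(out_dir, exist_ok=True)
    written = 0
    for idx in range(num_samples):
        path = os.path.join(out_dir, f"{idx}.pkl")
        if os.path.exists(path):
            continue  # already written by an earlier (possibly interrupted) run

        sample = process_sample(idx)

        # Write to a temporary name first, then rename, so a crash
        # mid-write never leaves a truncated file under the final name.
        tmp_path = path + ".tmp"
        with open(tmp_path, "wb") as f:
            pickle.dump(sample, f, protocol=pickle.HIGHEST_PROTOCOL)
        os.replace(tmp_path, path)

        written += 1
        if written % log_every == 0:
            print(f"wrote {written} samples (last index {idx})", flush=True)
    return written


def load_sample(out_dir, idx):
    """Load a single preprocessed sample back from disk."""
    with open(os.path.join(out_dir, f"{idx}.pkl"), "rb") as f:
        return pickle.load(f)
```

With a layout like this, a dataset class could read one file per `__getitem__` call instead of loading a single huge file up front. Whether that fits the training code in this repository is something you would need to check.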
@zhaone Thank you for your reply! I will try your method