Bottleneck: finding abs paths of clouds from their basename takes forever

IGNF / myria3d

Myria3D: Aerial Lidar HD Semantic Segmentation with Deep Learning

https://ignf.github.io/myria3d/

BSD 3-Clause "New" or "Revised" License

172 stars 23 forks source link

Bottleneck: finding abs paths of clouds from their basename takes forever #82

Closed CharlesGaydon closed 1 year ago

CharlesGaydon commented 1 year ago

In datasets in which 50m x 50m patches of Lidar are already prepared, the number of clouds skyrockets from ~100 to ~10k. The current way of a) using a csv with signature "basename,split", b) finding the abs path based on the basename and the root of the dataset, takes too much time.

For instance with 28k patches (train split) a tqdm in debug mode gives :

19/28021 [00:36<15:22:09, 1.98s/it]