Closed boqchen closed 1 month ago
Thanks for your prompt reply.
How do you save these 512x512 patches? I would expect them to be 10s of millions if not hundreds. Do you just save them in one folder as png files etc. or is there a more efficient way to save them so it speeds up the training.
The file structure is similar to following lines: ├── ACROBAT2023 (dataset name) │ ├── images │ │ ├── 100_ER (slide name) │ │ │ ├── x_y_size_size.jpg (pathc name)
Then we built an index for all files, and saved them into a json file: {"slide_name": [path1, path2, ....], xxxx } to read data, read the json first, and then iterate all paths
Hi,
Thanks for the great project. I have a few questions about the pretraining.
Thanks in advance!