Image crop - Githubissues

G-200010 commented 4 months ago

I have a small question. You mentioned that you crop the Dota and xView to 800*800 with overlap of 100, and I found a crop method in this link of Dota and a crop method in this link of xView. Could you please give me some help of your experiment details of image crop? Thanks a lot.

h751410234 commented 4 months ago

（1）https://github.com/CAPTAIN-WHU/DOTA_devkit/blob/master/ImgSplit.py is a classic image slicing tool. You can adjust some hyperparameters according to the requirements of our paper to achieve cropping. Specifically, set gap=100 and subsize=800, and leave other parameters as default. This tool can handle both DOTA and xView datasets (some simple code modifications might be needed). （2）If you want to quickly run through our code, you can try using UCAS-AOD and CARPK for experiments in other domain adaptation directions. This does not require any slicing processing, only simple dataset formatting is needed.

G-200010 commented 4 months ago

Really appreciate for your reply. I think I got your method. I will try it later. I will also try the simple one. Thanks a lot.

G-200010 commented 4 months ago

I use the code you mentioned to crop the image and select 3 categories, but I got 5680 images in Dota train set, do you use some strategies in this process? Hope for your reply.

h751410234 commented 4 months ago

I don't recall doing any additional processing. Are you sure you're using the Dota1.0 dataset and have deleted all images that do not include the three selected categories?

h751410234 commented 4 months ago

I will upload my processed dataset to Baidu Netdisk in the next few days to facilitate its use and the reproduction of results by everyone.

G-200010 commented 4 months ago

I think I have done that, I first crop the images, then turn it to coco format, and select three categories during turning to coco format.

G-200010 commented 4 months ago

I use Dota 1.0 and task 2.

h751410234 commented 4 months ago

My processing steps are as follows:（1）Slice the large images.（2）Filter the annotations, retaining only those that include the three selected categories.（3）Delete images without any annotations.（4）Convert the dataset into the COCO format. I'm not sure if this matches your processing steps.

Regarding the selection of annotations, the version used is the original Dota1.0 annotation version, which includes 15 categories. I am not sure about the task-specific labels.

I will upload the download link for the prepared dataset to GitHub in the next few days to facilitate reproducing our results.

G-200010 commented 4 months ago

Okay. Thanks for your reply.

G-200010 commented 4 months ago

Could you please tell me just after slicing the large images, how many images do you get?

h751410234 commented 4 months ago

According to my records, the training set contains 22,381 images and the test set contains 7,638 images.

G-200010 commented 4 months ago

Really appreciate for your reply. We got the same number, I will check my code after slicing the large images.

G-200010 commented 4 months ago

Could you please give me some help (maybe the code) and details in selecting 3 categories. I try the pre-trained model of xView to Dota and get the following results. And I think the reason is that I do not select categories right. I got 1864 images for Dota val set. Hope for your reply.

hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh

G-200010 commented 4 months ago

And I get this error when using python evaluation.py --config_file {CONFIG_FILE} --opts EVAL True SSOD.RESUME_EMA {CHECKPOINT_FILE}

h751410234 commented 4 months ago

（1）I have uploaded my prepared dataset to Baidu Netdisk, and you can download it for use. （2）The xView2DOTA model we released is not the EMA model, so it will produce an error. You can use this command for evaluation: python evaluation.py --config_file {CONFIG_FILE} --opts EVAL True RESUME {CHECKPOINT_FILE}

G-200010 commented 4 months ago

Thanks a lot.

h751410234 / RemoteSensingTeacher

Image crop #4