Detail about the DFE - GitHub Issues

CXH-Research / DocShadow-SD7K

[ICCV 2023] A large-scale, high-resolution dataset that satisfies all important data features for document shadows and covers a large number of document shadow images.

MIT License

Hi, good day. We tried a decoder whose block counts mirror the encoder, i.e. 8, 4, 2, 2, and we also tried other combinations. None of them meaningfully improved performance (the gain was very slight, and in some cases performance dropped), while the number of parameters increased significantly. To keep inference feasible at high resolution and to keep it fast, we chose the current config.

You are welcome to try increasing the number of decoder blocks yourself, but in our experiments the current number was the best choice. You can think of the DFE simply as a U-Net.
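To get a rough feel for why mirroring the encoder inflates the parameter count, here is a back-of-the-envelope sketch in plain Python. Everything in it is an assumption made for illustration and is not taken from the repository: the channel widths, the encoder block counts, the "light" decoder setting, and the block design (two 3x3 convolutions at constant width). It also ignores skip connections and the up/downsampling layers, so the absolute numbers mean little; only the relative growth is of interest.

```python
# Illustrative parameter count for a U-Net-style decoder.
# All widths, block counts, and the block design are hypothetical.

def conv3x3_params(c_in, c_out):
    """Weights plus biases of a single 3x3 convolution."""
    return 3 * 3 * c_in * c_out + c_out


def block_params(channels):
    """Hypothetical block: two 3x3 convolutions at constant width."""
    return 2 * conv3x3_params(channels, channels)


def stage_params(blocks_per_stage, widths):
    """Total parameters over all stages, given blocks and width per stage."""
    return sum(n * block_params(c) for n, c in zip(blocks_per_stage, widths))


# Assumed channel widths, ordered from the shallowest to the deepest stage.
WIDTHS = (32, 64, 128, 256)
# The decoder runs from the deepest stage back to the shallowest one.
DECODER_WIDTHS = WIDTHS[::-1]

# Hypothetical decoder settings to compare.
CANDIDATES = {
    "light (1, 1, 1, 1)": (1, 1, 1, 1),
    "mirrored (8, 4, 2, 2)": (8, 4, 2, 2),
}

light_total = stage_params(CANDIDATES["light (1, 1, 1, 1)"], DECODER_WIDTHS)
mirrored_total = stage_params(CANDIDATES["mirrored (8, 4, 2, 2)"], DECODER_WIDTHS)

for name, blocks in CANDIDATES.items():
    total = stage_params(blocks, DECODER_WIDTHS)
    print(f"{name}: {total:,} decoder parameters")
```

Under these assumptions, the mirrored setting puts its eight blocks at the widest stage, which is where each block is most expensive, so the decoder grows several times over. To compare real configurations, count the parameters of the actual model instead of relying on this estimate.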

Detail about the DFE #9