Closed nooralahzadeh closed 3 years ago
Images go through random rotation, color jittering, padding to a square, resizing to 224x224, and tensor normalization. These transformations are implemented in clinicgen/data/image2text.py.
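To make the "padding to a square" step concrete, here is a minimal sketch of the padding arithmetic. It is not taken from clinicgen/data/image2text.py: the helper names `square_padding` and `padded_size` are hypothetical, and splitting the padding evenly on both sides is an assumption (the reply does not say where the padding is placed).

```python
def square_padding(width, height):
    """Return (left, top, right, bottom) padding that makes an image square.

    Assumption: padding is centered; any odd leftover pixel goes to the
    right or bottom edge.
    """
    side = max(width, height)
    pad_w = side - width
    pad_h = side - height
    left = pad_w // 2
    top = pad_h // 2
    return (left, top, pad_w - left, pad_h - top)


def padded_size(width, height):
    """Return the (width, height) of the image after square padding."""
    left, top, right, bottom = square_padding(width, height)
    return (width + left + right, height + top + bottom)
```

Padding before resizing keeps the aspect ratio of the original image intact, so the later resize to 224x224 does not stretch it.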
Thanks for the reply. May I ask whether these transformations are applied in the same way to the validation and test datasets as to the training dataset?
The random rotation and the color jittering are only applied to training data.
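As a rough illustration of that split-dependent behavior, the sketch below lists which steps would run for each split. The step names, the `preprocessing_steps` helper, and the ordering (taken from the order in which the steps are listed in this thread) are assumptions for illustration, not the actual code in clinicgen/data/image2text.py.

```python
# Hypothetical step names; the real pipeline lives in clinicgen/data/image2text.py.
TRAIN_ONLY_STEPS = ["random_rotation", "color_jitter"]
SHARED_STEPS = ["pad_to_square", "resize_224x224", "normalize_tensor"]


def preprocessing_steps(split):
    """Return the ordered list of preprocessing steps for a dataset split."""
    if split not in ("train", "validation", "test"):
        raise ValueError(f"unknown split: {split}")
    steps = []
    if split == "train":
        # Random augmentations are only applied to training data.
        steps.extend(TRAIN_ONLY_STEPS)
    # Deterministic steps are applied to every split.
    steps.extend(SHARED_STEPS)
    return steps
```

Keeping the random augmentations out of validation and test makes evaluation deterministic: the same image always yields the same input tensor.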
Thanks for the reply.
May I ask which value of max-word you used for generation? Is it the default (=128) or =50?
For the experiment in the paper, we used max-word=128.
Hi,
Would you please describe which steps (transform, pad, resize, ...) you applied to produce the CNN image features?
Thanks,