JuanS286 / gif_auto_audio_description

A project that aims to generate audio descriptions of GIFs for visually impaired people.

Pre-Processing: Data augmentation #20

Closed by Mikeltec 2 weeks ago

Mikeltec commented 1 month ago

Data augmentation code is now included in the PreProcessing-FrameExtraction notebook. It is meant to help the 3D CNN model learn to identify key elements and actions in GIFs even when they are flipped or slightly rotated, which should lead to more accurate and generalizable descriptions.
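For readers who have not opened the notebook, here is a minimal sketch of what flip and small-rotation augmentation for GIF clips can look like. It is not the notebook's code: the function names (`flip_clip`, `rotate_clip`, `augment_clip`), the `(T, H, W, C)` array layout, the 10 degree limit, and the 0.5 flip probability are all assumptions chosen for illustration. The one design point it tries to show is that the same transform is applied to every frame of a clip, so the motion a 3D CNN sees stays consistent over time.

```python
import numpy as np


def flip_clip(clip: np.ndarray) -> np.ndarray:
    """Mirror every frame left-to-right. `clip` has shape (T, H, W, C)."""
    return clip[:, :, ::-1, :]


def rotate_clip(clip: np.ndarray, angle_deg: float) -> np.ndarray:
    """Rotate every frame by the same angle about the frame centre.

    Uses nearest-neighbour sampling; output pixels whose source falls
    outside the frame are left at 0.
    """
    _, h, w, _ = clip.shape
    theta = np.deg2rad(angle_deg)
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w]
    dy, dx = ys - cy, xs - cx
    # Inverse mapping: for each output pixel, locate its source pixel.
    src_x = np.rint(np.cos(theta) * dx + np.sin(theta) * dy + cx).astype(int)
    src_y = np.rint(-np.sin(theta) * dx + np.cos(theta) * dy + cy).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros_like(clip)
    out[:, valid, :] = clip[:, src_y[valid], src_x[valid], :]
    return out


def augment_clip(
    clip: np.ndarray,
    rng: np.random.Generator,
    max_angle_deg: float = 10.0,  # assumed limit for "slightly rotated"
    flip_prob: float = 0.5,       # assumed flip probability
) -> np.ndarray:
    """Randomly flip and slightly rotate a clip, identically on all frames."""
    if rng.random() < flip_prob:
        clip = flip_clip(clip)
    angle = rng.uniform(-max_angle_deg, max_angle_deg)
    return rotate_clip(clip, angle)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-in for 16 extracted frames of size 64x64 with 3 colour channels.
    frames = rng.integers(0, 256, size=(16, 64, 64, 3), dtype=np.uint8)
    augmented = augment_clip(frames, rng)
    print(augmented.shape, augmented.dtype)
```

One caveat if something like this is used: flipping can change the meaning of direction-sensitive content (for example, text in a frame or "left" versus "right" in a caption), so the paired descriptions may need checking.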