File_type is case sensitive, creates problems during prediction generation

olgaliak / active-learning-detect

Active learning + object detection

MIT License

This issue also affects files with a .jpeg extension, which are not matched when the file type is set to .jpg; that is probably not ideal either. A more robust approach would be to use something like imghdr to detect the actual type of each file and keep only the files of the desired type (which would then be narrowed down to either 'jpeg' or 'png'). Even replacing the linked code with something like this would probably work:

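# all_image_files = list(basedir.rglob(filetype))
# replace this with:
import imghdr
# Walk basedir recursively (as rglob did) and keep files whose content is JPEG,
# regardless of extension or case; compare against 'png' for PNG files instead.
all_image_files = [image_file for image_file in basedir.rglob("*")
                   if image_file.is_file() and imghdr.what(image_file) == 'jpeg']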
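
One caveat: imghdr was deprecated in Python 3.11 and removed in 3.13, so the snippet above only runs on older interpreters. Below is a minimal sketch of the same idea without imghdr, checking the leading signature bytes of each file directly. The names `SIGNATURES`, `sniff_image_type`, and `find_images` are hypothetical and not part of this repository; only JPEG and PNG are handled.

```python
from pathlib import Path

# Leading "magic" bytes that identify each format (only the two types discussed here).
SIGNATURES = {
    "jpeg": b"\xff\xd8\xff",
    "png": b"\x89PNG\r\n\x1a\n",
}


def sniff_image_type(path):
    """Return 'jpeg', 'png', or None based on the file's first bytes."""
    with open(path, "rb") as handle:
        header = handle.read(8)
    for kind, signature in SIGNATURES.items():
        if header.startswith(signature):
            return kind
    return None


def find_images(basedir, wanted="jpeg"):
    """Recursively collect files under basedir whose content matches `wanted`."""
    return sorted(
        p for p in Path(basedir).rglob("*")
        if p.is_file() and sniff_image_type(p) == wanted
    )
```

Because the check looks at file content rather than the name, `photo.JPG`, `photo.jpeg`, and `photo.jpg` would all be picked up by `find_images(basedir, "jpeg")`. The trade-off is that every file under `basedir` is opened once, which may be noticeably slower than a glob on large directories.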

File_type is case sensitive, creates problems during prediction generation #50