kungfuai / kaishi

Tool kit to accelerate exploratory data analysis and data cleaning
https://kaishi.readthedocs.io/en/latest/
MIT License

Convnet labeler #3

Closed mwharton3 closed 4 years ago

mwharton3 commented 4 years ago

This in-progress branch will add functionality to label photo vs. document, orientation (to the nearest 90 degrees), and aspect ratio problems, along with methods to fix them.

mwharton3 commented 4 years ago

Some notable issues that need fixing:

mwharton3 commented 4 years ago

@spencerR1992 @zzsi I'm going to go ahead and merge this since there are a ton of changes, but submit issues if you notice something problematic.

If you want to try to test what's been changed, make a small folder with documents and photos, add some random rotations, save, and then run the below commands:

from kaishi.image import Dataset
imd = Dataset('path/to/images')
imd.run_pipeline()
imd.report()
imd.predict_and_label()
imd.report()
imd.transform_fix_rotation()
imd.save('path/to/output')

There will be some interim reports/etc. that you can check out, along with the output images. It's definitely not very good yet, but it's a working model at least.
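
For anyone who wants a quick way to build the small test folder mentioned above, here is a rough sketch. It is not part of kaishi: the helper names `plan_rotations` and `write_rotated`, the extension list, and the fixed seed are all made up for illustration, and the rotation step assumes Pillow is installed.

```python
import random
from pathlib import Path

# Rotations are limited to multiples of 90 degrees, matching the
# "nearest 90" orientation labels described in this thread.
ANGLES = (0, 90, 180, 270)

# Assumed set of image extensions; adjust to whatever your folder contains.
IMAGE_EXTS = {".jpg", ".jpeg", ".png"}


def plan_rotations(src_dir, seed=0):
    """Return {image path: angle}, one random multiple of 90 per image file."""
    rng = random.Random(seed)  # seeded so the test set is reproducible
    files = sorted(
        p for p in Path(src_dir).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTS
    )
    return {p: rng.choice(ANGLES) for p in files}


def write_rotated(plan, out_dir):
    """Rotate each planned image and save it under out_dir (requires Pillow)."""
    from PIL import Image  # imported lazily so planning works without Pillow

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for path, angle in plan.items():
        with Image.open(path) as im:
            # expand=True keeps the whole image when width and height swap
            im.rotate(angle, expand=True).save(out / path.name)
```

Calling `write_rotated(plan_rotations('path/to/originals'), 'path/to/images')` should give you a folder to point the commands above at, and the returned plan doubles as ground truth when checking the rotation fixes.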