tshrinivasan / OCR4wikisource

OCR for WikiSource using Google Drive OCR
GNU General Public License v2.0
33 stars 24 forks source link

Reduce unnecessary copy and intermediate files #106

Open Shreeshrii opened 6 years ago

Shreeshrii commented 6 years ago

Rather than copying files, they can be renamed, eg.

find -type f -name '*.pdf' | rename 's/pg/page/'

rename 's/txt_/text_for_page_/' txt*.txt