platisd / duplicate-code-detection-tool

A simple Python3 tool to detect similarities between files within a repository
MIT License
162 stars 30 forks

added support for bad file recovery #5

Closed whisperstream closed 3 years ago

whisperstream commented 3 years ago

This PR fixes two issues I encountered:

  1. Some of my files had invalid UTF-8 characters in their comments
  2. The resulting decoding errors caused the script to crash whenever it hit an incorrectly encoded file
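
To illustrate the kind of recovery described above, here is a minimal sketch of reading a file without crashing on invalid UTF-8. It is not the code from this PR: the function name `read_text_tolerant` is hypothetical, and replacing bad bytes (rather than skipping the file) is an assumption about the desired behaviour.

```python
def read_text_tolerant(path):
    """Return the text of `path`, or None if the file cannot be opened.

    Hypothetical helper, not part of the tool: it tries strict UTF-8 first
    and, on a decoding error, re-reads the file with invalid bytes replaced
    by U+FFFD so that one badly encoded file does not abort the whole run.
    """
    try:
        with open(path, "r", encoding="utf-8") as handle:
            return handle.read()
    except UnicodeDecodeError:
        # Assumption: keeping the file with replacement characters is
        # preferable to dropping it from the comparison entirely.
        with open(path, "r", encoding="utf-8", errors="replace") as handle:
            return handle.read()
    except OSError:
        # Missing or unreadable file: let the caller decide to skip it.
        return None
```

A caller could skip any file for which this returns `None` and carry on with the rest of the repository. Since the invalid characters were in comments, replacing them should have little effect on a similarity score, though that depends on how the tool preprocesses its input.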