horsicq / Detect-It-Easy

Program for determining types of files for Windows, Linux and MacOS.
http://ntinfo.biz
MIT License
7.63k stars 731 forks source link

Dictionary based text clasification. #17

Open MariasStory opened 8 years ago

MariasStory commented 8 years ago

Hi,

What do you thing about the possibility to categorize found strings from the executable based on dictionaries. I thing of having different dictionaries, like America English, English English, different languages, as well as, the dictionaries for specific programming languages, like C, C++, Python, Perl, JS, VB, ... At the end the scores are shown for different languages, maybe event with a mini map for distribution over the executable.

Greetings

horsicq commented 8 years ago

Hmm. Great! I have to think about it.