fotisAnagnostopoulos / CompCrypt

Here we will cooperate in solving the exercises of certain textbooks
1 stars 0 forks source link

Index of Coincidence in different occasions #1

Open fotisAnagnostopoulos opened 5 years ago

fotisAnagnostopoulos commented 5 years ago

It is evident that the IC as defined in our book it is an asymptotic approximation, valid only for a large portion of text. An interesting question to address is if it differs also within a set of different kind of small texts (i.e a technical description, a poem, etc) and what happens in the case of numerical characters.

Further, it would be of some value to find/describe/create an efficient way to calculate the expectation value of IC in the case of Greek language, Greek language with numbers, etc.

dmaroulidis commented 5 years ago

It's a very interesting proposition, and since I've, just, completed a similar task, I'd like to begin doing this.

fotisAnagnostopoulos commented 5 years ago

Nice. Now we need to use this tool for a number of texts. A plot of IC as a function of the text length could be useful in order to understand in which length essentially a random string differentiates from a meaningful one. Further, IC is different for different kind of texts?