NaNoGenMo / 2016

National Novel Generation Month, 2016 edition.
https://nanogenmo.github.io

"Out abasement aa Aeons" #139

Open enkiv2 opened 7 years ago

enkiv2 commented 7 years ago

Process: I swapped the 50k most frequent words with the 50k least frequent words in the corpus, limited to words in my unix dictionary file. (If I did it again, I'd probably separate the list by POS.)

Output: https://raw.githubusercontent.com/enkiv2/misc/master/nanogenmo-2016/lovecraft-freq-inverted.md

Code: https://github.com/enkiv2/misc/blob/master/nanogenmo-2016/freq-invert.sh
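
For readers who want to see the idea without reading the shell script, here is a rough Python sketch of the swap described above. It is not a port of freq-invert.sh. The function names (`build_swap_map`, `invert`) are made up for illustration, and it assumes that both the "most frequent" and "least frequent" lists are drawn from words that actually occur in the corpus and also appear in the dictionary file.

```python
import re
from collections import Counter

WORD_RE = re.compile(r"[A-Za-z']+")

def build_swap_map(text, dictionary, n=50_000):
    """Pair the n most frequent dictionary words with the n least frequent ones.

    Assumption: only words present in both the corpus and `dictionary` count.
    """
    words = (w.lower() for w in WORD_RE.findall(text))
    counts = Counter(w for w in words if w in dictionary)
    # Most frequent first; ties broken alphabetically so the result is stable.
    ranked = sorted(counts, key=lambda w: (-counts[w], w))
    # Never let the two lists overlap on a small corpus.
    n = min(n, len(ranked) // 2)
    swap = {}
    for common, rare in zip(ranked[:n], ranked[::-1][:n]):
        swap[common] = rare
        swap[rare] = common
    return swap

def invert(text, swap):
    """Replace every swappable word, keeping a leading capital if there was one."""
    def repl(match):
        word = match.group(0)
        key = word.lower()
        if key not in swap:
            return word
        new = swap[key]
        return new.capitalize() if word[0].isupper() else new
    return WORD_RE.sub(repl, text)

if __name__ == "__main__":
    corpus = "the the the cat cat dog"
    mapping = build_swap_map(corpus, {"the", "cat", "dog"})
    print(invert("The cat the dog", mapping))
```

With a real corpus, `dictionary` would be something like the set of lines in a system word list; the path to that file varies by system, so it is left to the caller.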

dranorter commented 7 years ago

Interesting! This gives me an idea: make the effect stronger or weaker depending on how Lovecraft's word frequencies compare to a broader English corpus. So, words which Lovecraft uses often and everyone else also uses often get swapped with words which Lovecraft hardly uses and everyone else hardly uses either, whereas words where Lovecraft's usage differs from the norm stay the same. Or vice versa.
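
One possible reading of this idea, as a Python sketch: compare each word's relative frequency in the author's text with its relative frequency in a reference corpus, treat words with similar rates as "typical", and only swap within that typical set. The names `typical_words` and `selective_swap_map`, the log-ratio test, and the `max_log_ratio` threshold are all assumptions chosen for illustration; nothing in the thread specifies them.

```python
import math
from collections import Counter

def typical_words(author_counts, reference_counts, max_log_ratio=1.0):
    """Words whose relative frequency for the author is close to the reference's."""
    author_total = sum(author_counts.values())
    reference_total = sum(reference_counts.values())
    typical = []
    for word, count in author_counts.items():
        if word not in reference_counts:
            continue  # no baseline available, so treat the word as distinctive
        author_rate = count / author_total
        reference_rate = reference_counts[word] / reference_total
        if abs(math.log(author_rate / reference_rate)) <= max_log_ratio:
            typical.append(word)
    return typical

def selective_swap_map(author_counts, reference_counts, max_log_ratio=1.0):
    """Swap frequent and rare words, but only among the 'typical' ones."""
    eligible = typical_words(author_counts, reference_counts, max_log_ratio)
    # Most frequent (for the author) first; alphabetical tie-break for stability.
    ranked = sorted(eligible, key=lambda w: (-author_counts[w], w))
    half = len(ranked) // 2
    swap = {}
    for common, rare in zip(ranked[:half], ranked[::-1][:half]):
        swap[common] = rare
        swap[rare] = common
    return swap

if __name__ == "__main__":
    author = Counter({"the": 100, "of": 50, "eldritch": 40, "cat": 5, "dog": 5})
    reference = Counter({"the": 1000, "of": 500, "eldritch": 1, "cat": 50, "dog": 50})
    print(selective_swap_map(author, reference))
```

The "vice versa" variant would keep the words that fail the threshold test instead, so that only the distinctive vocabulary gets swapped.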