Open epilys opened 3 years ago
Complex Word Identification
use pandoc to get plain text from .tex
files (but ignore \textlatin
and \textgreek
somehow?)
compare bag of words from plain text to some corpus
silence textlatin textgreek output with flag
silence page styles, headers
generate dvi
use dvi2tty
Identify words that'd be unfamiliar for a modern English speaker.
Resources