SAFeSEA / pyEssayAnalyser

An essay Analyser & Summariser, using Flask for the API and NLTK for the language processing.
9 stars 4 forks source link

small problems with tokenisation #3

Open vanch3d opened 11 years ago

vanch3d commented 11 years ago

There are a few edge-cases creating havoc with the tokenisation, because of a period within the sentence, all being references:

The rest of sentence is tagged with "#-s:n#", certainly because of