-
Hoping this get's done, but will be a big enough task.
Would be nice to support added for this -
```js
let lexicon = {"House":["ProperNoun"],
"house":["Noun"]
}
…
-
```
Using the incremental HunPos implementation would allow for longer sentences
and higher performance.
http://arne-koehn.de/bachelorarbeit/inkrementelle-part-of-speech-tagger.pdf
https://gitorious…
-
```
Using the incremental HunPos implementation would allow for longer sentences
and higher performance.
http://arne-koehn.de/bachelorarbeit/inkrementelle-part-of-speech-tagger.pdf
https://gitorious…
-
### 1.1 part of speech tagging (3 points)
A. Preprocess the `pharma` press release to remove all punctuation / digits (so can use `.isalpha()` to subset)
B. With the preprocessed press release f…
-
This is a great project! I'm working on some automatic transcription software. All the speech recognition engines I've looked at produce a straight stream of words and I haven't come across anything t…
-
We need this in order to see if there is a link between part of speech used in text and the gender of the people reading that text.
### TODO
- store in a different table whole texts from websites
- us…
-
Using the FreqDist and ConditionalFreqDist from NLTK, build the uni-gram bi-gram and trig-gram models for both words and tags.
-
Hello,
when I try to run semafor, it stops in the Converting postagged input to conll phase.
Environment variables:
SEMAFOR_HOME=/opt/semafor
CLASSPATH=.:/opt/semafor/target/Semafor-3.0-alpha-04.jar
…
-
Search of portmanteau lemmas currently retrieves nothing. For example, the lemma of ⲙⲙⲟ 'mmo' (2nd person feminine) based on SC guidelines is ⲛ_ⲛⲧⲟ 'n_nto':
https://github.com/CopticScriptorium/tag…
-
Haven't considered edge cases – it may well be that any simple fix creates more of a mess than it fixes. But I wanted this listed among the issues, at least, even if it will be a wontfix.