-
On small text with url in it, english is almost always detected
Example :
an arabic tweet with an url :
``` Json
POST _langdetect?pretty
{
"query_string": "RT @Dr_alqarnee: \"رمضان شهر الرحمة با…
-
A classifier based German part-of-speech tagger. It has an accuracy of 96.09% after being trained on 90% of the German TIGER corpus. The tagger extends the NLTK ClassifierBasedTagger and implements a …
-
Hi there,
First of all, you may consider to have a google group for factorie users to ask some beginner questions, like this :)
My question is I know factorie can extract certain words for topics. …
-
before building a corpus, it is important to understand what sort of corpus will provide what sort of grammatical phenomena. In this study, we identified the need for multiple scopes per sentence and …
-
Instead of using a set of arrays to slice up commands for processing, perhaps use something such as:
https://github.com/fortnightlabs/pos-js
-
Go go go!