-
I've tried vaderSentiment on tweets about the topic "Amber Heard". In a sample of 100 tweets, all are negative towards the topic. Here are some issues I've encountered:
1. vader is bad…
-
Is your source list, Norvig.txt, the SOWPODS dictionary? That is, the official UK Scrabble dictionary?
SOWPODS includes all compound words and inflections of every curse word, racial slur, etc. ev…
-
Hi,
First of all, thanks for your plugin, which makes it possible to avoid the obscure compound word token filter with hyphenation_decompounder (https://www.elastic.co/guide/en/elasticsearch/reference/2.0/an…
-
It would be helpful to provide phrase hints (context words) at inference time to boost the probability of certain domain-specific phrases in the transcription.
E.g. when passing an audio to python…
-
Although there are only a few pronouns (35 according to one enumeration, the *sarvAdi gaRa*), I find it difficult to 'make sense' of the declension of these commonly used words. In this presentation,…
-
The email genre of English-EWT lists file attachments, e.g. "Constellation Power (GISB draft).doc".
1. Should filenames always be tokenized into discernible linguistic words ("ConstellationPower(GS…
-
In issue #120, we described the possibility of showing the document in alternative views with different highlights. Now, with the four units of learning almost decided upon, we may actually need this …
-
@amir-zeldes
`לבנזילפניצילין, אב הטיפוס של הפניצילינים, פעילות נגד מרבית החיידקים הגראם-חיוביים`
From what I've gathered, prefixes are only to be applied if they belong to a restricted list -
`בין,…
-
Hi Nickolay, I tried to find a description of some criteria as to how the dictionary is transcribed (from other sources). One feature that strikes me as particularly odd is the entries marked with [th…
-
Forwarding feedback regarding Kannada
-----------------------------
The main observation on the new version is
that there is improvement, but there are still many problems with the kan.traineddata file.
…