-
hi, i'm interested in using this just for the guess-language part only (i.e. not the typo-mode setting or spellchecking) but using all possible languages.
is it possible that there's no japanese (j…
-
## Problem
For string comparisons, there are filter operators like `equals`, `startsWith`, `endsWith` and `contains`. Although these are already very useful, it would be nice to take full advantage…
-
## Task Overview
The objective is to create a trigram model from five free English works in Plain Text UTF8 format from Project Gutenberg. The process involves:
1. **Select Five free books from Pr…
-
Hi,
Thank you very much for the source code provided. Actually I translated the description to English, but still have some questions. It seems that the model only considers bigrams and unigrams. H…
-
**Is your feature request related to a problem? Please describe.**
We have an overview of most frequent bigrams / trigrams *including a search term* implemented, but for comparison with word embeddin…
-
In the current vanilla state, the Trigrams Cushion (Building `WuDangPuTuan`) is clearly and objectively a bad choice to use as a Cushion.
While there is a mod on the workshop to improve it, it simply…
-
For preparing multi-word phrases, it may be helpful to first look at which bigrams and trigrams are common in the text data.
Could we create some functions that create a list like this?
-
Breaking up #107.
-
Before comparing the file content using the Levenshtein or Jaro distance, first compare the two files using word-level trigrams to get the general sense of their similarity. Then, use the distance met…
-
Stylometric analysis is well understood and shockingly powerful even using only simple features like bigrams and trigrams. I can't find the thread right now but there's demos on HN where even small sa…