-
* The datasets for future improvement on either stopword-trainer, or additions/subtractions on the actual datasets
* The stopword lists
* Letter frequency tables
eklem updated
2 years ago
-
I just found out that 90% of the library size is represented by the frequency lists.
I wonder if it would be useful to create a version of the library that do not implement checks on the frequency …
-
There is some data that is not generated by OONI, but is quite useful to have inside of the database (and synched in some way) to support some data visualisations as well as explorer.
I think ooni-…
-
I understand this is quite an old repo, but I don't seem to be able to convert the .wav to a text file. the command line pops up and then disappeard but I'm not getting any confirmation or error showi…
-
```
Frequency data is important in various applications of linguistic data,
e.g. sorting or searching. For CJK there exist several sources of
frequency data built from large corpora. As the selection …
-
-
Would be great to support mutually exclusive instead of cumulative --aaf-bins. This can currently be simulated by running regenie multiple times with different variant lists included each time based o…
-
I'm trying to reproduce examples of [EM German](https://github.com/jphme/EM_German) with llamafile v0.6.2 in server mode.
The [example page](https://github.com/jphme/EM_German/blob/main/example_out…
-
Hey guys!
Two things here.
I'm running the latest build but have noticed a bit of an issue. I'm unsure if its the number of channels in the scanner, or something else. However, when scanning my se…
-
Tony Yu has a recipe for a histogram strip chart, it's a kind of boxplot with colors for histogram frequency
http://sourceforge.net/mailarchive/message.php?msg_id=28750326
attachment is at http://ww…