-
Hi @ailich,
see https://github.com/Nowosad/supercells/tree/estimate_compactness.
I took your code, cleaned it a bit, decided to return all the values (and not just summaries), and added an R fun…
-
WSQ compression of a friction ridge considerably changes the quality of a friction ridge image. Are there techniques we can employ to reduce this effect? Perhaps a filter, or limiting the frequencies …
-
It's dejecting when looking lower in the global frequency list, seeing just how much space is being wasted by OCR errors vs. legitimate-but-rare words. For every hundred tokens of junk, you get one "g…
-
revisit the logic of #18 but focussing on the use of standardised Data Quality measures instead of PROV.
Stubs for schemas to test against the ISO 19157 DQ measure register are here [https://ogcinc…
-
As part of the data quality measures implemented for this project, we have developed SQL code that performs row counts and other data quality checks to make sure the correct number of records are incl…
-
note perplexity does not exist for BTM models, we can implement
- Coherence
- Average Intra-Cluster Distance
- Average Inter-Cluster Distance
- Purity
- Normalised mutual information
- Adjusted…
-
Currently the blob system doesn't scale beyond 10 or so columns. This is because we currently build every possible sub-table.
What is needed instead are smarter sub-tables, where we avoid building …
-
- n_ab / |n_b|;
- decision rules lenght;
-
Maybe I am being paranoid, but I think we need to put in place a few anti-troll (maybe also anti-bot?) measures. (Anti-bot measures especially for the registration form, don't you think?)
The easiest…
-
About Hacktoberfest contributions: https://github.com/evidentlyai/evidently/wiki/Hacktoberfest-2024
**Description**
The ROUGE (Recall-Oriented Understudy for Gisting Evaluation) metric evaluates…