-
# Generalized Distributional TV
## Overview
This is a proposal for a new TV type that encompasses probabilistic,
fuzzy, distributional TV types and more. In short it is a
distributional TV that may w…
-
1. [Binary Relevance Efficacy for Multilabel Classification](https://link.springer.com/article/10.1007/s13748-012-0030-x) > https://github.com/Gin04gh/datascience/issues/6#issuecomment-419388287
1. […
-
I am using bert-base-nli-stsb-mean-tokens model in an unsupervised fashion to get similarity between sentences.
It performs really well in some cases.
But on doing extensive analysis, I found some …
-
ProtoAtoms such as `FloatValue`, `LinkValue`, etc. are actually sequences. For instance, `FloatValue` is
```
std::vector<double>
```
Why not rename them `FloatSeq`, etc?
It would be more consistent with …
-
I've been fitting some mixed models to genomic (transcriptomic) count data, and wanted to assess model fit. I've tried out a few models and a few options for R2; however, they give wildly different R2 …
-
Submitting Author Name: Nicolas Casajus
Submitting Author Github Handle: @ahasverus
Other Package Authors Github handles: (comma separated, delete if none) @MatGreco90, @ChaabaneS, @xgiraud
Rep…
-
- [x] Hu, R., Li, S., & Liang, S. (2019). Diachronic Sense Modeling with Deep Contextualized Word Embeddings: An Ecological View, 3899–3908. https://doi.org/10.18653/v1/p19-1379
- **Barbara 10/11**
…
-
see also #3270
Dropping outliers or hard rejection weights is one of the easiest ways to get high efficiency after using a high breakdown estimator in an initial stage. We can do the reweighting for…
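
A minimal sketch of the hard-rejection idea, for the simplest case of a location estimate. It assumes the median and the scaled MAD as the initial high breakdown estimates and a cutoff of 2.5 robust standard deviations. The function name `hard_rejection_mean` and the `cutoff` parameter are hypothetical, chosen for illustration, and are not part of any existing API.

```python
import statistics


def hard_rejection_mean(values, cutoff=2.5):
    """One-step reweighted mean using 0/1 (hard rejection) weights.

    Assumption: median and scaled MAD serve as the initial
    high breakdown estimates of location and scale.
    """
    center = statistics.median(values)
    mad = statistics.median([abs(v - center) for v in values])
    # 1.4826 makes the MAD consistent with the standard deviation
    # under a normal distribution.
    scale = 1.4826 * mad
    if scale == 0:
        # Degenerate case: more than half the points coincide.
        return center, [1] * len(values)

    # Weight 1 keeps an observation, weight 0 rejects it as an outlier.
    weights = [1 if abs(v - center) / scale <= cutoff else 0 for v in values]
    kept = [v for v, w in zip(values, weights) if w]
    # The efficient second stage: an ordinary mean on the retained points.
    return statistics.fmean(kept), weights


estimate, weights = hard_rejection_mean([1.0, 1.1, 0.9, 1.05, 0.95, 10.0])
print(estimate, weights)
```

The second stage is an ordinary (non-robust) estimator applied only to the retained observations, which is where the efficiency gain comes from; the same pattern would apply to regression residuals instead of raw deviations.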
-
@cormacanderson notes that:
In GM 1378 (Mawa) the inventory given lists a number of phonemes, i.e. /ʔ/, /ʔw/, /h/, /f/, /z/ that the source (Roberts 2009) explicitly says are not phonemic.
Updat…
-
In WordHoard and Monk we offered a number of count-based statistics and displays. Monk is no longer available for perusal, but the WordHoard documentation discusses some of the approaches we implemen…