-
Perform initial experiments with the contextual log line embeddings.
Our current embedding is based on aggregating (averaging) of per-token fastText embeddings. Contextual embeddings are expected t…
-
Running the example in the readme
```julia
using Embeddings
const embtable = load_embeddings(Word2Vec) # or load_embeddings(FastText_Text) or ...
```
fails with
```
ERROR: HTTP.ExceptionR…
-
代码中没有使用Fasttext word vectors吗,看到densenet_sgr.py中SGRLayer初始化把fasttest_embeddings置为None, 还是说加了效果不好
-
Hello,
I wanted to use context embeddings for words to perform word similarity tasks. Is there a way to get the context vectors for words using the fastText model file.
Thanks,
Aditya
-
While using [FBs fastText Python lib ](https://github.com/facebookresearch/fastText/tree/master/python) the [BioWordVec](https://ftp.ncbi.nlm.nih.gov/pub/lu/Suppl/BioSentVec/BioWordVec_PubMed_MIMICIII…
-
Great to see these projects on github!
Although TFIDF+LR is a great baseline model for text classification, another easy thing to try out is facebook's [fasttext implementation](https://fasttext.cc…
-
#### Problem description
I would like to fine-tune a fasttext embeddings model trained on wiki data on new in domain data,
I was using this code;
#### Steps/code/corpus to reproduce
model = Key…
-
Right now we are using `TfidfVectorizer` with its default options (basically word 1-gram). We should try a few different options and see how the accuracy changes:
- [ ] word 1-gram
- [ ] word 2-gram…
-
Hi,
as I understand, the option -pretrainedVectors allows ones to init the weights with some pretrained embeddings stored in a .vec file, with a vector for each word.
How can we init the subword…
-
**Summary:**
subcharacter information对于中文是有效的,那么日文又如何呢?研究发现subcharacter对于中文的提升效果在日文上并不稳定(我想应该是有片假名和平假名的缘故吧)。但是在一些汉字比较多的场景下,character ngrams效果确实有提高。不过在实验中,发现即使是enhanced skip-gram 也比不上 single-cha…