-
Hello,
I was reading the recent simCSE paper which referred to your paper when reporting the Average GloVe embedding results for the STS benchmarks. I originally created the issue in their [reposit…
-
# Allow Vectors Other than SBERT
Some of our features require vectorizing the chat data; by default, we use SBERT vectors, and we have a utility that checks that the embeddings exist in the appropria…
xehu updated
2 months ago
-
It is needed to modify the Dataset_from_sentences class in order to let it handle the whole dataset (wich comprehends more than 41k samples). Some possible implementation paths could be:
- serialize …
-
- [x] dimensionality reduction and embedding
- [x] clustering
- [x] zebrafish
- [ ] Expectation-maximization в обобщённом виде. Пример с монетками. Data science in the wild, выкачивание картинок из go…
-
Recognizing Steps to controlling the size of the data going into storage
Currently the system has 4 TB or storage. It is important to contoll the size of the data that is used for the model when sor…
-
Hi, thank you for your nice work!
Currently, I've been reproducing your paper.
I think you're using label(crnn_code) and glove embeddings(zmc_code, m_image) to create the images in your implementa…
-
Good morning,
Thank you for sharing the paper, code and pre-trained model for NLP text data. Your research work results are impressive. Because I am developing embeddings solutions for genes and pr…
-
Hi!
I was wondering if it was possible to initialize a GloVe model with one of the pre-trained embeddings before training it on a dataset of my own? I'm not sure to see how it can be done..
Than…
-
Thanks for sharing the code,
I am using python 2, got this error:
using coding utf-8, otherwise will got another coding error.
import sys
reload(sys)
sys.setdefaultencoding('utf8')
(deep…
-
I met this error when trying to run the code (only change Glove to 840B.300d but remain filename as 6B.300d)
Does anybody know how to fix this?
File "scripts/run_model/run_bimpm.py", line 267, in …