-
Hello,
I recently updated to the current Gensim version and after this update the following situation arises when using infer_vector (doc2vec):
After loading a saved model I use the infer_vector met…
-
code state is 03f80463342e861c8cb072c6aa8cc2916ee42d14
When I run the supposedly interactive docker run command,
```
docker run -ti --volume ${PWD}/model-generation/notebooks:/data/notebooks \
…
-
想请教一下您,ner_all这里的doc2vec是对一个txt进行训练吗?然后ner模型训练的时候也没有对整段话进行切分是吗?即如果Batch_size=1,那么一个txt中的一段描述就是一个批次?不知我的理解是否有偏差。
-
Make Jupyter Notebook in order to explore all content features from `content_dataset.json`
For train dataset and each technique:
- [x] Bag of words
- [x] N-grams
- [x] Tf-idf
- [ ] Averaging wo…
-
-
Hi, I want to extract the doc2vec features of those sentences in MS COCO. But I'm not quite sure how the preprocessing is performed.
It's said that the articles are tokenised and lowercased using S…
-
#### Problem description
I am installing gensim with `setup.py install`. I expected that this would only install needed files. I noticed that this installs files such as *.c *.cpp *.pxd *.pyx that …
pabs3 updated
2 years ago
-
Word2Vec training-loss isn't quite yet the epoch-based loss most would expect – as pending PR #2135 might address – but also `Doc2Vec` and `FastText` should offer functional, analogous reporting, and …
-
Hi,
Thank you so much for sharing the work!
I am wondering if you would like to upload the h5py files for other datasets as well as the scripts for the retrieval part?
-
Hello, thanks for your nice code. But I meet a problem that I can not find the 'complete_db.pickle' and ''db.pickle' in in the Politics dataset when I run 'train_doc2vec.py'.
`
def main():
data…
xzhws updated
3 years ago