-
Hi.
First of all, thank you for making such a model available to us.
I am trying to get vector embeddings of abstracts of some of the articles in PubMed. But somehow I couldn't get the sentence embe…
-
### Description
This is really a "discussion" issue. I'm not sure at all that the idea is feasible:
I've been testing `luceneutil` with heavy KNN indexing (Cohere wikipedia `en` 768 dimension vec…
-
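### Model inference fails with an error; what is the cause?
I am using the Paddle version, with paddlepaddle-gpu 2.6.1.
I converted the model with torch2paddle and got the following model file directory: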
├── inference_model
│ ├── model.pdiparams
│ ├── model.pdiparams.info
│ └── model.pdmodel
├── la…
-
From my understanding, yes, you can append an arbitrary `u32`, but then you can't serialize it without losing information, and if you serialize it with information loss, the validating node will verify th…
-
This issue is a place to share possible ideas to analyze the manifests.
Current ideas:
Counts:
- [x] Number of pages
- [x] Number of sentences
- [x] Number of words
- [x] Number of character…
-
https://eel.is/c++draft/vector.capacity#9.sentence-3
> `constexpr void shrink_to_fit();`
> *Effects*: [...] It does not increase `capacity()`, but may reduce `capacity()` by causing reallocation…
-
Hi,
I need to get the embeddings of a word or a phrase within a sentence. This sentence is the context of the word/phrase.
For example, I need the different embedding values of `big apple` in th…
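One common way to approach this kind of request is to take the per-token contextual vectors for the whole sentence, locate the tokens that belong to the phrase, and average only those vectors. The sketch below shows just that pooling step in plain Python. It assumes the per-token vectors have already been produced by some contextual model; the toy vectors, the whitespace-style tokens, and the helper names `find_span`, `mean_pool`, and `phrase_embedding` are all hypothetical and only for illustration.

```python
from typing import List, Sequence, Tuple


def find_span(tokens: Sequence[str], phrase: Sequence[str]) -> Tuple[int, int]:
    """Return (start, end) of the first occurrence of phrase in tokens (end exclusive)."""
    n = len(phrase)
    for i in range(len(tokens) - n + 1):
        if list(tokens[i:i + n]) == list(phrase):
            return i, i + n
    raise ValueError("phrase not found in sentence")


def mean_pool(vectors: Sequence[Sequence[float]]) -> List[float]:
    """Average a non-empty list of equal-length vectors element-wise."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def phrase_embedding(
    tokens: Sequence[str],
    token_vectors: Sequence[Sequence[float]],
    phrase: Sequence[str],
) -> List[float]:
    """Pool the contextual vectors of the tokens that make up the phrase."""
    start, end = find_span(tokens, phrase)
    return mean_pool(token_vectors[start:end])


# Toy data (assumption): one made-up 2-dimensional vector per token.
# In practice these would be a model's hidden states for this sentence,
# so the same phrase gets different vectors in different sentences.
tokens = ["i", "love", "the", "big", "apple"]
token_vectors = [
    [0.0, 1.0],
    [1.0, 0.0],
    [0.5, 0.5],
    [2.0, 4.0],
    [4.0, 2.0],
]

vec = phrase_embedding(tokens, token_vectors, ["big", "apple"])
print(vec)
```

With a subword tokenizer the phrase usually spans more pieces than words, so the span lookup would need to work on the tokenizer's own pieces or character offsets rather than on whitespace tokens.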
-
### A note for the community
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to …
-
import nltk
from gensim.models import Word2Vec
from nltk.tokenize import word_tokenize, sent_tokenize
from collections import Counter
nltk.download('stopwords')
nltk.download('punkt')
with open('txt…
-
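I see that the provided code does not include the implementation of the aspect extraction and sentence encoder parts, and that the sentence hidden vector is not used during forward inference either.
Could you provide the implementation code for these parts?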