-
### Describe the problem
I find that most of the time, I already have the data I want to vectorized stored somewhere --
therefore copying it over to chromadb is not only wasteful, but also expose…
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Desribe the bug
Dear creators of BERTopic,
Thanks for your work and this package is amazing. …
-
In the LoopVectorize pass, when the -prefer-inloop-reductions flag is enabled and the reduction instruction is the intrinsic smax, the flag does not function correctly, resulting in reductions being p…
-
**#searchEngine**
import os
import nltk
import string
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
# S…
-
Hi. I am new to using GPU. I am working on adversarial machine learning and earlier I have used the Textattack library for one of my projects using Sklearn and Keras models. For that I created the cus…
-
# Environment
- Python 3.12.4
- Tensorflow v2.16.1-19-g810f233968c 2.16.2
- Keras 3.5.0
- TensorBoard 2.16.2
# How to reproduce it?
I tried to visualizing data using [the embedding Project…
-
I was using langchain weaviate modules as my library to manage my weaviate storage. But the main problem was that I wanted to use weaviate's local text2vec transformers but in langchain there was no w…
-
```
import jieba
def tokenize_zh(text):
words = jieba.lcut(text)
words = list(filter(lambda x: (len(x)>1), words))
return words
import numpy as np
from umap import UMAP
from skle…
-
`pure_sklearn.map.convert_estimator` function crashes during a conversion of a sklearn unit having a functional variable (a CountVectorizer with a preprocessor parameter in my case).
`ValueError: O…
-
## Description
TF-IDF is one of the most famous algorithms when it comes to keyword extraction from text. Your task is to create a function that will extract keywords from text using the TF-IDF algor…