-
**#searchEngine**
import os
import nltk
import string
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
# S…
-
Hi,
I am facing an issue. If I train BERTopic on the same dataset multiple times, I get a different number of topics each time.
As per the discussion in this thread: https://github.com/MaartenGr/BERTop…
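One common source of run-to-run variation is the UMAP step, which is stochastic by default. Here is a minimal sketch, assuming the variation comes from UMAP and that fixing its seed is acceptable. The helper name `build_seeded_topic_model` and the seed value are made up for illustration, and the other UMAP settings are meant to mirror BERTopic's defaults as I understand them:

```python
def build_seeded_topic_model(seed: int = 42):
    """Return a BERTopic model whose UMAP step uses a fixed seed (illustrative helper)."""
    # Imported lazily so the helper can be defined before the heavy dependencies load.
    from bertopic import BERTopic
    from umap import UMAP

    umap_model = UMAP(
        n_neighbors=15,
        n_components=5,
        min_dist=0.0,
        metric="cosine",
        random_state=seed,  # fixed seed for the dimensionality reduction step
    )
    return BERTopic(umap_model=umap_model)


# Usage (requires bertopic and umap-learn; `docs` is your own list of strings):
# topic_model = build_seeded_topic_model()
# topics, probs = topic_model.fit_transform(docs)
```

Results may still differ across machines or library versions, so treat this as a starting point rather than a guarantee.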
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Describe the bug
Dear creators of BERTopic,
Thank you for your work; this package is amazing. …
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Describe the bug
Running `partial_fit` starts to throw an error after ~100 iterations:
```
~/.venv…
-
`
class UserModel(tf.keras.Model):
    def __init__(self, use_timestamps, use_distance):
        super().__init__()
        self._use_timestamps = use_timestamps
        self._use_distance = use_distance
…
-
I am using a TF-IDF vectorizer on dummy data, as shown in the code. If I print the shape of an element after vectorization, I get the output (14,).
When I save the vectorizer and later transform using t…
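For reference, here is a minimal sketch of a save and reload round trip, assuming the vectorizer is scikit-learn's `TfidfVectorizer` and that `pickle` is used for persistence. The sample texts and variable names are made up for illustration. The point it tries to show is that a reloaded vectorizer should be used with `transform`, not `fit_transform`, so the feature count stays equal to the size of the fitted vocabulary:

```python
import pickle

from sklearn.feature_extraction.text import TfidfVectorizer

# Illustrative training texts (not the data from the question).
train_texts = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "birds sing at dawn",
]

vectorizer = TfidfVectorizer()
vectorizer.fit(train_texts)
n_features = len(vectorizer.vocabulary_)

# Serialize and restore the fitted vectorizer (in memory here; a file works the same way).
blob = pickle.dumps(vectorizer)
restored = pickle.loads(blob)

# transform() reuses the fitted vocabulary, so the column count does not change.
new_row = restored.transform(["a cat and a dog"])
dense = new_row.toarray()[0]

print(new_row.shape == (1, n_features))
print(dense.shape == (n_features,))
```

Calling `fit_transform` on new text after reloading would instead learn a new vocabulary, which is one way the shape can change unexpectedly.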
-
### Issue Description
In the Minimal Reproducible Example below (using a high-dimensional feature set due to TF-IDF feature extraction; inspired by a real task we are working on), the calculated SHAP…
-
```
def brand_replace_text(
    self, texts: List[str], brand_regex: str, repl_term: str = "brandx"
) -> list:
    """Replaces top ngrams in a list of texts that match a given regex …
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Describe the bug
I am stuck with the bug: set nr_topics=96 (min_cluster_size=60, min_samples=30…
-
Running embedded weaviate for the first time will always write this output to standard out:
> Binary /home/vscode/.cache/weaviate-embedded did not exist. Downloading binary from https://github.com/…
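
A possible workaround, sketched under the assumption that the notice is written through Python's `sys.stdout` (for example with `print`): wrap the start-up call in `contextlib.redirect_stdout`. The function `start_noisy_client` below is a hypothetical stand-in, not part of the weaviate client, and its message is a placeholder.

```python
import contextlib
import io


def start_noisy_client() -> str:
    """Hypothetical stand-in for a start-up call that prints a notice."""
    print("placeholder notice: downloading binary")
    return "client"


def call_quietly(func, *args, **kwargs):
    """Run func while capturing anything it prints to sys.stdout."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        result = func(*args, **kwargs)
    return result, buffer.getvalue()


client, captured = call_quietly(start_noisy_client)
# Nothing reached the real stdout; the text is available in `captured` if needed.
```

Note that `redirect_stdout` only affects writes that go through `sys.stdout` in the current process. Output produced by a subprocess or written directly to the underlying file descriptor would not be captured this way.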