-
Hi @MaartenGr
Continuing my work mentioned in: https://github.com/MaartenGr/BERTopic/issues/1138
Using these ML tokenizers is relatively slow, especially on large datasets. For my test, on a datas…
-
Thanks for creating the awesome repository.
I'm currently experiencing issues with model prediction and was hoping you could offer some guidance. (BERTopic version: v0.13.0)
To provide some c…
-
Dear all,
I am facing major issues working with BERT. I have a dataset of around 1 million tweets. First, I want to train my model on 50 percent of my dataset; then, in the second step, I want …
-
# Problem
My team is using BERTopic to detect topics within a dataset. We have found that it is useful to create a higher-level grouping of related topics into clusters. Having fewer groups has red…
-
I am having a couple of issues with online topic modeling. I have read all of the relevant documentation (I think), but I am still unsure if what I am experiencing is a bug, or if what I am trying to…
-
Hello Maarten,
I've installed this package on my local computer, but I am getting a "UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 7365: character maps to <undefined>" error at the dataloader step …
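
For readers who hit the same message: the report above is cut off, so the actual cause in this case is not known. A common trigger for this exact error is reading a UTF-8 file on Windows without passing an encoding, so Python falls back to a legacy code page such as cp1252, in which byte 0x9d is undefined. The sketch below reproduces that situation under that assumption; the file name `sample.txt` and its contents are made up for illustration.

```python
import os
import tempfile

# Hypothetical sample text. The closing curly quote U+201D is encoded in
# UTF-8 as the bytes E2 80 9D, and 0x9D has no mapping in cp1252.
text = "She said \u201chello\u201d"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.txt")  # illustrative file name

    # Write the file as UTF-8, as most modern datasets are stored.
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)

    # Reading with cp1252 (a typical Windows default) fails on byte 0x9d.
    try:
        with open(path, encoding="cp1252") as f:
            f.read()
        cp1252_failed = False
    except UnicodeDecodeError:
        cp1252_failed = True

    # Reading with an explicit UTF-8 encoding recovers the original text.
    with open(path, encoding="utf-8") as f:
        recovered = f.read()
```

If the failing `open()` call sits inside a third-party data loader rather than your own code, one possible workaround is to load the documents yourself with `encoding="utf-8"` and pass the resulting list of strings on.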
-
Hi Maarten,
Thank you for the fantastic work - I’m a huge fan!
I’m currently working on a corpus with documents from 2018 to 2021, created by a few authors. Would it be possible to generate topi…
-
Hello Maarten! First of all, thank you very much for this package, your work, and your quick and helpful answers to the issues.
I have been running BERTopic on a dataset of 100 000 news articles in fre…
-
### Discussed in https://github.com/MaartenGr/BERTopic/discussions/1192
Originally posted by **jdweaver14** April 17, 2023
Hi everyone,
I apologize for not posting images here, but I am no…
-
Hello!
I am trying to run the `topics_over_time()` function in order to later run `visualize_topics_over_time()`. However, when I run `topics_over_time()`, it runs for `1 it` and then gets stuck.
…