-
- [x] Tokenize, normalize (lowercase)
- [x] Stop word removal (specific words for the domain)
- [x] Stemming: _use weak morphological treatments?_
"At least for English, morphological conflation t…
-
**Context**
We're currently trying to understand the SML version that is fit for scale. In the current version of SML, labels were generated manually. Upon discussion between product and technical, it…
-
Hi,
I am trying zero-shot topic modelling with BERTopic. The following fit_transform ran successfully:
```
topic_model = BERTopic(
embedding_model="thenlper/gte-small",
min…
-
znmeb updated
6 years ago
-
The search strings used were:
- `"Topic modeling comparative"`;
- `"Topic modeling comparison"`;
The resulting papers was exterminated by title and abstract, those judged as relevant are listed b…
-
Post questions here for this week's fundamental readings:
Grimmer, Justin, Molly Roberts, Brandon Stewart. 2022. Text as Data. Princeton University Press: Chapters 10, 12, 6, 13 —“Principles of Di…
lkcao updated
4 months ago
-
Post questions here for this week's oritenting readings:
Timmermans, Stefan and Iddo Tavory. 2012. “[Theory Construction in Qualitative Research: From Grounded Theory to Abductive Analysis](about:…
lkcao updated
4 months ago
-
Post your response to our challenge questions.
First, write down three intuitions you have about broad content patterns you will discover in your data. Plan an asterisk next to the one you expect m…
lkcao updated
4 months ago
-
### Description
it's crucial to perform in-depth data analysis and visualization to gain insights, discover patterns, and make informed decisions. This issue is focused on conducting an of the tex…
-
_Preface, I have tried to read through the current issues. I dont think that any issues raises what I am wanting. Issues like this https://github.com/MaartenGr/BERTopic/issues/2011 sound promising but…