-
TCC (final-year thesis)
- [x] Description of the Bag of Words method
- [x] Description of the k-means method
Code
- [x] Implement some caching in pygov_br (see functools.lru_cache for something possibly easy and semi-auto…
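
A minimal sketch of the `functools.lru_cache` idea from the last item. The function `fetch_dataset` and its argument are hypothetical stand-ins, not part of pygov_br; the point is only that a decorated function runs its slow body once per distinct argument.

```python
from functools import lru_cache

# Counts how many times the slow body really runs, to make the cache visible.
call_count = 0


@lru_cache(maxsize=128)
def fetch_dataset(name: str) -> str:
    """Hypothetical stand-in for a slow pygov_br request."""
    global call_count
    call_count += 1
    return f"payload for {name}"


first = fetch_dataset("deputados")
second = fetch_dataset("deputados")  # same argument: served from the cache
info = fetch_dataset.cache_info()    # hits/misses counters kept by lru_cache
```

Arguments must be hashable for `lru_cache` to work, and `fetch_dataset.cache_clear()` empties the cache when fresh data is needed.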
-
## Expected Behavior
It would be nice if [some of the features](https://radimrehurek.com/gensim/models/phrases.html) of the `gensim.models.Phrases()` tool could be implemented in the `doc.…
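
To illustrate the kind of feature being requested, here is a rough pure-Python sketch of bigram phrase detection. It is not gensim's implementation or API: the functions `find_phrases` and `merge_phrases`, the simple count-based score, and the default parameters are all assumptions made for this example.

```python
from collections import Counter


def find_phrases(sentences, min_count=1, threshold=0.2):
    """Score adjacent word pairs; keep those scoring above the threshold."""
    unigrams = Counter()
    bigrams = Counter()
    for tokens in sentences:
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    phrases = {}
    for (a, b), n_ab in bigrams.items():
        # Pairs that co-occur often relative to their parts score higher.
        score = (n_ab - min_count) / (unigrams[a] * unigrams[b])
        if score > threshold:
            phrases[(a, b)] = score
    return phrases


def merge_phrases(tokens, phrases, delimiter="_"):
    """Join detected pairs into single tokens, scanning left to right."""
    merged, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) in phrases:
            merged.append(tokens[i] + delimiter + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


sentences = [
    ["new", "york", "is", "big"],
    ["i", "love", "new", "york"],
    ["new", "york", "is", "far"],
    ["the", "city", "is", "big"],
]
phrases = find_phrases(sentences)
merged = merge_phrases(["i", "love", "new", "york"], phrases)
```

On this toy corpus only the pair `("new", "york")` clears the threshold, so it is merged into one token while the other words are left alone.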
-
### Is your feature request related to a problem? Please describe.
We often need to predict the sentiments of a user to check what the person feels about a certain product and whether we could recomm…
-
This main to-do list links to all the other items!
Week 1: Welcome
- [ ] Meet everyone :wave:
- [x] Research Software Engineers
- [ ] Jean Golding Institute (Data Scientists)
- [x] Working…
-
Probably TF-IDF, but word embeddings would be ideal
-
How do you include the TF-IDF weights in this method?
Compared to a simple MNB trained on plain bag-of-words counts, MNB with TF-IDF is more accurate.
How do you implement this?
www.cs.waikato.ac.nz/…
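
One common way to do this, sketched here in plain Python, is to treat the TF-IDF weights as fractional counts when estimating the multinomial Naive Bayes likelihoods. This is only an illustration under that assumption, not the method of any particular library; the function names, the smoothed IDF formula, and the toy data are all made up for the example.

```python
import math
from collections import Counter, defaultdict


def tfidf_vectors(docs):
    """Return ({term: tf-idf weight} per tokenised document, idf table)."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF, so terms found in every document keep a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf


def train_mnb(vectors, labels, alpha=1.0):
    """Multinomial NB where summed tf-idf weights replace raw word counts."""
    vocab = {t for v in vectors for t in v}
    weight = defaultdict(lambda: defaultdict(float))  # class -> term -> weight
    for vec, label in zip(vectors, labels):
        for term, w in vec.items():
            weight[label][term] += w
    class_docs = Counter(labels)
    log_prior = {c: math.log(n / len(labels)) for c, n in class_docs.items()}
    log_like = {}
    for c in class_docs:
        total = sum(weight[c].values()) + alpha * len(vocab)
        log_like[c] = {t: math.log((weight[c][t] + alpha) / total) for t in vocab}
    return log_prior, log_like


def predict(tokens, idf, log_prior, log_like):
    """Pick the class with the highest log score; unseen terms are ignored."""
    counts = Counter(t for t in tokens if t in idf)
    scores = {
        c: log_prior[c] + sum(n * idf[t] * log_like[c][t] for t, n in counts.items())
        for c in log_prior
    }
    return max(scores, key=scores.get)


docs = [["great", "movie"], ["great", "fun"], ["boring", "movie"], ["boring", "plot"]]
labels = ["pos", "pos", "neg", "neg"]
vectors, idf = tfidf_vectors(docs)
log_prior, log_like = train_mnb(vectors, labels)
pred_pos = predict(["great", "fun"], idf, log_prior, log_like)
pred_neg = predict(["boring", "plot"], idf, log_prior, log_like)
```

The only change from count-based MNB is in `train_mnb` and `predict`, where each term contributes its TF-IDF weight instead of an integer count; the Laplace smoothing with `alpha` is unchanged.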
-
![image.png](https://raw.githubusercontent.com/joenzkimchan/pe/main/files/a6b88c4f-46af-4802-b599-a0cbfa94ce82.png)
Tags are not very nicely wrapped :(
In other words, the gap after colleague and fr…
-
We are now ready to start on the topic modelling, but we are waiting for next class's explanation before implementing it. Currently, we have songs from 5 artists (we added 3 new ones) and we have a column (a…
-
Even with the new features such as q_marks, e_marks, pos_score, etc., I still haven't been able to beat the baseline performance of 45%, which was achieved using J48 and the StringToWordVectorFilter. …
-
There is an existing issue #1031 that mentions hashing a vector input to a scalar as a desirable output. While we can expand the set of types supported by hashing immediately (as done in #1303), …