-
when i run the code as is i get an error below. Im guessing the following.
I believe words changed on download from NTLK
These become mismatched lengths at 126/55
```
training.append([ba…
-
In Assignment 1.3 it is written: "This will load the data in a bag-of-words representation where rare words (occurring less than 5 times in the training data) are removed". However, when I sum the wor…
-
:red_circle: **Title** : Text Classification for Spam Detection
:red_circle: **Aim** : Create a text classification system to detect spam messages using machine learning techniques.
:red_circle:…
-
Realizing that during "Updates..." the `sed` process consumes "100% CPU" for several minutes, I investigated it a bit (see also https://stackoverflow.com/q/77818891/6607497). Eventually I could reduc…
-
We already do a bag of words, and word counts. Would it be useful to anyone to expose this functionality for other classification uses?
Some other things to consider:
- [ ] N-grams
- [ ] Levens…
Ch4s3 updated
7 years ago
-
Define and hold very clean terminology in the article!
- image hash (binary string compared by Hamming distance)
- histogram (binary string saved in SOLR/Lucene as it is - example Color)
- histogram w…
-
Hi, I am using clustering algorithm to cluster all the sift points extracted from 3 images (As I intend to create a code book for bag of visual words). I notice that the generated centroids are diffe…
-
##### Idea
We need a widget for filtering by a certain property:
- length of a word
- beginswith, endswith
- contains
- POS tags (e.g. output only proper nouns)
Design should be similar to S…
-
The BAAL project uses dropout as a method for active learning. If you sample 10 predictions and there's disagreement, that might indicate that there's something about the example that's interesting.
…
-
I received these suggestions from @GeorgeS2019 via email:
https://github.com/Apress/pytorch-recipes
https://github.com/Apress/pytorch-recipes/blob/master/Torch_AI_7.ipynb
Has a few NLP exam…