-
@cctoombs @twood02
This commit includes the code that I showed at the demo presentation. The three main components I worked on for the presentation were finding heights from the depth map, testing ou…
-
I'm hoping to implement a simple email spam filter type thing using RubixML + Naive Bayes but the only example I can find that deals with text / bag-of-words is the IMDB sentiment analysis example whi…
-
```
The following is from a posting by Olivier Grisel. This is something we
should consider learning about and working on.
I wondered if you were aware of the recent developments around sparsity
p…
-
Update / 2021: Blog post and source https://bart.degoe.de/building-a-full-text-search-engine-150-lines-of-code/ allows making a search module index using bag-of-words and term-frequency/IDF approach. …
-
We have identified the following metrics to be relevant for benchmarking:
- Bag of Words
- CER/WER
- Flexible CER
- Reading Order
- IoU
- mAP
- CPU time
- wall time
- I/O
- memory usage
…
-
So with the new features like q_marks, e_marks, pos_score, etc, I still haven't been able to outperform the baseline performance of 45%, which was achieved using J48 and the StringToWordVectorFilter. …
-
-
Why did I choose this paper? Because it analyzes the effect of tweet length on topic modeling methods.
### Main problem:
Which model is better for topic detection in the short text (tweet)?
Does …
-
I retrieved the docker image like so:
`# docker pull akashin/coursera-aml-nlp`
`#python3 --version`
shows that this has python 3.5 installed
Unfortuantely python 3.5 has [this](https://stackov…
-
Use NLTK (will provide an article at some point (HAYLEY DO THIS NOW)) to create ner to recognize each character in the stories.