-
# Pitch
## Summary
This week I want to analyze dialogues from The Office. I found a data set containing every line that every character has said in every scene of every episode of the TV show. I…
-
Submitting Author: Jerry Yu (@jy1909 )
All current maintainers: (@MoNorouzi23, @allan8392, @nassimgha)
Package Name: text_processing_util_mds24
One-Line Description of Package: This package is desi…
-
Image what would happen if we showed what was matched in realtime (rather than the post-write scenario we worked in), against the words matched.
The disconnected experience is okay, but I do think i…
-
I noticed that the examples.ipynb was out-of-date.
Here are the things that need to be changed/updated.
- [ ] Include new visualizers
- Class Prediction Error
- CVScores
- Manifol…
-
# Pitch
## Summary
So there's a sizable number of UFO fanatics who believe they have had spotted UFOs in the past. Lots of them log their sightings in the [National UFO Reporting Center](http://…
-
# Which #TidyTuesday Netflix titles are movies and which are TV shows? | Julia Silge
Use tidymodels to build features for modeling from Netflix description text, then fit and evaluate a support vecto…
-
### BERTopic in Weaviate for Cluster Analysis
### What
Large text datasets tend to be composed of many different topics. For example, Wikipedia contains text about animals, sports, medicine, and man…
-
In terms of functionality, the mid-term end goal is to achieve an offering of ML algorithms and pre-processing routines comparable to what is currently available in Python's [`scikit-learn`](https://s…
-
# Bug Report/Issue Tracking System
## Improve the quality of bug report contents
- [What Makes a Good Bug Report?(citation:287)](http://dl.acm.org/citation.cfm?id=1453101.1453146), N. Bettenburg, S. J…
-
Jurafsky, Daniel and James H. Martin. 2015. _Speech and Language Processing_. Chapters 6 & 7 ([“Vector Semantics and Embeddings”](https://web.stanford.edu/~jurafsky/slp3/6.pdf), [“Neural Networks and …