-
Post questions here for this week's exemplary readings: 2. Klingenstein, Sara, Tim Hitchcock, and Simon Dedeo. 2014. “The Civilizing Process in London’s Old Bailey.” PNAS 111(26):9419-9424.
-
Post questions about the following exemplary reading here:
Caliskan, Aylin, Joanna J. Bryson, Arvind Narayanan. 2017. “[Semantics derived automatically from language corpora contain human-like bias…
-
1) Download dataset.
2) Data exploration
3) Setup repository structure
-
I am often frustrated because I can only segment documents into sentences on corpus, but I came up with an idea to make it possible on tokens with boundary marker. I would use this often for word-embe…
-
I'm running a quick analysis to evaluate the effect of training corpus size on model performance on a fixed test set. The analysis is performed as follows:
* Choose a set of 30 test docs and 20 val…
-
http://discuss.contentmine.org/t/extracting-data-from-tilburg-funnel-plot-diagrams/386/18 says:
> The total number of raw documents is given as:
>
> ```
> url,doi,funnel_plot,nr_funnel,vector,not…
ghost updated
7 years ago
-
Post your questions here about: “[Text Learning with Sequences (Links to an external site.)](https://docs.google.com/document/d/1vHoYMFH-53UpE528xv_-xhSrkjUELI7ihfXmz3J_As4/edit?usp=sharing)” OR “[Tex…
lkcao updated
2 years ago
-
Not sure if this is the correct place to post a question, but here it goes:
I was reviewing the topic modeling code we went over in class as I was trying to figure out how to write the code for my fi…
-
Spacy models should be modified according medical corpus. For example:
`tokens['train'][0:10]: [['EMEA', '/', 'H', '/', 'C', '/', '551', 'PRIALT']...`
-
> - From the ECHOES Project report 2023
> - Continuation of #20
# Pitch similarity
- to search for specific pitch patterns across the corpus and to gauge the melodic similarity between chants, or…