Closed FloLimebit closed 2 years ago
tidytext::unnest_tokens
uses "words" as default (which e.g. drops punctuation)
- what algorithm should be used for tokenization?
tidytext::unnest_tokens
uses "words" as default (which e.g. drops punctuation)
Yes that's perfect
we should use ,
instead of .
(publication standard)
maybe we should include an option to add line breaks to column names, otherwise the latex table is too small (some columsn are taking too much space)
For the data paper we need a main summary table for Open Discourse.
This table needs the following details broken down by each electoral term:
The final Table should look like this (fake data):
The function needs to return the table as tibble/data frame but also as latex-format so we can use it in the data paper. Checkout the packages
knitr::kable
andkableExtra
to create the latex table.Add a decimal point to the output numbers (e.g. by setting it globally with
options(OutDec = ".")