-
The protocols right now are digital representations of the physical documents. This means that we need to have information on how to handle "avstavning" and that different textblocks now are separated…
-
Processing [this XML file](https://raw.githubusercontent.com/welfare-state-analytics/westac_parlaclarin_pipeline/main/sandbox/xmlns-bugg/source/1933/prot-1933--fk--5.xml) file using [config file](ht…
-
Detailed steps to produce a corpus for 1867 to 2019:
A tagged corpus for the period 1900 to 2019 already exists (created using Sparv v4.1). The raw text files for 1867 to 1899 are also already avai…
-
Essentially, even if we know who was the speaker, the party affiliation is often unknown for (generally) earlier speakers, often 1920s-1930s.
This is largely due to them not having dates associated…
-
```log
KeyError Traceback (most recent call last)
/venv/lib/python3.8/site-packages/penelope/notebook/co_occurrence/tabular_gui.py in _compute_handler(self, *_)
…
-
https://github.com/welfare-state-analytics/welfare_state_analytics/blob/08df4046bb937daf802c0b01764238a690f69170/Makefile#L11-L11
-
https://github.com/welfare-state-analytics/westac_hub/blob/bfef6aab199f3653781ab52249c1ad1dda50c2e7/Dockerfile#L35-L35
-
Release that includes a number of new features as bug fixes.
Noteworthy changes are:
- Support for merging consecutive words into phrases (#127, humlab/penelope#57).
- Added PoS-padding/markers…
-
Från och med april ska Johan, Erik och jag börja arbeta med aktivt med riksdagsdebatterna och med de Jupyter-sidor som togs fram för detta under hösten 2020.
- Dessa Jupyter sidor behöver ses över …
-
Follow these steps to install the `humlab-westac` package:
Prerequisites
--------------
**Chocolatey** (Windows 10, recommended).
Chocolatey is a Windows __package installer__ that simplifies in…