mtm80 / russ-project

2 stars 0 forks source link

Rhetoric in Russian Presidential Politics

Our project will seek to analyze the language usage of the Russian presidential candidates. Focus will be placed on campaign speeches, advertisements, slogans, etc.. We will investigate how their language corresponds to their respective parties ideologies.

Candidates

Methodology

We're marking up occurrences of the following general patterns

unused?

Transformations

All transformations and runner scripts (for on my home machine) are located in the /util/ folder. Run the shell script at the UNIX command line. Requires Python 3.5 or greater.

util/gen-mystem-input.xslt

This transformation takes a speech that jives with our TEI specification and outputs a plain text document that one can feed as input to Yandex Mystem.

util/flatten-mystem-output.xslt

The transformation takes Mystem output (which uses an ad-hoc XML specification for morphological data) and formats the data so it is compliant with TEI's <w> tag. This data is then manually fed back into TEI compliant XML, merging the morphological data in its original markup.