davidskalinder / mpeds-coder

MPEDS Annotation Interface
MIT License

The MPEDS Annotation Interface facilitates the creation of protest event datasets. Although it was built for the specific task of coding protest events, it could plausibly be used to develop other types of event datasets or for other text annotation tasks.

This system is built in Python using the Flask microframework. It can source articles parsed from Lexis-Nexis (using the split-ln.py script), Apache Solr, or XML files formatted in News Industry Text Format, such as the LDC's New York Times Annotated Corpus.
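To give a sense of what an article in News Industry Text Format looks like, here is a minimal sketch that pulls the headline and body paragraphs out of a NITF document with Python's standard library. The sample document and the `parse_nitf` helper are hypothetical illustrations, not the interface's own loading code, and real corpus files carry far more metadata than this.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily trimmed NITF document used only for illustration.
SAMPLE_NITF = """<nitf>
  <head>
    <title>Marchers Fill Downtown Streets</title>
  </head>
  <body>
    <body.head>
      <hedline><hl1>Marchers Fill Downtown Streets</hl1></hedline>
    </body.head>
    <body.content>
      <block class="full_text">
        <p>Thousands of people marched on Saturday.</p>
        <p>Organizers called for further demonstrations.</p>
      </block>
    </body.content>
  </body>
</nitf>"""


def parse_nitf(xml_text):
    """Return the headline and full-text paragraphs of a NITF document.

    Assumes the headline lives in an <hl1> element and the article text in
    <p> elements inside <block class="full_text">, as in the sample above.
    """
    root = ET.fromstring(xml_text)

    # findtext returns None when no <hl1> exists, so fall back to "".
    headline = (root.findtext(".//hl1") or "").strip()

    paragraphs = []
    for block in root.iter("block"):
        if block.get("class") == "full_text":
            for p in block.findall("p"):
                # itertext() also collects text nested in inline markup.
                paragraphs.append("".join(p.itertext()).strip())

    return {"headline": headline, "paragraphs": paragraphs}


article = parse_nitf(SAMPLE_NITF)
print(article["headline"])
print(len(article["paragraphs"]), "paragraphs")
```

Files that use different block classes or headline elements would need the lookups adjusted accordingly.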

It also uses Bootstrap for CSS and jQuery for JavaScript. It only works in Firefox (for now).

Setup

To populate the database with example information, first run the setup script.

python setup.py

This will add five users: an admin (admin), two first-pass coders (coder1p_1, coder1p_2), and two second-pass coders (coder2p_1, coder2p_2). They all share the same password (default). The script also adds a variable hierarchy for second-pass coding, enters metadata for all the articles in the example-articles directory, and queues those articles for the first-pass coders.
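As a quick reference, the example accounts described above can be written down as a small lookup table. This is only a sketch: the usernames and password come from this README, while the role labels and the `users_with_role` helper are hypothetical names chosen for illustration and are not values taken from the application's database.

```python
# Example accounts created by setup.py, as listed in this README.
# The role strings are descriptive labels (an assumption), not the
# application's internal identifiers.
SEEDED_USERS = {
    "admin": "admin",
    "coder1p_1": "first-pass coder",
    "coder1p_2": "first-pass coder",
    "coder2p_1": "second-pass coder",
    "coder2p_2": "second-pass coder",
}

# Every example account shares this password.
DEFAULT_PASSWORD = "default"


def users_with_role(role):
    """Return the usernames that carry the given role label, sorted."""
    return sorted(name for name, r in SEEDED_USERS.items() if r == role)


print(users_with_role("first-pass coder"))
```

Since every account shares one well-known password, these logins are suitable only for local testing.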

Then run the Flask test server with the following.

python mpeds_coder.py

Acknowledgments

Development of this interface has been supported by a National Science Foundation Graduate Research Fellowship and National Science Foundation grants SES-1423784 and SES-1918342. Thanks to Emanuel Ubert, Katie Fallon, and David Skalinder for working with this system since its inception, and to the countless annotators who have put significant time into working with and refining it.