Call-for-Code-for-Racial-Justice / TakeTwo-DataScience

Call for Code Diverse Representation Problem 3 media bias data science
Apache License 2.0

Implement Machine Learning component V2 (dsmvp-v2) #9

Open naokiabe opened 3 years ago

naokiabe commented 3 years ago

As part of the progression of machine learning components with increasing levels of sophistication, implement version 2 ("dsmvp-v2") with the following characteristics:

A machine learning module that can learn to detect racially biased expressions in context based on input labeled data of <context, expression, classification> triples.
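
One way to picture the labeled input is sketched below. It is only an illustration: the `LabeledExample` class, the `mark_expression` helper, the `[EXPR]` marker tags, and the label string are all hypothetical names, not part of the repository or of dsmvp-v1. The idea is that wrapping the expression in markers lets a text classifier see both the expression and its surrounding context in one input string.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabeledExample:
    """One <context, expression, classification> triple (hypothetical layout)."""
    context: str         # surrounding text in which the expression appears
    expression: str      # the phrase being judged
    classification: str  # label; the label vocabulary is an assumption


def mark_expression(example: LabeledExample,
                    open_tag: str = "[EXPR]",
                    close_tag: str = "[/EXPR]") -> str:
    """Wrap the first occurrence of the expression in the context with marker tags."""
    start = example.context.find(example.expression)
    if start < 0:
        raise ValueError("expression not found in context")
    end = start + len(example.expression)
    return (example.context[:start]
            + open_tag + " " + example.expression + " " + close_tag
            + example.context[end:])


# Neutral placeholder text; real training data would come from the database.
example = LabeledExample(
    context="The report used the phrase here.",
    expression="the phrase",
    classification="not_biased",
)
marked = mark_expression(example)
```

Here `marked` holds the context string with the expression delimited, which could then be paired with `example.classification` as one training item.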

A possible implementation can use contextual text classifiers, such as those based on Recurrent Neural Networks (RNNs), for example the LSTM architecture (Reference: https://spacy.io/usage/examples#textcat).

An RNN allows the classifier to be "sequentially contextual", i.e. to classify a given phrase or expression depending on the context in which it is used.
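
The "sequentially contextual" property can be seen in a toy sketch. The code below is an untrained, plain recurrent cell with random weights, deliberately simpler than an LSTM and not the implementation proposed for dsmvp-v2; `make_rnn`, `run_rnn`, and the placeholder tokens are hypothetical names chosen for illustration. It shows only that the hidden state after reading the same final token depends on the tokens that came before it.

```python
import math
import random


def make_rnn(vocab, hidden_size=4, seed=0):
    """Create random (untrained) weights for a tiny recurrent cell."""
    rng = random.Random(seed)
    # One input vector per token, plus a hidden-to-hidden weight matrix.
    w_in = {tok: [rng.uniform(-1, 1) for _ in range(hidden_size)]
            for tok in vocab}
    w_hh = [[rng.uniform(-1, 1) for _ in range(hidden_size)]
            for _ in range(hidden_size)]
    return w_in, w_hh


def run_rnn(tokens, w_in, w_hh):
    """Read tokens left to right and return the final hidden state."""
    hidden_size = len(w_hh)
    h = [0.0] * hidden_size
    for tok in tokens:
        x = w_in[tok]
        # New state mixes the current token with the previous state.
        h = [math.tanh(x[i] + sum(w_hh[i][j] * h[j]
                                  for j in range(hidden_size)))
             for i in range(hidden_size)]
    return h


vocab = ["context_a", "context_b", "expression"]
w_in, w_hh = make_rnn(vocab)

# Same final token, different preceding context.
state_a = run_rnn(["context_a", "expression"], w_in, w_hh)
state_b = run_rnn(["context_b", "expression"], w_in, w_hh)
```

Because `state_a` and `state_b` differ, a classification layer placed on top of the final state can assign different labels to the same expression in different contexts. A trained LSTM adds gating on top of this recurrence, but the dependence on earlier tokens is the same idea.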

The dsmvp-v2 code should be similar to the dsmvp-v1 implementation in the repository and share many of its aspects: using a Jupyter notebook, accessing the database via the web API, etc.

github-actions[bot] commented 2 years ago

:wave: Hi! This issue has been marked stale due to inactivity. If no further activity occurs, it will automatically be closed in 14 days.