When we run the classification algo on the full dataset, we probably don't want to modify the article records directly. Especially if human coders have already classified it.
Idea: derived metadata (categories, relevant/not, location) should live in its own related table, which includes fields for how it was derived (human or algo). Could there be multiple classifications for an article, and choose which one to use?
When we run the classification algo on the full dataset, we probably don't want to modify the article records directly. Especially if human coders have already classified it.
Idea: derived metadata (categories, relevant/not, location) should live in its own related table, which includes fields for how it was derived (human or algo). Could there be multiple classifications for an article, and choose which one to use?