dmarx / SuicideWatch

Experiment to build a model that detects suicidality in social media activity.
1 stars 1 forks source link

semi-supervised noisy label augmentation #8

Open dmarx opened 6 years ago

dmarx commented 6 years ago

PU Learning: positive + unlabeled data

PU LEarning: cosine-rocchio PU algorithm, Li et al 2010 EMNLP

Perplexity based PU learning (stephen wan)

dmarx commented 6 years ago

relevance models - ~2001/2002

LM ranking, top N -> another model to extract salient terms -> add to query, repeat
use KL divergence instead of perplexity