Open dmarx opened 6 years ago
Curated dataset: 2016 CLPsych ReachOut.com
David N Milne, Glen Pink, Ben Hachey, and Rafael A Calvo. 2016. CLPsych 2016 shared task: Triaging content in online peer-support forums. In Proceedings of the Third Workshop on Computational Lingusitics and Clinical Psychology, pages 118–127.
http://clpsych.org/shared-task-2017/
http://www.aclweb.org/anthology/W/W16/W16-0312.pdf - Includes an interesting (looking, at least) discussion of the ethics associated with projects like this
approach:
The three class is similar to the idea I had to classify /r/all vs /r/SuicideWatch vs. /r/Depression, except my goal is less to distinguish between possibly/strongly concerning than to distinguish between a depressive sentiment and a suicidal ideation. Might be equivalent.
CMU tool for lemmatizing twitter shorthand. Might be useful for reddit as well
context can be extremely useful for labeling, esp. discussion-related features
https://people.csiro.au/W/S/Stephen-Wan
stephen.wan@data61.csiro.au
https://scholar.google.com/citations?user=YMRsSGcAAAAJ&hl=en&oi=sra https://scholar.google.com/scholar?q=%22Stephen+Wan%22+%22suicidal+ideation%22&hl=en&as_sdt=0&as_vis=1&oi=scholart
.... Man, we have a lot of ideas in common. Might be another lab I should consider applying to.
Additional related research: