data2health / nlp-sandbox

Cloud-based sandbox for text analytics
MIT License
3 stars 1 forks source link

Get references to two NLP methods mentioned by Justin G. #1

Closed tschaffter closed 4 years ago

tschaffter commented 4 years ago

On Friday March 13, Justin G. mentioned two NLP methods that he would like Sage to deploy first in an EC2, then in an NCATS EC2. According to Justin, both method can be used using a CLI and web interface.

tschaffter commented 4 years ago

On March 13, Yao shared the following information about two NLP methods.

Method 1: Deidentification

Short description: Automated de-identification of protected health information from electronic health records.

Repository: https://bitbucket.org/MCW_BMI/notes-deidentification/src/master/

Method 2: UCSFPhilter

Short description: Philter is a command-line based clinical text de-identification software that removes protected health information (PHI) from any plain text file.

Repository: https://github.com/BCHSI/philter-ucsf

Resources

tschaffter commented 4 years ago

@yy6linda Who gave you the Box.com link (see above)?

yy6linda commented 4 years ago

Justin shared it with me. But the documents were removed already by people who created this link.

jaeddy commented 4 years ago

The MCW tool/service also appears to live in GitHub: https://github.com/jayurbain/ctsi-mcw-deid

EDIT Here's the MCW org, with what I assume are more "official" repos: https://github.com/MCW-BMI

yy6linda commented 4 years ago

Here is the link to the review of the two methods: https://docs.google.com/document/d/1xwebCoIt57iCYmmZFLfr96Da69-mD7Zepvsh1B0-Duc/edit

tschaffter commented 4 years ago

Links to resources for these two tools have been successfully compiled in this ticket.