nazmulkazi / dataset_automated_medical_transcription

Dataset for training machine learning models to automatically generate psychiatric case notes from doctor-patient conversations.
Other

README lacks information on provenance of the source data. #2

Closed jeb-orcl closed 1 year ago

jeb-orcl commented 1 year ago

Hi. We would like to use your dataset in some internal R&D work. However, Oracle is adamant about only using data that is appropriately sourced and licensed. The license on this repo is fine, but we are concerned about how your original starting data (the doctor/patient conversations) was sourced. Can you update the README to explain how you obtained these conversations, whether consent was obtained from the patients, and/or what rights you have to release this content?

Thank you.

nazmulkazi commented 1 year ago

This research was conducted at Montana State University, and the university has access to these transcripts, which are published by Alexander Street Press; see the "Transcripts adapted from Alexander Street" section. We didn't collect, record, or transcribe the original interviews, so you will need to contact Alexander Street Press about that. We only used the original transcripts to generate data for our study that closely resembles real doctor-patient conversations (we had some special requirements).