elcronos / COVID-19

COVID-19 Detector from x-rays using Computer Vision and Deep Learning
314 stars 68 forks source link

Create a function that anonymize xrays #19

Open elcronos opened 4 years ago

elcronos commented 4 years ago

Once we upload an image of an xray. We would need to create a function that removes the ID in that xray. We need the images uploaded to be anonymous if want to use it for future research work (Otherwise, we will not get Ethical Approval)

Resources:

timrosenow commented 4 years ago

Easily done - I've already got a python class that does this. I'll modify and provide it.

timrosenow commented 4 years ago

I don't have permission to create a pull request or new branch. I have a file to upload, what is the best way to get it to you? Sorry i'm a bit of a git newbie :)

Why-Ay-Es-Haitch commented 4 years ago

I have utilized the EAST text-recognition model to put together a basic redaction tool to remove hardcoded text from image files. This needs to be done to mitigate privacy concerns. Unfortunately I don't have access to any x-rays with information on them to test the model. I have added the script at https://github.com/yash-parge/EAST-textRedaction for others to improve on.

elcronos commented 4 years ago

I just realized that AWS has textract. Which allows you to extract text from documents including images. The API also shows the position of the text in the image. Then, it will be easier to remove any text that could be in an image. I think this is the best solution. Anyone wants to work on this?

JuanSeBestia commented 4 years ago

Maybe remove the location in meta-info will be necessary?

Put random image for context image

timrosenow commented 4 years ago

Yep I will

From: Camilo Pestana notifications@github.com Sent: Monday, 23 March 2020 1:03 AM To: elcronos/COVID-19 COVID-19@noreply.github.com Cc: Tim Rosenow Tim.Rosenow@telethonkids.org.au; Comment comment@noreply.github.com Subject: Re: [elcronos/COVID-19] Create a function that anonymize xrays (#19)

I just realized that AWS has textract. Which allows you to extract text from documents including images. The API also shows the position of the text in the image. Then, it will be easier to remove any text that could be in an image. I think this is the best solution. Anyone wants to work on this?

— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://github.com/elcronos/COVID-19/issues/19#issuecomment-602239917, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ALJO6WDI76VWMQDGLFEUPKTRIZAFBANCNFSM4LQ4U6GA.