pydicom / deid

best effort anonymization for medical images using python
https://pydicom.github.io/deid/
MIT License
140 stars 43 forks source link

Optimize replacement functionality #117

Open wetzelj opened 4 years ago

wetzelj commented 4 years ago

As discussed starting in PR #112 (comment), optimize the processing of recipe actions to more efficiently perform defined actions.

I think we would want to parse an entire recipe first, build up a dependency graph of sorts (that also caches originally values that might be needed) and then do the replacements the most efficient way possible, possibly instead of parsing over every action and checking for every field, we first create an assignment of specific actions to specific fields, and then do a much more informed replacement.

vsoch commented 4 years ago

I think now that we have the DicomParser, we could look into discussing this again (but once you have some time @wetzelj, don't worry doesn't need to be soon!)

vsoch commented 4 years ago

Is this still something we want to discuss, perhaps in the near to distant future? :)

wetzelj commented 4 years ago

Yes, but not quite yet. We're a few weeks away from releasing our end-user application into user acceptance testing. At that point, we'll see more testing with a diverse sets of images. I suspect that we'll see some minor bugs with the header deidentification once this starts. I'd like to get further along with this testing before starting on this optimization.

vsoch commented 4 years ago

okay sounds good :)