uncoast-unconf / uu-2019

Materials for the 2019 uncoast unconference
11 stars 4 forks source link

R Package that documents data files and sensitive data (PPI) within the file #19

Open mkosmicki opened 5 years ago

mkosmicki commented 5 years ago

We've been up to our eyeballs in data security and mapping with GDPR and CCPA.

I think we can automate some of the data documentation process required using R code. I searched for a package but couldn't really find anything specific to the documentation needed to basically CYA and protect yourself from the people you engage with marketing/advertising.

ijlyttle commented 5 years ago

Hi @mkosmicki,

A little while ago, I started noodling with something I think would be along those lines - but so far, its the GitHub equivalent of "napkin notes to myself": https://github.com/ijlyttle/steward/issues/1

Forgive me that I have documented "better" my ideas for an implementation, rather than what it would do, or what problem it would solve. (I should work on that)

Hopefully this is the same thing you are looking-for - a way to make it easier to write the data-documentation for a package: http://r-pkgs.had.co.nz/data.html#documenting-data

I also have this crazy idea to make it easier to read and reformat dataset metadata so you could make, for example, a gt table of your data-dictionary.

Is any of what I'm saying what you had in mind?

mkosmicki commented 5 years ago

Yep! That's it.