2DegreesInvesting / ds-incubator

2° Investing Initiative, ds-incubator website / eBook:
https://bit.ly/ds-incubator-videos
1 stars 4 forks source link

Azure as way improve how we manage and use data #37

Open maurolepore opened 4 years ago

maurolepore commented 4 years ago

When: ASAP

Who is the audience?

Analysts, data managers, software developers at 2DII and beyond.

Why is this important?

Following a discussion on managing and using data (#35) we concluded we can improve. In general, we should control more effectively who can read and write data (including overwriting and deleting data). Technically, there may be more than one way, and the solution might look different from the perspective of an analyst, a data manager, or a software developer. Before we invest in any one approach we may want to explore a number of potentially good alternatives.

Taylor proposed to replace Dropbox with Azure:

"You would be able to download anything, but not edit. Uploading a new file with the same name is the only way to change a file, and by default we would stop people from uploading".

The goal of this meetup is to learn what the workflow would look like for an analyst and software developer if we replaced Dropbox with Azure.

What should be covered?

The content is TBD and totally up to @tposey28 and @AlexAxthelm.

I imagine a live demo of two loops over the entire data-cycle from the data base to analysis code. The first loop being the release and use of new data; and the second loop being the update and release of an updated version of the data released before.

(The analyst role may even be played by @jdhoffa (unsolicited mention) or other volunteer (anyone?), if they have the time to join a preparation pair-programming call with @tposey28 and @AlexAxthelm.)

Suggested speakers or contributors

@tposey28, @AlexAxthelm, @jdhoffa, @cjyetman, @maurolepore

Resources

(Mauro) If I were an analyst wanting to replace Dropbox with Azure, which is the one R package I should definitely learn about?

(Taylor) AzureStoris the only one you need (GitHub; CRAN).

Checklist

2h before

10' before

Start