datalad-handbook / course

Talks and materials for workshops based on the DataLad handbook
Other
12 stars 3 forks source link

1.5 day workshop in Lucca #27

Open adswa opened 4 years ago

adswa commented 4 years ago

@mih and I will be giving a workshop on DataLad in Lucca on March 23rd-24th. This issue lists the TODOs and acts as a progress tracker. Please extend and edit as necessary. :)

Logistics

Software

Teaching

A Basics layout has been proposed by @mih and awaits feedback from Lucca

This is currently structured like this: Monday 23 Morning session 1 Datalad concepts and principles 2 Basics of local data/code version control + Hands on: tasks to exercise basic building blocks

Monday 23 Afternoon session 1 Modular data management for reproducible science + Hands on: implement sketch of a reproducible paper 2 Data management for collaborative science + Hands on: Using your infrastructure (Gdrive) to collaborate on a demo project

Tuesday 24 Morning session 1 Data publication + Hands on: Publish data on "GitHub" 2 Outlook (what is else possible, resources, use cases)

Resources to create

adswa commented 4 years ago

I have created a free GDrive account for testing (dataladtester@gmail.com). This gives us 15GB to play with.

adswa commented 4 years ago

rclone for GDrive notes:

When you use rclone with Google drive in its default configuration you are using rclone’s client_id. This is shared between all the rclone users. There is a global rate limit on the number of queries per second that each client_id can do set by Google. rclone already has a high quota and I will continue to make sure it is high enough by contacting Google.

It is strongly recommended to use your own client ID as the default rclone ID is heavily used. If you have multiple services running, it is recommended to use an API key for each service. The default Google quota is 10 transactions per second so it is recommended to stay under that number as if you use more than that, it will cause rclone to rate limit and make things slower.

Here is how to create your own Google Drive client ID for rclone:

Log into the Google API Console with your Google account. It doesn’t matter what Google account you use. (It need not be the same account as the Google Drive you want to access)

Select a project or create a new project.

Under “ENABLE APIS AND SERVICES” search for “Drive”, and enable the “Google Drive API”.

Click “Credentials” in the left-side panel (not “Create credentials”, which opens the wizard), then “Create credentials”, then “OAuth client ID”. It will prompt you to set the OAuth consent screen product name, if you haven’t set one already.

Choose an application type of “other”, and click “Create”. (the default name is fine)

It will show you a client ID and client secret. Use these values in rclone config to add a new remote or edit an existing remote.