unpackAI / DL201

Deep learning 201
6 stars 6 forks source link

Separate the data and the code into different github repositories #20

Open Philanoe opened 2 years ago

Philanoe commented 2 years ago

In order to have a lighter repository, making it faster to clone and pull each branch, it would be convenient to remove all the data (pictures, tables, etc) from the notebooks repositories. Instead of that, we could have a common data repository which could be used by several courses (DL101, DL201, python classes, etc).

Do the course designers confirm that those data are not updated really often compared to the code ? The only difficulty would be to call those data from the notebook. It could be necessary to use the full path of Unpack AI github data repository to access them.

Another advantage : avoiding "!git clone" operation at the beginning of the notebooks, which can make the notebooks quite slow to start, even though we just use a small part of that data,

jamescavanagh commented 2 years ago

Perhaps a similar strategy to .URLs used in fastAI, in order to download it from a server. Can we host the datasets on a server?