Open amueller opened 9 years ago
FYI I was planning to use some material from my strata tutorial, but I also have to look through the 2013 notebooks more carefully again: https://github.com/amueller/pydata-strata-2015
I like to interleave notebooks and slides, I think they only did notebooks last time.
I like the general layout. It would be good to interleave and I will have to take a look through your slides - supervised and unsupervised together make sense to me, then we show how you can combine them to build pipelines.
Notebooks are good but I don't know if using them for the whole thing is better than having slides for the core, and like 4-6 good application/example notebooks.
On Thu, Mar 5, 2015 at 2:46 PM, Andreas Mueller notifications@github.com wrote:
FYI I was planning to use some material from my strata tutorial, but I also have to look through the 2013 notebooks more carefully again: https://github.com/amueller/pydata-strata-2015
I like to interleave notebooks and slides, I think they only did notebooks last time.
— Reply to this email directly or view it on GitHub https://github.com/amueller/scipy_2015_sklearn_tutorial/issues/2#issuecomment-77437108 .
I was quite happy with my strata talk, but it was only 1:30, not 7h (!!). I think with that much time, we should definitely include exercises, and I'll have a look what they did last time. Anyhow, for not I want to submit something and get that off my todo list, we can worry about actually creating the content later ;)
Sounds great to me. It will be fun! Whatever mods you need to make for the submission are OK with me
On Thu, Mar 5, 2015 at 2:57 PM, Andreas Mueller notifications@github.com wrote:
I was quite happy with my strata talk, but it was only 1:30, not 7h (!!). I think with that much time, we should definitely include exercises, and I'll have a look what they did last time. Anyhow, for not I want to submit something and get that off my todo list, we can worry about actually creating the content later ;)
— Reply to this email directly or view it on GitHub https://github.com/amueller/scipy_2015_sklearn_tutorial/issues/2#issuecomment-77439374 .
What do you think about whether to do first unsupervised learning or first model selection? In 2013 they did first supervised learning, examples, grid-search, and then unsupervised learning. In the outline I proposed, we'd do both supervised and unsupervised learning in the beginning, and then cross-validation and grid-searches etc.
Do you have a preference?
As we have quite a bit of time, I think it would be nice to start the whole thing by explaining the different kinds of learning, that is supervised, unsupervised, (maybe reinforcement) roughly, and then say that sklearn does both supervised and unsupervised and show the API.