Closed bkatiemills closed 8 years ago
I'd like to put together an R lesson on working with nested data frames using tidyr and purrr.
@wmhall awesome, let's do it!
@radaniba still interested in machine learning ala #101? (I certainly am!)
@minisciencegirl did you get the contact deets for the raffle winner at #ubcdsd?
Hey that's ~3 already, we only need like 8 or 9 to get us to September - who else has one?
@BillMills totally
@BillMills the plan in #101 will probably require 4 or 5 rounds at least how are you planning to schedule this ?
"@minisciencegirl did you get the contact deets for the raffle winner at #ubcdsd?" that'd be me. I agreed to lead a session on sonifying time series data
I'd love to see something about Jekyll, Sphinx or something like that, building dynamic webpages. Getting a bit away from R, but I think it would be helpful.
Is there an appetite for any of the following?
Also, as a learner, I would love it if anyone could teach a neural networks/deep learning session at any point!
@jrebane These all sound great. I'd be very interested in seeing the R side of natural language processing and network analysis.
@markjohnsonubc yes! that talk sounds awesome, I am v. looking forward to it - we're also going to re-vote the time for our meetings soon, so when we have a slot you are welcome to first pick of which one is more convenient for you.
@jrebane these are all awesome, and sound like they'll have some overlap with @radaniba's machine learning sessions - should we possibly coordinate there?
@radaniba that's up to what works for you - we could totally run all 4 or 5 sessions, possibly at the expense of some of the coworking days. Or, we could do this as a web series in parallel with the regular sessions, since I know it's pretty inconvenient for you to make it up to UBC. Would be great to think about packaging this up as a little mini-series for sharing and reuse, too!
@BillMills web series like live broadcast ? or recorded sessions
@radaniba I was thinking both! But totally up to you if you're the one in front of the camera :)
haha, recorded would work super great for me, but will be live answering questions, that will work better as I will be in control of the recording time and will answer any question on the fly during the broadcast
what will be the dates (start - end) of this summer session ? 2 months ? 3 ?
@radaniba nominally May through August, but it's all pretty flexible - the determining factor is, as always, the whim and convenience of the speakers!
Looks like we have many great sessions being proposed here! Definitely keep them coming!
Next up: we need to decide on a good meeting time for everyone. As per usual, we plan on meeting up once a week, for about an hour. Please highlight your available times in this poll: http://whenisgood.net/zqr53wz
Let's try to fill out the poll by May 6 so we can get the session started!
Note that the calendar day doesn't matter, just the day of the week for our usual thing. See you all soon!
I'd also like to talk about purrr
, as I want to incorporate it into STAT 545 this fall. So @wmhall and I could maybe team up? Either create one lesson jointly or coordinate our stuff to be complementary? There is plenty to talk about.
@jennybc excellent! Will do. It's looking like we're going to be assembling a schedule the week of May 9, so we will post possible date then.
@jennybc teaming up sounds great. It might make the most sense to just coordinate our stuff to be complementary. I was thinking of focusing on using nested data frames (via tidyr
) in a split/apply/combine workflow, so purrr
wouldn't be my focus. Maybe when scheduling happens we can try and make sure that my session comes soon after yours so that people are already comfortable with purrr
What about "documentation", how to write good documentation, what works, what doesn't, how to host it, where to host it, &cetera.
This sounds awesome! @SimonGoring: would you be willing to run your Shiny/Leaflet session again? Sorry for missing it last time!
@SimonGoring that's a great idea. documentation is key.
On Wed, Apr 27, 2016 at 6:50 PM, Amy Lee notifications@github.com wrote:
This sounds awesome! @SimonGoring https://github.com/SimonGoring: would you be willing to run your Shiny/Leaflet session again? Sorry for missing it last time!
— You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub https://github.com/minisciencegirl/studyGroup/issues/107#issuecomment-215283742
👍 @SimonGoring for documentation great idea, python users could be interested in sphinx
@wmhall Agreed we can coordinate. purrr
comes up in the workflow you discuss or, at least, it can. Because it can be really handy inside mutate()
to work on the list columns that result from nesting more complicated structures in a data frame.
So yes I could give a more general purrr
overview, which would then set it up for a mention by you. There's definitely plenty for two different-but-complementary sessions here.
@radaniba @morien yes, I've been using sphinx and readthedocs. I've figured out some fun stuff (like embedding code into the documentation), checked out a few templates & stuff. I think I could do a general session on Sphinx & readthedocs (although I'm really a sphinx neophyte) and we could maybe have a general talk about good documentation, does anyone want to add some roxygen2
talk to that? @jennybc or other active package developers?
@minisciencegirl I can re-do the leaflet if there's interest.
@SimonGoring I think some within rOpenSci (not including me) recently took a pretty hard look at readthedocs as we (this time including me) were interested in whether more tools and automation around shoving R documentation into that format and site would be worthwhile. I think the conclusion was equivocal at best. I'm not sure of all the details, but pretty sure there's no decision to go full steam ahead on that. So I would suggest steering our sessions towards roxygen2, vignette writing, READMEs, example coverage, etc. There is so much to talk about just with the immediate tools at hand.
Alright, the people have spoken - our preferred time for the summer is a three-way tie between Tuesdays at 10 AM, or Fridays at 1 or 4 PM. I nominate Fridays at 1 as a good way to wind down the week, but speak up if there are any strong opinions for the other two favorites.
In which case, we can start booking these awesome sessions you've suggested! What's say we start not this Friday but next? Also, there have been a bunch of requests for more introductory level lessons from new students; @JoeyBernhardt is going to help organize a run through of the R Utility Belt you all helped outline last year - let us know if you're interested in teaching one of those!
All of which adds up to a summer schedule that looks like:
May 20 -
May 27 - R Utility Belt: ggplot / @lmguzman
June 3 - Coworking
June 10 - R Utility Belt
June 17 -
June 24 - R Utility Belt
July 1 - Canada Day Holiday
July 8 - R Utility Belt
July 15 -
July 22 - R Utility Belt
July 29 - Coworking
August 5 - R Utility Belt
August 12 -
August 19 - R Utility Belt
August 26 - Coworking
A couple of points:
Any other comments or requests, post 'em here - I'm psyched for this lineup, it's going to be a fun summer!
I can do ggplot for the R utility belt, but I will only be here until June 10 :)
@lmguzman awesome! Would you like the May 27 slot for ggplot?
Sounds good
How are the R Utility Belt modules saved? I could help contribute to the basic stats one, building it in RMarkdown through a shared repo if we want.
Also I told @minisciencegirl I could re-run the shiny/leaflet module if requested. I'm gone most of July, also for the Aug 12th and the June 17th :) I guess I could do the May 20 meeting. That seems soon since I just ran it last semester.
@BillMills Starting from July I can "broadcast" Machine Learning with Python you need just to specify the time that needs to be allocated for that (30 min? 1 hour ? more ? less ?). With it being online, you can feel free to spread the tutorials as you wish as I will be preparing a series of talks/demo online (youtube).
btw we probably need another issue with only the plan, as this one will go on and on quickly with comments :) @BillMills you agree ?
Fridays at 1 sounds good to me!
@SimonGoring: ggplot, dplyr + magrittr and tidyr + reshape2 have been taught previously and have repos for the notes; I believe @eacton has notes for knitr + rmd, and the rest are still dream stage - if you can help whip up multcomp + lm()
+ glm()
(or related), that would be awesome!
If those dates don't work, what about a coworking day? We can definitely drop / split a coworking to accommodate!
@radaniba that is awesome, I'm super psyched for this! We should definitely pull in Study Groups worldwide if all they have to do is tune in. Will they be live broadcasts so there can be a question period? Sounds good either way!
I would recommend <=1h. bite-sized 30 minute chunks would be even better, but as this is a pretty advanced topic, it might be hard to get where you want to go in only 30 min - but if you can, go for it!
SUPER IMPORTANT EDIT: Fill out this poll by May 6 to vote for the time you'd like to meet for summer session (May-August): http://whenisgood.net/zqr53wz
Hi folks,
We had an awesome lineup of sessions the past four months - but now that winter semester is wrapping up, it's time to wrangle a new set of lessons for the summer! So - what do people want to teach? What do people want to learn? Post suggestions here!