UBC-STAT / stat545.stat.ubc.ca

Repository that produces the STAT 545 @ UBC website
https://stat545.stat.ubc.ca
Creative Commons Attribution 4.0 International
41 stars 83 forks source link

Ideas for next year #74

Open vincenzocoia opened 4 years ago

vincenzocoia commented 4 years ago

Starting a thread to put ideas for next year.

Tibble Joins lecture

General Idea

For the tibble joins lecture today, I did a little segue into a higher-level discussion about where tibble joins often show up in practice. I think it was useful for framing the topic, but it also brought up other aspects of writing a data analysis that's hard to treat as its own topic, such as using mass[1] vs unique(mass) to retrieve island land mass.

vincenzocoia commented 4 years ago

here::here() lecture

Definitely important to go over the notion of running code interactively vs. from source. Too hard to explain this at the same time as trying to explain why here::here() sometimes gives different things between the two.

EDIT: I should avoid this topic, because it is discussed later in 545B in the "Automation" topic.

vincenzocoia commented 4 years ago

Worksheet 1B

I think there might be too much emphasis on having functions print() things as opposed to just outputting a character vector.

vincenzocoia commented 4 years ago

Mini Data Analysis project

Instead of allowing them to work with any dataset, choose 3 that will definitely work nicely with the questions posed in the project.

If you are invested in a dataset of your own, encourage them to use it, but also describe it so that the teaching team can understand it. Perhaps even get them to put it in a data folder with a README describing each column -- or at least, the most important ones.

vincenzocoia commented 4 years ago

Mini data analysis milestone 2

Questions ended up being too specific -- would be more effective to generalise. This milestone was also too demanding for students.

vincenzocoia commented 4 years ago

Mini data anlysis milestone 3

People were copying their knitted md files into the output folder. Indicate not to do this (it's not reproducible anyway)

dy-lin commented 3 years ago

Feedback for Assignment 1-B in the private instructor repo: https://github.com/UBC-STAT/stat-545-instructor/issues/24

vincenzocoia commented 3 years ago

Assignment 1-B

vincenzocoia commented 3 years ago

Policies

vincenzocoia commented 3 years ago

Assignment 2-B

install_github(build_vignettes = TRUE, ref = "0.1.0")
vincenzocoia commented 3 years ago

General assignments

vincenzocoia commented 3 years ago

General topic flow

At least for 545B, where topics span 2 days, it may be useful to cut out some instruction on more details, and end with a 10-20 minute presentation on expanding the topic. This would allow me to at least plant the seed for topics that I wish we could cover in this course, like Rmd presentations, bookdown, GitHub pages, etc.

dy-lin commented 3 years ago

Assignment 4-B: Makefiles

Using /usr/share/dict/words as a dependency for words.txt is causing issues for Windows users. Looks like this has consistently been a problem in the past years. I advised the current students to adapt this issue into code, and download words.txt from here:

words.txt:
    Rscript -e "code to download words.txt here"

It probably doesn't even need to be R code-- I'd accept using wget or curl for the students savvy with the command-line too, although it is good exposure to RCurl.

UPDATE: I just checked this year's Makefile, and it looks like there was a commented out line for them Windows users. Maybe we could write extra comments in the Makefile to let the students know to use that line instead if they're on a Windows machine.

vincenzocoia commented 3 years ago

Assignment 4-B: Makefiles

This assignment is not Windows-friendly. Should teach remake instead of make, and just allude to make at the end.

vincenzocoia commented 3 years ago

Assignment 3-B:

vincenzocoia commented 3 years ago

Assignments 1 and 4

vincenzocoia commented 3 years ago

Assignment 5-B

strings assignment: there was an article that many people drew on for removing stopwords. It's OK to draw from it, but many people didn't cite it. I should use this as an example of how and when to cite code.

List the Gutenberg project as being a source of freely available books: https://dev.gutenberg.org/

vincenzocoia commented 3 years ago

Overall

Extensions:

Assignment feedback: there should be a document outlining procedures:

Improving the stat545 content:

Others:

Team leading:

dy-lin commented 3 years ago

Clarification on Assignment 4:

Overall clarifications:

dy-lin commented 3 years ago

Overall, I think there was quite a bit of 'hand-holding' in 545A, with the checklists etc. It shouldn't be our responsibility if the students miss certain criteria for not reading carefully. I also think that the students should be able to just download our .Rmd files for assignments, and not need us to create a pull request for each assignment. Although if this is the direction we want to continue on, and if the course coordinator is git-savvy, this could be one of their consistent tasks.

vincenzocoia commented 3 years ago

Here's a fun dataset that would be useful to introduce dplyr and ggplot2 -- an analysis of pride parade entries across time: https://github.com/GaytaScience/PrideParades

vincenzocoia commented 3 years ago

Yulia recommends ProjectTemplate as an alternative topic to Makefiles.