Open kescobo opened 3 years ago
Thanks for these good, solid ideas. Identifying a domain and a data-set seems a bit chicken-and-egg. But as you say, first issues #1 first, and learning objectives are the main game, then applying backward design.
A central resource for us generally should be the The Carpentries Curriculum Development Handbook, which has a particular section on Picking a dataset. There are ten (!) criteria to guide the process, so more ideas the better in terms of serving the learning objectives and matching up to the criteria.
FYI, my field is optimisation of energy systems, including working with databases of generators, transmission networks and associated time-series of energy generation, utilisation and demand. There may be suitable datasets in there or not. I'm fine working with data outside my area.
which has a particular section on Picking a dataset.
:sheepish: Probably a good idea to RTF(course development)M... thanks for pointing that out! I have a lot of reading to do :laughing:
Oh, so do I. I am only just familiarising myself with the "CDH" (as some abbreviate it to).
Here is a potentially interesting and neat data set: Genomic Outposts Serve the Phylogenomic Pioneers: Designing Novel Nuclear Markers for Genomic DNA Extractions of Lepidoptera
I know nothing about the field .... but everyone likes butterflies: Tree of Life: Nymphalidae. There are some great visuals and graphs, a fairly detailed appendix / supplementary data, with a mix of applicable data types (numbers, text, hierarchical types thereof). We would have to clarify that the dataset could be released under a CC0 license.
One last thing : the julia butterfly is a species of Nymphalidae ;)
Ha! Nice :-). Works for me, and I'd be happy to write up the biology sections. But the conclusions of that paper might be a bit advanced for a non-biologist. I'll dig in a bit more and see if I can come up with a more simplified version.
Though (in my opinion) less important than #1, we nevertheless need to identify what knowledge domain we will use as a backbone for the lesson. In PM, I wrote:
Despite that last point, a coronavirus - based project could definitely fulfill the first 2, especially since it could include both epidemiology as well as biological (sequencing) data types. I'm a little biased here though, as this is my field and I'm already developing some other materials along this line that I could double-dip :-D
Other possibilities: