IeDEA-SA / WG-open-science

IeDEA-SA open science working group

Short info on the use of synthetic datasets #16

Open RPanczak opened 3 years ago

RPanczak commented 3 years ago

Synthetic datasets as an avenue for sharing IeDEA data.

Solutions:

  1. synthpop package

  2. synthetic function from WORCS package
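To make concrete what a synthetic dataset is, here is a minimal sketch in plain Python. It is not the synthpop or WORCS API, and it does not reproduce their methods: it only resamples each column independently, so per-column distributions are roughly kept while relationships between columns are lost. Tools like the two above fit models precisely so that those relationships can be retained as well. The function name `synthesize_independent` and the toy records are hypothetical, made up for illustration.

```python
import random


def synthesize_independent(rows, seed=0):
    """Return a synthetic table with the same columns and row count as `rows`.

    Each column is resampled with replacement on its own, so values only
    come from what was observed in that column, but the links between
    columns are broken. A synthetic row may still coincide with a real
    one by chance.
    """
    if not rows:
        return []
    rng = random.Random(seed)  # fixed seed makes the output reproducible
    columns = list(rows[0].keys())
    n = len(rows)
    sampled = {}
    for col in columns:
        observed = [row[col] for row in rows]
        # Draw n values for this column, independently of all other columns.
        sampled[col] = [rng.choice(observed) for _ in range(n)]
    # Reassemble the independently drawn columns into row dictionaries.
    return [{col: sampled[col][i] for col in columns} for i in range(n)]


# Hypothetical toy records, NOT IeDEA data.
real = [
    {"age": 34, "sex": "F", "on_art": True},
    {"age": 51, "sex": "M", "on_art": False},
    {"age": 27, "sex": "F", "on_art": True},
    {"age": 45, "sex": "M", "on_art": True},
]

synthetic = synthesize_independent(real, seed=42)
```

The result has the same shape as the input and can be shared to let others run code end to end, but any analysis of associations between variables on it would be meaningless, which is the gap the model-based packages aim to close.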

Connect with UCT on this, since there were/are some attempts to solve this issue on their side.

elianerohner commented 3 years ago

I'm attending a tutorial on "WORCS: A Workflow for Open Reproducible Code in Science" by https://github.com/cjvanlissa - super interesting! @RPanczak Have you heard about WORCS? There's an R package which also creates a synthetic dataset as one of the steps. Here's a link to their paper: https://osf.io/zcvbs/

elianerohner commented 3 years ago

Here's a short introduction to WORCS: https://www.youtube.com/watch?v=ysOxHYUWdFY

RPanczak commented 3 years ago

Thanks! Looks really interesting ❤️

There have been a couple of similar initiatives, sometimes under the name of 'research compendiums' (https://github.com/benmarwick/rrtools, https://github.com/cboettig/template, https://jdblischak.github.io/workflowr/index.html or even hacking it from scratch - https://sharla.party/post/usethis-for-reporting/).

The one you mention has some interesting shortcuts that could be super useful - just looking at that sounds very tempting as it automates several steps!

There are two problems with adoption tho. First, WORCS, like other solutions, relies heavily on git & GitHub, and we have some work to do in the team first in order to make git happen ;) Second, manuscripts in these workflows are usually written in markdown. Wonderful solution, but convincing team members to adopt it, particularly senior ones, is a real struggle that I have not been able to solve! 🙉 🙈 🙊

Not to discourage too much tho - if you find WORCS useful, I'd be happy to give it a spin in one of the projects? 💪 💪 💪

As for synthetic data, the function manual points to this paper, which I think uses this package. I had it on my list of things to review, so it's a good reminder to finally get back to it!

elianerohner commented 3 years ago

Ah, great to see these other initiatives! I'm just a newbie stumbling around in the open science world 🙃 However, once I've accumulated some skills, I'd be more than happy to help put effort into convincing the team 🤓 Step by step we'll climb the ⛰️