EstherPlomp / TNW-RDM-101

Self paced materials of the RDM101 course
https://estherplomp.github.io/TNW-RDM-101/
Creative Commons Attribution 4.0 International
6 stars 2 forks source link

Overview issue Olivier Witteveen #118

Closed olivierwitteveen closed 8 months ago

olivierwitteveen commented 9 months ago

Introduction

Hi, my name is Olivier and I am a PhD student in theoretical biophysics at the Bionanoscience department.

Describe your research in 2-3 sentences to someone that is not from your field

I use tools from information theory and statistical physics to understand biological systems, particularly gene expression. The expression of genes needs to be regulated precisely, especially when these genes encode proteins that are important for an organisms to develop in a healthy way. How do organisms maintain this regulatory precision, despite the noise in molecular signaling networks?

My research entails the following aspects:

Research Aspect Answer
Use/collect personal data (health data, interviews, surveys) No
Use/collect experimental data (lab experiments, measurements with instruments) Yes
Collaborate with industry No
Write/develop software as the main output of the project No
Use code (as in programming) for data analysis Yes
Work with large data (images, simulation models) Yes
Other: N/A

Reflections on the importance of RDM videos

I do share the anxiety that a small mistake in one's research leads to catastrophic consequences. I wouldn't want to be deceived by a mistake I made myself. This is, of course, where Open Science and reproducibility plays an important role -- one can always prove you had good intentions, plus having more eyes on the back-end of the project may prevent these mistakes in the first place. Further, losing data is always a scary concept but in principle everything I do should be backed up in the cloud. But perhaps during this course I will learn how to minimise this risk even further.

What would you like to learn during this course?

I find it difficult to organise my data and scripts in a way where I keep what is important, and remember what everything was for. Especially in early stages of research projects, one might try many things that you may not use in the end. But I'll still keep all the scripts in case it's useful later. I now have folders called "old" in most of my projects' directories that keeps all the out-dated things, but obviously this isn't a super clear way of keeping track. I do find myself digging in this folder surprisingly often, so I would like to learn a better way of handling this. Further, I'd be interested to learn how to better share and communicate code/data with collaborators.

Checklist assignments

AimeeKok commented 9 months ago

Hi Olivier,

some feedback for Assignment 2: I would recommend using GitHub for version controlling your notebooks (it is not overkill :P), though searching back through old commits can also be a hassle if you don't write proper commit messages (I'm very guilty of that lol). I'm not sure how iCloud specifically works, can you access it like a folder on your laptop? Because in that case you can set up a daily automatic backup to any other location (your OneDrive folder e.g.) with a backup tool like borg (Pietro showed me this, might be nice!)

EstherPlomp commented 9 months ago

Thanks for sharing assignment 2 @olivierwitteveen! And thanks also for the helpful feedback @AimeeKok! Version control is rarely an overkill indeed :)

I think this is a very nice/clear overview!

Some pointers/suggestions for your assignment:

maartjespaans commented 8 months ago

Hi Olivier, Thanks for sharing your data flow map. I especially appreciate your thinking about where to store your notes. It made me realize that I actually should also think about where and how I store my notes, so thanks for that! :)

AimeeKok commented 8 months ago

Hi Olivier,

some feedback for Assignment 3: Your folder structure is really clear! (and seems like it was already pretty well organized in the Folder Structure discussion haha) Also with the documentation it seems you were already implementing a lot of good practice and the additions will only help make it better. Also not from your side but the experimentalist side, I wonder why they just not use .csv files in the first place? Anyway, everything else looks good as well. Great job! 👍

maartjespaans commented 8 months ago

Hi Olivier, Thank you for sharing your dataflow map. I think you have a proper and well organized data organization. One advice for your naming convention, if you use the order YYYYMMDD, your files will be automatically ordered by date. For me that works best to keep my folders organized. Hope this is useful. :)

EstherPlomp commented 8 months ago

Thanks for sharing assignment 3 @olivierwitteveen! It again looks very nice, clear and comprehensive: well done! Especially the folder structures as Aimee and Maartje are highlighting! Thanks both for your feedback as well!

I therefore have very little feedback: