EstherPlomp / TNW-RDM-101

Self paced materials of the RDM101 course
https://estherplomp.github.io/TNW-RDM-101/
Creative Commons Attribution 4.0 International
6 stars 2 forks source link

Assignment 1 Pablo Velazquez #63

Closed PabloVelazquezGarcia closed 1 year ago

PabloVelazquezGarcia commented 1 year ago

Introduction

Hi there! I am Pablo Velazquez and I am a PhD candidate in the biotechnology department and in the biocatalysis group. I Love sports and cooking :D

Describe your research in 2-3 sentences to someone that is not from your field (please avoid abbreviations)

My research focuses on the design and testing of new enzymes with potential applications in chemical and pharmaceutical processes. Using protein engineering tools, we aim for a new and greener and more sustainable chemistry.

My research entails the following aspects:

Research Aspect Answer
Use/collect personal data (health data, interviews, surveys) /No
Use/collect experimental data (lab experiments, measurements with instruments) Yes/
Collaborate with industry Yes/
Write/develop software as the main output of the project /No
Use code (as in programming) for data analysis /No
Work with large data (images, simulation models) Yes/

Reflections on the importance of RDM videos

What would you like to learn during this course?

I would like to learn how to manage different kinds of data, in different formats in a easy and intuitive way.

Checklist assignments

Checklist assignments

PabloVelazquezGarcia commented 1 year ago

Assignement 2

https://surfdrive.surf.nl/files/index.php/f/13024904754

PabloVelazquezGarcia commented 1 year ago

Assignement 3 Step 2

  1. What folder structure will you use for your project? (and how does the data fit in there) The folder structure that I am following is a hierarchy structure with NameProject as the main folder, followed by 5 subfolders (Bibliography, Experiments, RawData, Protocols, Miscelaneous). In "Experiments" and "RawData" there are more subfolders arranged by experiment.
  2. What naming convention are you going to use? (show us an example based on your expected dataset) YYYY-MM-DD_Experiment_Particularity_Version
  3. Additional remarks I tend to change the conventions every now and then. Please dont Step 3
  4. What type of documentation do you need to generate/write? (metadata, data collection process, data dictionary, code versioning, etc.) In my experiments I will need to generate data related with genetic sequencies, 3d models of enzymes, kinetics, graphs and spectroscopy data. I will be creating a data dictionary for my reactions (each reaction has a reaction code, which makes it easier to come back for the data later on.
  5. Will you use a metadata standard to document your data? For now, I dont think I will be needing this. I feel the simplest and clearest way of keeping and organizing my data is the best for me 3.What tools are helpful to generate/write the documentation needed? I use both a notebook and a electronic lab journal. I also write down specific codes for different experiments so that I can easily know what is the experiment about and I can find it later on (also new people can).

Step 4

  1. Are your files in an open file format? Or, is the code in an open programming language? My files are totally open format
  2. Can you convert the proprietary file formats to an open file format? If yes, to which open file format? Is there anything more open than excel, word, pdf...?
  3. If the data/code is in a proprietary format, what information/software would others need to re-use the data/code?

Step 5

  1. Who will have access to this dataset during the project? Everyone in my research group

  2. If others than you will have access to the dataset during the project, how will you share the data? In the shared drives of the TU Delft (M)

  3. At the end of the project this dataset can be: Open - no confidential information in it , it can be published In which repository will you publish it? 4TU.RESEARCHDATA

PabloVelazquezGarcia commented 1 year ago

Assignement 3 https://surfdrive.surf.nl/files/remote.php/webdav/RDM101_Assignment1_Week1_DataFlowMap_PabloVelazquez.pptx https://surfdrive.surf.nl/files/remote.php/webdav/PABLOVELAZQUEZ_DATASET_Readme.txt

EstherPlomp commented 1 year ago

Thanks for sharing part of your assignment 3 @PabloVelazquezGarcia! I can't access the files you shared via SURFdrive - can you check again whether you have set up a public link to share the files? (See here for the steps you'll need to take). The same comment still applies to your assignment 2 as well. Please let me know if we need to set up a quick call with some screensharing to get this sorted!

Regarding your comment on open files: word/excel files are not an open format because you'll need a microsoft license to fully interact with these file types. Alternatives would be .txt or .md files for word, or .csv for Excel!

And good to see you're already documenting everything via (electronic) lab notes!

PabloVelazquezGarcia commented 1 year ago

Assignment 3 (password: datamanagement)

https://surfdrive.surf.nl/files/index.php/s/JrFP3v7TVOAjmyn https://surfdrive.surf.nl/files/index.php/s/Z0ClxI87ncsQsyF

EstherPlomp commented 1 year ago

Thank you @PabloVelazquezGarcia! These links now work! It looks like these are your assignment 2 and readme file. Please also provide a link to assignment 3, and then I can provide feedback later today (hopefully - otherwise tomorrow :)).

EstherPlomp commented 1 year ago

And the feedback! Hope it is helpful:

Assignment 2:

README: