catalystneuro / leifer_lab_to_nwb

Conversion scripts for the Leifer lab. Includes the publication Neural signal propagation atlas of Caenorhabditis elegans (Nature, 2023).
BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Subject metadata #11

Closed CodyCBakerPhD closed 2 weeks ago

CodyCBakerPhD commented 3 months ago

@emosb Related to the general question in #10, but there are many other fields NWB would like to have about each of the subjects used in the dataset

For each subject, we would need to have

We would also appreciate

emosb commented 3 months ago

Generally somewhere between 2 and 5 subjects are taken per day, so an explicit counter or subject_id could be added. Unless otherwise stated, the sex is Hermaphrodite. I put a "Male" tag in the logbook for all of my data on male worms - I can put the specific path to this information in a later comment. Age is included in the same logbook for all subjects. Cultivation temperature for all subjects is 20 degrees Celcius. Could you please clarify growth_stage fields, as I am unsure how this differs from age.

CodyCBakerPhD commented 3 months ago

Cool, sounds like I just need this logbook then

The descriptions of those last fields are

emosb commented 2 months ago

The logbook.txt file should be included now in the Globus data. The logbook.txt reflects the entire day of recordings (i.e. multiple subjects). A new log is appended each time the multicolor program is run to get the multicolorworm_###### data, with user input data like the strain name, days on dex, and growth stage of worm. We do not record the amount of time in the current growth stage, but for us YA indicates at most one day post L4 (for hermaphrodites we look for non-eggy adults, indicating less than 16 hours since their L4 molt).

CodyCBakerPhD commented 2 months ago

@emosb Excellent, thank you - this looks good to me

One question - as you continue to upload data for all the subjects used in the paper, can the outer folders follow the convention I see on the D:\Data drive mentioned in the logs? (that is, %y%m%d/...)

This will make it easier to iterate over sessions during the conversion

CodyCBakerPhD commented 2 months ago

@emosb Pinging on this again - any progress on uploading the data for the other subjects used in the paper?

emosb commented 2 months ago

I will start transferring other subjects - I will restrict to hermaphrodites for the moment that are included in the Nature data. I will name the outer folders with the %y%m%d convention as well with the subject number indicated within the inner folders. As they are transferred please let me know if you want me to modify the naming convention for the subject folders

emosb commented 2 months ago

To clarify, do you still want these to be partitioned into raw and post-processed files. This is the most time limiting step for me. The original format of the data (and the quickest to transfer) has the %y%m%d format for the outer folder already, but does not have inner folders labeled with subject_id

CodyCBakerPhD commented 2 months ago

To clarify, do you still want these to be partitioned into raw and post-processed files. This is the most time limiting step for me. The original format of the data (and the quickest to transfer) has the %y%m%d format for the outer folder already, but does not have inner folders labeled with subject_id

I suppose just transfer 'as-is' for now with minimal alterations; as long as both the raw and post-processed data are included under some indicator of what constitutes an associated session I can hopefully figure it out or if not ask for more details in order to figure it out

emosb commented 2 months ago

Alright, I have transferred another 4 subjects - labeled with subjectid#, where the # corresponds to the order of the published data. Each subject_id folder contains the multicolor and pumpprobe folders for the given subject, though not partitioned into raw and preprocessed. I will continue transferring further data - but let me know if there are alterations to the formating or clarification needed.

CodyCBakerPhD commented 2 weeks ago

Will be solved via the upcoming YAML compilation in #16