genophenoenvo / neon-datasets

Repository for code and derived data from NEON data products
MIT License

Curate and standardize black cottonwood phenophases #56

Closed jessicaguo closed 2 years ago

jessicaguo commented 2 years ago

Follow up to #50

jessicaguo commented 2 years ago

I explored the Placerville 2010 Bud Set data, which include 3 sets of sampling dates. However, even when restricting to the most data-rich genotypes (9 or more points), only some span the full range of bud set levels (1-6). (Figure attached: image.png)

I am not sure how to derive a response variable from these data. More than 1000 genotypes were planted at this site, but each genotype was replicated only 3 times (blocks). 474 genotypes were sampled 3 times, so with 3 replicates there are at most 9 data points per genotype from which to derive a phenotype. @dlebauer, any thoughts on how to proceed?

KristinaRiemer commented 2 years ago

We did talk the other day about one option, of binning all the responses into two categories, e.g., bud set values below 3 vs bud set values 3 and above. We just need some sort of justification for choosing what those categories are.
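To make the binning idea concrete, here is a minimal Python sketch. The threshold of 3 is only the example value mentioned above, not a justified cutoff, and the function names, bin labels, and scores are all made up for illustration.

```python
from collections import Counter

def bin_bud_set(score, threshold=3):
    """Assign a bud set score (1-6) to one of two bins.

    The threshold is a placeholder; choosing it still needs justification.
    """
    return "at_or_above" if score >= threshold else "below"

def bin_counts(scores, threshold=3):
    """Count how many scores fall into each of the two bins."""
    return Counter(bin_bud_set(s, threshold) for s in scores)

# Made-up scores for a single genotype, not real Placerville data.
example_scores = [1, 2, 3, 3, 5, 6]
counts = bin_counts(example_scores)
print(counts["below"], counts["at_or_above"])
```

Passing a different `threshold` makes it easy to check how sensitive the two bins are to the cutoff.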

jessicaguo commented 2 years ago

For phenophase data, I feel we need some kind of date as a response variable. It's also not great that the plants were surveyed at most 3 times, because they clearly missed the transition period for some genotypes.
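One possible way to get a date out of scored surveys is to interpolate the day on which a plant first reaches a chosen bud set level. The Python sketch below is only an illustration under strong assumptions: scores increase over the season, change is roughly linear between visits, and the threshold of 3 is arbitrary. The function name and the example observations are hypothetical.

```python
def crossing_day(observations, threshold=3):
    """Estimate the day of year when the score first reaches `threshold`.

    observations: list of (day_of_year, score) pairs for one plant.
    Returns None when the surveys do not bracket the transition,
    which is the "missed the transition period" case.
    """
    obs = sorted(observations)
    if not obs or obs[0][1] >= threshold:
        # No data, or already at/above threshold at the first survey.
        return None
    for (d0, s0), (d1, s1) in zip(obs, obs[1:]):
        if s0 < threshold <= s1:
            # Linear interpolation between the two bracketing surveys.
            return d0 + (threshold - s0) * (d1 - d0) / (s1 - s0)
    # Threshold never reached within the surveyed dates.
    return None

# Made-up observations: (day of year, bud set score).
print(crossing_day([(250, 1), (264, 2), (278, 4)]))
print(crossing_day([(250, 4), (264, 5)]))
```

With at most 3 surveys per plant, many plants would return `None` here, which is one way to quantify how often the transition was missed.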

At our meeting yesterday, I asked Kevin to look into grouping the genotypes into haplotypes or populations. I also think I need to schedule a meeting with Anne about the data sooner rather than later.