lizzieinvancouver / ospree

Budbreak review paper database
3 stars 0 forks source link

checking two outlier species #427

Open lizzieinvancouver opened 2 years ago

lizzieinvancouver commented 2 years ago

@MoralesCastilla I wanted just to see if anything was weird with the two species that are outliers.

I ran your phylo_ospree_plots.R code until the ff dataframe then:

unique(ff$datasetID)
> [1] "man17"      "nanninga17"
t <- subset(d, datasetID=="man17" | datasetID=="nanninga17")
ggplot(t, aes(x=chill.hrs, y=resp, group=spps, color=spps))+geom_point()+geom_line()+facet_wrap(.~datasetID)

And the plot is: quickplot.pdf

I didn't save the code or plot beyond here, but I can see how these species could seem extreme given that they only show up in these two studies (and Man17 has pretty extreme responses). I don't see anything 'off' about the data, but those 200 day values are pretty extreme so I looked back at the paper and found this:

For seedlings that did not flush throughout the experiments, a maximum heat accumulation was calculated from the start of the experiment to May 31 when trees growing outdoors had begun flushing.

I think maybe we should not use any points with maximum heat accumulation, but I am not sure how to find those data points!

I found this detailed issue #361 so I was hoping @cchambe12 or @dbuona might remember something? I honestly don't so I think we could also use this as a reason to just exclude these data in this analyses and put the other plots (with these two outliers) in the supp.