swulfing / STOICH.Aim1

2 stars 2 forks source link

What variables to keep #5

Closed diatomdaniel closed 2 years ago

diatomdaniel commented 2 years ago

I was reviewing Linnea's preliminary data analysis for the current data (thanks Linnea, great job) and wanted to start a discussion re. which variables we keep. Linnea computed the % occurrence of all variables in the data set and following variables have an occurrence of <5%: PN, POC, PP, TC, DND, TDP, TPC, TPN and TDP; other variables such as Fe, Manganese, NH4, TOC, TON occur less than 50% of the time. I think for the sake of any analyses we run, we should discuss removing rare/uncommon variables. This would also make data wrangling, etc. easier going forward. Can you bring this up at today's meeting please (03/25/22) @mcstreamy? Thanks

LinneaRock commented 2 years ago

Agreed!

mcstreamy commented 2 years ago

Will do @diatomdaniel

diatomdaniel commented 2 years ago

I'm gonna close this issue as I am assuming we will only really work with NO3, DOC and PO4 (maybe TN, TP). Given the difficulties we are having modelling the data atm I don't think including more variables is a good idea (for now),