Closed ewallace closed 3 years ago
Another problem with this is that it takes several minutes to run the codon_count
code earlier in the script, which creates a major incentive to not debug it.
Best to fix this by computing codon counts in a different script, count_codons.R
, and just reading in the table to train_half_life_linear_model.R
.
I have attempted to address in commit bcb319b, branch train_halflife_problems. Let's discuss.
Sam will check the train_halflife_problems branch and add if there are no bugs
I could not get
train_half_life_linear_model.R
to run in the current master branch. There were a lot of problems and this needs to be fixed soon.The problems start at around line 119 (master):
There are a series of problems:
transcriptName
andgeneName
used inconsistently; fix by picking one and using throughoutmutate(UTR3_length = str_length(threePrimeUTR))
is applied to a tibble that does not havethreePrimeUTR
; probably fix by defining earlier inyeast_3UTRs
and passing forwards to other tibbles that use it