rasilab / github_demo

Demo GitHub repo for laboratory research
MIT License
6 stars 1 forks source link

Transform and library prep fk8 DMS library #2

Open kychen37 opened 1 year ago

kychen37 commented 1 year ago

GanttStart: 2023-04-20

Background

In https://github.com/rasilab/github_demo/issues/3 and https://github.com/rasilab/rqc_aggregation_aging/issues/101, we identified and validated FK as a stalling dipeptide. In https://github.com/rasilab/rqc_aggregation_aging/issues/117 we identified other possible targets of RQC (FK and others) using the 8x dicodon (pHPSC1142) library in hel-del cells. In https://github.com/rasilab/rqc_aggregation_aging/issues/119, we identified endogenous genes with FK/etc stalling type motifs. In https://github.com/rasilab/rqc_aggregation_aging/issues/120 we designed libraries to test these endogenous motifs and DMS libraries In https://github.com/rasilab/rqc_aggregation_aging/issues/121 we cloned these libraries as a pool and in https://github.com/rasilab/rqc_aggregation_aging/issues/122 I transformed/library prepped them. In https://github.com/rasilab/rqc_aggregation_aging/issues/124 we analyzed the deepseq data and found that the FK8 DMS library had very little representation in the library. In https://github.com/rasilab/rqc_aggregation_aging/issues/129 we re-cloned the fk8 library alone (oKC224) into the library reporter without barcodes (pHPSC1163). Colony PCR + sangseq indicates that only ~12.5% of this library can be expected to be correct, so 6-8K (over 1184 variants) while 40-60K are either parent or have an incorrect insert.

Here I will tranform into WT and Hel2-del yeast and prepare libraries to redo this sequencing experiment.

Strategy

Experiment Links

first replicate: https://github.com/rasilab/rqc_aggregation_aging/blob/master/experiments/kchen_exp74_redo_fk8_dms_deepseq.md

second replicate: https://github.com/rasilab/rqc_aggregation_aging/blob/master/experiments/kchen_exp75_redo_fk8_dms_deepseq_rep2.md

library prep: https://github.com/rasilab/rqc_aggregation_aging/blob/master/experiments/kchen_exp76_library_pre_fk8_dms_replicates.md

snapgene map: https://github.com/rasilab/snapgene_maps/blob/master/DNA%20Files/lab_database/kchen/illumina_amplicons/IKCSC12_R1_fk8_umi_R2.dna

Brief conclusion

In the analysis we felt there was evidence of RT/PCR bias because T/C containing codons were preferentially low in some of the conditions. Other conditions did not have this bias and looked good. I tried redoing the library prep using the correct primer (https://github.com/rasilab/rqc_aggregation_aging/issues/135#issuecomment-1597885161, https://github.com/rasilab/rqc_aggregation_aging/issues/135#issuecomment-1603766677), but the RT and noRT samples no longer separated for most samples. Since the mRNA and gRNA samples are prepped the exact same way with regards to RT and PCR, any bias should normalize out, so we decided not to redo and kept the data from this run.

Checklist before closing issue

kychen37 commented 1 year ago

I was thinking about our conversation re replicates and I think it's worth doing 2 more biological replicates (aka two more transformations) so there are 3 total replicates. That will be 12 libraries total and ~60million reads, let me know if you don't think it's worth it @rasi

rasi commented 1 year ago

@kychen37 Yes, that is a good idea if it is not too much work.

kychen37 commented 1 year ago

Update

@rasi I plan to library prep both replicates next week following strategy outline above, 4 libraries per replicate x2 = 8 libraries total. These should be ready by end of next week.

kychen37 commented 1 year ago

Update

@rasi The cDNA samples that had RNA as input took more diluting to get reasonable CTs than I was expecting (usually a 1:8-1:16 dilution is good enough for getting CTs <20, but this time requires at least a 1:32 dilution). With this dilution the PCR reaction will be very large, so I have to go through the logistics of the library prep in more detail, so I think these will actually be done closer to end of next week

rasi commented 1 year ago

@kychen37 I am not sure I understand the comment above. Does this mean you have more RNA going as input into RT, in which case you can get away with smaller PCR reactions?

kychen37 commented 1 year ago

Update

@rasi

kychen37 commented 1 year ago

Update

@rasi final libraries:

label cDNA sample RNA cycles variants target/variant target ng/uL
76lib1 76cDNA1 WT RNA 17 1024 10,000/variant 10 million 5.76
76lib2 76cDNA3 hel2 RNA 17 1024 10,000/variant 10 million 4.42
76lib3 76cDNA5 WT RNA rep2 17 1024 10,000/variant 10 million 4.2
76lib4 76cDNA7 hel2 RNA rep2 17 1024 10,000/variant 10 million 2.46
76lib5 76cDNA9 WT gRNA 10 1024 10,000/variant 10 million 0.634
76lib6 76cDNA11 hel2 gRNA 10 1024 10,000/variant 10 million 0.848
76lib7 76cDNA13 WT gRNA rep2 10 1024 10,000/variant 10 million 0.878
76lib8 76cDNA15 hel2 gRNA rep2 10 1024 10,000/variant 10 million 0.95
Screenshot 2023-05-30 at 3 35 44 PM tapestation_5_24

I was initially concerned that the double band in the tapestation might be incomplete libraries, but even when I tried doing a final extension of 15m at 72C the double band persisted. The double bands are close enough in size that the actual tapestation quantification doesn't say there are two bands (which it did when there were incomplete libraries in my previous runs). So I don't think the double bands are incomplete libraries, they are likely the spike-ins I used, since the 48nt inserts in these spikeins were truncated they are slightly smaller than the real library. Because there are only 1024 variants in this library, I spiked in at higher concentration than I previously have (1/1024 vs 1/200,000), which is why I think they are showing up on tapestation.

Overall I think these libraries are ready to sequence. https://github.com/rasilab/protocols_tutorials/issues/80#issuecomment-1569408268