Open pombase-admin opened 15 years ago
Original comment by: ValWood
The gap-filling sequence on the left arm of chromosome 2 in fission yeast Schizosaccharomyces pombe Yeast Volume 25, Issue 9, Date: September 2008, Pages: 673-679 Mayumi Sasaki, Alimjan Idiris, Aya Tada, Hiromichi Kumagai, Yuko Giga-Hama, Hideki Tohda
was there another gap filling sequence?
Original comment by: ValWood
Include i) all frameshifts/ sequence errors recorded ii) versioning 27D7 repeat
Original comment by: ValWood
notes transferred from issues/27
• Hideki, gap filling ? • Sasaki M, Idiris A, Tada A, Kumagai H, Giga-Hama Y, Tohda H. Yeast. 2008 Sep;25(9):673-9.PMID: 18727152 new telomeric clone pending http://www.sanger.ac.uk/Projects/S\_pombe/sequence\_updates.shtml http://www.sanger.ac.uk/Projects/S\_pombe/sequence\_discrepancies.shtml
from trac ticket: https://sourceforge.net/apps/trac/pombase/ticket/30
read in UTR files to ensure sequence updates propagated
This is in PMID: 24929437 should already be in be in the main changes
We detected translation of the C terminal part of the Nup184 protein, which was not predicted to be translated in S. pombe due to an in-frame stop codon created by a single nucleotide deletion32. Translation of this region in the strains we used was explained by the presence of single nucleotide insertion that reverted the effect of the deletion in the reference sequence (Supplementary Fig. 5A and 5B).
From https://www.ebi.ac.uk/panda/jira/browse/PB-507:
Make sure AB325691 contig gets incorporated into chromosome 2.
(Full text of ticket description: "This small contig appears as chromosome AB325691 http://www.pombase.org/spombe/result/SPBC460.05 it is actually a small chromosome 2 contig (left end I think) How easy is it to add a chromosome 2 assignment (in the next assembly this will be built into the chromosome so I have put this at low priority)")
The sequence updates will need to be low priority. They won't be able to happen until we have a 'lift over' equivalent ...world of pain...
Very partial list in random order:
High priority datasets to host for refining gene structures: Lantermann data hosted https://github.com/pombase/website/issues/62 Dutrow https://github.com/pombase/website/issues/63 CAGE data? (Olaf) https://github.com/pombase/website/issues/368
Other tasks for after sequence changes: Fixing alleles in genes where sequence has changed Remapping existing datasets with lift-over Update EMBL files -> RNA central check Folder on desk top check Folder in e-mail check Open tickets. check actions from planning meeting Agenda
Identified sources of sequence revisions Make revisions to sequence proposed by broad and confirmed by Li-Lin DU and ??? Revise gene structures accordingly Merge in telomeric contig revise UTRS and non coding RNA structures etc Issues
What to do about mating type region fo reference strain What to do about mozaic nature of sequence (problems for EMBL submission) Making and EMBL dump, ensuring the RNAs and Protein info required by UniProt? and RNA central is present etc planning to synchronise with EG
also a list of SNPs from Megan Behringer, I put Ys in who identified them first. The Y’s in the Broad column are all the mutations shared between my two isolates that were first identified as errors by the broad. Then the second column are errors not identified by Broad but identified by Hu et al. The third column are SNPS not previously reported by Broad or Hu et al, and are newly identified in our study.
"mutation accumulation study in S. pombe (Behringer and Hall, 2016) and am currently finishing up a review article comparing mutation results in our study and a study by Farlow et al (2015). In comparing the ancestors used in both our studies, not only did I identify 171 differences shared in our ancestors that were present in the list of 190 discrepancies from the Broad sequence reported on your 'Sequence Updates Pending' page. I also identified an additional 36 differences shared in our ancestors that were not included in your list. Since the other study was done in an H+ line and our study was performed using 972 h-"
Add new telomeric region to contigs (are there 2 now?)
Do fixes small which are confirmed http://www.sanger.ac.uk/Projects/S\_pombe/sequence\_updates.shtml http://www.sanger.ac.uk/Projects/S\_pombe/sequence\_discrepancies.shtml
Can we fill any gaps using Broad data? Speak to Nick Rhind
To Do Give community advance notice of contig changes Make Fixes Update Stats/ webpage (sequencing status etc)/ download data EMBL resubmission
Original comment by: ValWood