Closed ValWood closed 2 years ago
@kimrutherford
please could you redo me the query to get the list above (all exists_during extensions)
but this time, exclude any which are in this branch of GO? GO:0022403 cell cycle phase
Did I create that file (exists_during.txt) as part of an issue? Do you have the issue number?
I think I made that file from the GO GAF...
Ah, OK. I'll was going to re-use my previous query if there was one.
No problem, I'll have a go at a Chado query for it.
Does this have what you need?: exists_during_annotations.txt
Note to self, here's the query:
select f.uniquename, t.name
from feature_cvterm fc join feature f on f.feature_id = fc.feature_id
join cvterm t on t.cvterm_id = fc.cvterm_id
where t.name like '%[exists_during]%' order by t.name;
Perfect. It does not omit the ones in the cell cycle phase branch, but because I can see the terms I can easily remove those manually in a couple of minutes.
[ ] annotations with [exists_during] meiotic cell cycle (GO:0051321) need to migrate to "[exists during] meiotic cell cycle phase (GO:0098762)" (90)
[ ] annotations with [exists_during] mitotic cell cycle (GO:0000278) need to migrate to "[exists during] mitotic cell cycle phase (GO:0098763)" (167)
[ ] annotations with [exists_during] meiosis II (GO:0007135) need to migrate to "[exists during] meiosis II cell cycle phase (GO:0098765)" (12)
[ ] annotations with [exists_during] meiosis I (GO:0007127) need to migrate to "[exists during] meiosis I cell cycle phase" (GO:009876) (13)
Would it help to know which curation session each annotation comes from?
Yes but I was going to ask if these could be migrated automatically. Is that possibe/easy. It will take a long time to change these. Most will need to be looked at, or it will be as quick to do manaually.
If so I will open a chado ticket to deal with this.
Yes but I was going to ask if these could be migrated automatically.
Migrated to a different extension? We can do that automatically
If so I will open a chado ticket to deal with this.
It's a Canto issue really.
automatic migration moved to https://github.com/pombase/canto/issues/2141
If you could rerun this query providing the session links.
Is it possible to rerun this query providing the session links?
and excluding any descendants of "GO:0022403 cell cycle phase" these ones I need to fix are a subset of this reduced list
I can fix these now
[x] SPBC14F5.08.1 | nuclear euchromatin [coincident_with] SPAC23C11.16-promoter [exists_during] M phase -> mitotic M-phase
[x] SPAC23H4.17c.1 | nuclear euchromatin [coincident_with] SPAC6G10.12c-promoter [exists_during] M phase
[x] SPBC14F5.08.1 | nuclear euchromatin [coincident_with] SPAC821.08c-promoter [exists_during] M phase
[x] SPAC23H4.17c.1 | nuclear euchromatin [coincident_with] SPAC821.08c-promoter [exists_during] M phase
[x] SPCC1223.06.1 | cytoplasmic microtubule plus-end [exists_during] interphase WHICH ONE
[x] SPBC25B2.07c.1 | cytoplasmic microtubule [exists_during] interphase WHICH ONE?
[x] SPBC12D12.01.1 | inner plaque of mitotic spindle pole body [exists_during] M phase WHICH ONE
[x] SPBC21.06c.1 | mitotic spindle pole body [exists_during] cellular response to latrunculin B (is this a normal situation)?
@kimrutherford don't do this yet. I will fix these few first.
I think I have done most of what needed doing here. @kimrutherford could you run the script above again and excluding any descendants of "GO:0022403 cell cycle phase" so I can see what is left.
No hurry.
so I can see what is left.
There are quite a few: exists_during_terms_2.txt
Note to self:
select distinct f.uniquename, t.name
from feature_cvterm fc join feature f on f.feature_id = fc.feature_id
join cvterm t on t.cvterm_id = fc.cvterm_id
join cvterm_relationship ext_rel on ext_rel.subject_id = t.cvterm_id
join cvterm ext_rel_type on ext_rel.type_id = ext_rel_type.cvterm_id
join cvterm obj on ext_rel.object_id = obj.cvterm_id
where ext_rel_type.name = 'exists_during'
and obj.cvterm_id not in
(select subject_id from cvtermpath p
join cvterm o on p.object_id = o.cvterm_id
join cvterm pt on p.type_id = pt.cvterm_id
where pt.name = 'is_a' and o.name = 'cell cycle phase')
order by t.name;
I think the original script also excluded "response to" terms. So eyeballing it looks like most are now "phases" or "response to".
To fix to phases, responses This will be more consistent. For example the first one has
Having all as phases would be better. I might leave a few (for instance I allowed phase transitions, since although they are not in the phase branch they describe time points when used as extensions)
[x] SPBP19A11.04c.1 | cell cortex of cell tip [exists_during] mitotic cytokinesis SPBP19A11.04c.1 | cell division site [exists_during] mitotic cytokinesis SPBP19A11.04c.1 | medial cortex [exists_during] septum digestion after cytokinesis
SPAC17G8.12.1 | cell division site [exists_during] actomyosin contractile ring contraction SPAC630.03.1 | cell division site [exists_during] cytokinesis SPAC24B11.11c.1 | cell division site [exists_during] mitotic actomyosin contractile ring contraction SPAC1834.06c.1 | cell division site [exists_during] mitotic cytokinesis SPAC23C4.12.1 | cell division site [exists_during] mitotic cytokinesis SPAC2C4.14c.1 | cell division site [exists_during] mitotic cytokinesis SPAC821.12.1 | cell division site [exists_during] mitotic cytokinesis SPBC17F3.02.1 | cell division site [exists_during] mitotic cytokinesis SPBC3H7.15.1 | cell division site [exists_during] mitotic cytokinesis
SPCC970.04c.1 | cell division site [exists_during] mitotic cytokinesis SPCC645.06c.1 | cell division site [exists_during] mitotic intra-S DNA damage checkpoint signaling SPAC1006.06.1 | cell tip [exists_during] mitotic intra-S DNA damage checkpoint signaling SPAC1952.05.1 | chromatin [coincident_with] MCB [exists_during] mitotic intra-S DNA damage checkpoint signaling SPBC342.06c.1 | chromatin [coincident_with] MCB [exists_during] mitotic intra-S DNA damage checkpoint signaling SPAC30D11.10.1 | chromatin [coincident_with] origin_of_replication [exists_during] replication fork processing SPBC29A10.03c.1 | chromatin [exists_during] DNA replication-dependent nucleosome assembly SPCC16A11.17.1 | chromatin [exists_during] exit from mitosis SPBC106.01.1 | condensed chromosome kinetochore [exists_during] positive regulation of mitotic DNA damage checkpoint SPBC20F10.06.1 | condensed nuclear chromosome kinetochore [exists_during] mitotic spindle assembly checkpoint signaling SPAC1F3.02c.1 | division septum [exists_during] mitotic cytokinesis SPAC1F7.04.1 | division septum [exists_during] mitotic cytokinesis SPBC543.07.1 | division septum [exists_during] mitotic cytokinesis SPBC32F12.04.1 | equatorial microtubule organizing center [exists_during] mitotic telophase [exists_during] mitotic cytokinesis SPBC32F12.04.1 | mating projection tip [exists_during] agglutination involved in conjugation with cellular fusion SPAC110.03.1 | mating projection tip [exists_during] cell morphogenesis involved in conjugation with cellular fusion SPAC16E8.09.1 | mating projection tip [exists_during] cell morphogenesis involved in conjugation with cellular fusion SPAC22H10.07.1 | mating projection tip [exists_during] cell morphogenesis involved in conjugation with cellular fusion SPBC19G7.05c.1 | mating projection tip [exists_during] cell morphogenesis involved in conjugation with cellular fusion SPAC29A4.05.1 | mating projection tip [exists_during] conjugation with cellular fusion SPAC3A12.14.1 | mating projection tip [exists_during] karyogamy involved in conjugation with cellular fusion SPCP1E11.04c.1 | medial cortex [exists_during] cytokinesis SPAC4F10.11.1 | medial cortex [exists_during] mitotic cytokinesis SPAC631.01c.1 | medial cortex [exists_during] mitotic cytokinesis SPBC1289.04c.1 | medial cortex [exists_during] mitotic cytokinesis SPBC1778.06c.1 | medial cortex [exists_during] mitotic cytokinesis
SPAC1093.06c.1 | meiotic spindle astral microtubule [exists_during] karyogamy SPBC11B10.09.1 | meiotic spindle pole body [exists_during] karyogamy involved in conjugation with cellular fusion SPAC16C9.07.1 | mitochondrion [exists_during] conjugation with cellular fusion SPAC1F5.04c.1 | mitotic actomyosin contractile ring [exists_during] mitotic cytokinesis SPAC17G8.10c.1 | mitotic spindle pole body [exists_during] mitotic spindle assembly checkpoint signaling SPAC27D7.03c.1 | nuclear chromosome [exists_during] meiosis I SPAC140.02.1 | nucleolus [exists_during] mitotic cell cycle SPBC36B7.08c.1 | nucleoplasm [exists_during] mitotic cell cycle SPBC3B8.10c.1 | nucleus [exists_during] cell quiescence following G1 arrest due to nitrogen limitation SPAC27D7.03c.1 | nucleus [exists_during] conjugation with cellular fusion SPAC3H8.10.1 | nucleus [exists_during] conjugation with cellular fusion SPBC11B10.09.1 | nucleus [exists_during] karyogamy involved in conjugation with cellular fusion SPAC24B11.06c.1 | nucleus [exists_during] maintenance of stationary phase SPBP8B7.30c.1 | nucleus [exists_during] maintenance of stationary phase SPBP8B7.30c.1 | nucleus [exists_during] meiotic cell cycle SPCC645.07.1 | nucleus [exists_during] mitotic intra-S DNA damage checkpoint signaling SPBC146.13c.1 | plasma membrane [exists_during] dynein-driven meiotic oscillatory nuclear movement SPCC16A11.17.1 | rDNA protrusion [exists_during] exit from mitosis
The results from the SQL I included in the comment (https://github.com/pombase/curation/issues/2535#issuecomment-558481167) includes "response to" terms. Would you like me to exclude them?
No it's OK, I have my list now ;) Assigned back to me.
I've added that. Look out for a log file ending in .exists_during_extensions_not_in_cell_cycle_phase
both queries should not include:
GO:0044848 biological phase (which includes cell cycle phase and "single-celled organism vegetative growth phase") AND "GO:0050896 response to stimulus"
How's this?:
Note to self:
select distinct f.uniquename, t.name
from feature_cvterm fc join feature f on f.feature_id = fc.feature_id
join cvterm t on t.cvterm_id = fc.cvterm_id
join cvterm_relationship ext_rel on ext_rel.subject_id = t.cvterm_id
join cvterm ext_rel_type on ext_rel.type_id = ext_rel_type.cvterm_id
join cvterm obj on ext_rel.object_id = obj.cvterm_id
where ext_rel_type.name = 'exists_during'
and obj.cvterm_id not in
(select subject_id from cvtermpath p
join cvterm o on p.object_id = o.cvterm_id
join cvterm pt on p.type_id = pt.cvterm_id
where pt.name = 'is_a' and (o.name = 'biological phase' or o.name = 'response to stimulus'))
order by t.name;
This looks correct. You can close this. There will be more filters in the future but I will open a new ticket.
I'll close it after I've arranged for it to go into a log file.
I see the log(s).
Currently the 'phases' are filtered, but not the "response to" terms
This should be fixed now.
These logs look good. I may add additional filters (new ticket). FOr now I need to fix quite a free to make them either 'part_of' or 'phase' terms so I will do this first.
I will probably look at the non-phase ones later but will need a reduced list
exists_during.txt