Open cybersiddhu opened 8 years ago
Thanks for the list Sidd! But I already found a strain that doesn't have a parent ID and is not on your list: DBS0235460 http://dictybase.org/db/cgi-bin/dictyBase/phenotype/strain_and_phenotype_details.pl?genotype_id=55 I'll fix this later today, but I wonder if your list is complete?
@rjdodson
More missing on list: DBS0236568 DBS0236312 DBS0236313 DBS0236314
You can check them on test as I'm adding IDs as I come across them, but it's irritating that those are not on the list of 444
It might not be complete, because there is a refresh lag between my dump and production. So, once we get them fixed in production, I will refresh with prod and rerun to see how many of them are left. I suspect I will have to run it a few times before we cover all of them.
OK thanks, we'll start fixing as time allows, and when you send a new list Bob and I will divide it up again to continue
Needs another re-export. The entries need to be verified for multiple parents per child.
generic mappings for parents with plain name and no ID:
AX2 DBS0237699
AX3 DBS0237700
AX4 DBS0237701
KAX3 DBS0237980
NC4 DBS0350120
DH1 DBS0350130
JH10 DBS0350116
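A minimal sketch of how the generic mappings above could be applied in a script. The function name `generic_parent_id` is hypothetical, and the assumption is that the parent field holds nothing but the plain name (apart from surrounding whitespace); the names and IDs themselves are copied from the list above.

```python
# Generic parent mappings, copied from the list in this comment.
GENERIC_PARENTS = {
    "AX2": "DBS0237699",
    "AX3": "DBS0237700",
    "AX4": "DBS0237701",
    "KAX3": "DBS0237980",
    "NC4": "DBS0350120",
    "DH1": "DBS0350130",
    "JH10": "DBS0350116",
}


def generic_parent_id(parent_field):
    """Return the generic strain ID if the parent field is exactly a plain
    name from the table (ignoring surrounding whitespace), else None.

    Hypothetical helper; exact, case-sensitive matching is an assumption.
    """
    return GENERIC_PARENTS.get(parent_field.strip())
```

Anything that returns `None` here would still need a manual look or a different rule.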
I have no time to go through that file above and search for each ID, then search for the parent. It would take a week or more to do that.
But add the generic IDs for those and that should fix a considerable number. A generic strain annotation has all the information of the real ones; the generic strains are just not physical strains in the DSC.
Also, the list above is wrong. I looked at the last ID on the list, DBS0349720, and it has a perfect ID in parentheses in the parent field. IDs in the parent field are ALWAYS in parentheses. Here the parent is: tgrC1-/QS38|tgrC1 (DBS0349718)
So after you map those with parents exactly like in the left column, map them with those IDs and then send a file with the remaining ones and we will see how many there are.
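A rough sketch of how the parenthesized IDs could be pulled out of the parent field before building the list of remaining strains. The function names, the `(strain_id, parent_field)` row shape, and the pattern `DBS` followed by digits are assumptions based only on the IDs quoted in this thread, not on the actual export script.

```python
import re

# Assumption: a parent ID is "DBS" plus digits, wrapped in parentheses,
# as in "tgrC1-/QS38|tgrC1 (DBS0349718)".
PAREN_ID = re.compile(r"\((DBS\d+)\)")


def parent_ids(parent_field):
    """Return every parenthesized DBS ID found in the parent field.

    A list is returned so that a child with several parents keeps all of them.
    """
    return PAREN_ID.findall(parent_field)


def split_resolved(rows):
    """Split (strain_id, parent_field) pairs into those with a parent ID
    in parentheses and those that still need attention."""
    resolved = {}
    remaining = []
    for strain_id, parent_field in rows:
        ids = parent_ids(parent_field)
        if ids:
            resolved[strain_id] = ids
        else:
            remaining.append((strain_id, parent_field))
    return resolved, remaining
```

For the example above, `parent_ids("tgrC1-/QS38|tgrC1 (DBS0349718)")` gives `["DBS0349718"]`, so DBS0349720 would drop off the list of strains without a parent.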
Here is the list. strain_no_parent.txt
Petra will try to fix them.
@pfey03