dictyBase / Migration

Entrypoint for dictybase overhaul project
0 stars 0 forks source link

Import parent entries for strains #58

Open cybersiddhu opened 8 years ago

cybersiddhu commented 8 years ago

Here is the list. strain_no_parent.txt

Petra will try to fix them.

@pfey03

55

31

pfey03 commented 8 years ago

Thanks for the list Sidd! But I found already a strain that doesn't have a parent Id but is not in your list: DBS0235460 http://dictybase.org/db/cgi-bin/dictyBase/phenotype/strain_and_phenotype_details.pl?genotype_id=55 I'll fix this later today, but wonder if your list is complete?

@rjdodson

pfey03 commented 8 years ago

More missing on list: DBS0236568 DBS0236312 DBS0236313 DBS0236314

You can check them on test as I'm adding IDs as I come across them, but it's irritating those are not on list of 444

cybersiddhu commented 8 years ago

It might not be complete, because there is a refresh lag between my dump and production. So, once we get them fixed in production, i will refresh with prod and rerun to see how many of them are left. I suspect, i have to run it few times before we cover all of them.

pfey03 commented 8 years ago

ok thanks, we start fixing as time allows and when you send new list Bob and I divide it up again to continue

cybersiddhu commented 5 years ago

Needs another re-export. The entries needs to be verified for multiple parents per child.

pfey03 commented 4 years ago

generic mappings for parents with plain name and no ID:

AX2 DBS0237699 AX3 DBS0237700 AX4 DBS0237701 KAX3 DBS0237980 NC4 DBS0350120 DH1 DBS0350130 JH10 DBS0350116

pfey03 commented 4 years ago

I have no time to go through that file above and search ID, then search for the parent it will take a week or more to do that

pfey03 commented 4 years ago

but add the generic of those and that should fix a considerate amount and a generic strain annotation has all the information of the real ones, but are not physical strains in the DSC

pfey03 commented 4 years ago

also, the list above is wrong. Looked at last ID on list DBS0349720 and this has a perfect ID in parenthesis in parent field IDs in parent field are ALWAYS in parenthesis. here the parent is: tgrC1-/QS38|tgrC1 (DBS0349718)

pfey03 commented 4 years ago

so after you map those with parents exactly like in the left column, map them with those IDs and then send a file with remaining and we will se how many those are