Open niki0211 opened 3 weeks ago
We could add another column with DOIs or PubMed IDs.
Carried over from other issues:
For some samples, we don't have representative genomes from GenBank, only from empop-db. E.g., "B2a1c" has no representative in the GenBank dataset, but it has two in our internal empop-db: "AZMH045, AZMH044".
In such cases we can show: EMP00847;USA
Todo: update "mitoTree_61302_representatives.txt"
The country for OR182493.1 is listed as NA, but its GenBank entry contains: /note="ethnicity:Ashkenazi; origin_locality:Hungary:Tiszaszentimre". This happens because we retrieve the country information from the NCBI FEATURE '/origin' (see documentation).
Adapt the logic to also retrieve origin information from the "origin_locality" field of the NCBI FEATURE "/note".
Additionally, add hyperlinks
Update documentation to include all of these.
All of the above will be included in the new data pipeline.
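A minimal sketch of the "origin_locality" fallback described above, not the pipeline's actual code. The function names `country_from_note` and `resolve_country` are hypothetical, and the sketch assumes the note value is already available as a plain string and that the country comes first in "origin_locality" (as in the OR182493.1 example).

```python
import re

# Captures everything after "origin_locality:" up to the next ";" or quote.
_ORIGIN_LOCALITY = re.compile(r'origin_locality:\s*([^;"]+)')


def country_from_note(note):
    """Return the country part of origin_locality in a /note string, or None."""
    if not note:
        return None
    match = _ORIGIN_LOCALITY.search(note)
    if match is None:
        return None
    # Assumption: the value is ordered "Country:Locality", as in OR182493.1.
    country = match.group(1).split(":", 1)[0].strip()
    return country or None


def resolve_country(origin, note):
    """Prefer the existing origin value; fall back to the /note field."""
    if origin and origin.strip().upper() != "NA":
        return origin.strip()
    return country_from_note(note) or "NA"


if __name__ == "__main__":
    note = "ethnicity:Ashkenazi; origin_locality:Hungary:Tiszaszentimre"
    print(resolve_country("NA", note))
```

Keeping the existing origin value as the first choice means entries that already resolve correctly are unaffected; the note is only consulted when the country would otherwise be NA.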
From feedback: add variant information and download functionality, as in:
https://www.mitomap.org/cgi-bin/haplo_group?data=1&hg=H1a1a1
Keep this in mind and check whether it can be realized. This might need another card titled "Representative Mitogenome Sequences".
Added a file with updated representatives. See commit 67ea0b7.
The file contains header lines marked with "#" for now. These should be deleted when the issue is closed.
Need to add the new profiles to profiles.csv as well.
Also added a file for 1000genomes with commit 45a65f3.
The empop files were sent to @minimops. Maybe you can add them here as well?
Not sure whether we should keep them separate or put them all together.
Add available information about existing publications.