Clinical-Genomics / scout

VCF visualization interface
https://clinical-genomics.github.io/scout
BSD 3-Clause "New" or "Revised" License
147 stars 43 forks source link

genes in structural variants #2969

Open terestahl opened 2 years ago

terestahl commented 2 years ago

When listing genes included in structural variants what is the order the genes are listed? would it be possible to keep the genomic order instead? this will be much more useful specifically to be able to identify the genes involved in the breakpoints

An example of what I get after downloading using the "Filter and Export" tool from scout

SETX | BARHL1 | MED27 | AK8 | TSC1 | DDX31 | PRRC2B | TTF1 | GFI1B | POMT1 | CFAP77 | SPACA9 | NTNG2 | GTF3C4 | RAPGEF1 | UCK1 | EIF4A1P3 | SNORD62B | SNORD62A | RNU7-21P | RN7SL328P | RNU5D-2P

what is the order used????

this is a link to UCSC if you whant to see the genomic order

thanks

northwestwitch commented 2 years ago

Hi! I think genes from structural variants come in the order they were saved in the database for that variant and it's probably not by coordinates. It should be possible but a bit complicated because gene coordinates are not saved inside the variant document at the moment. Example of gene info saved in the variant:

image

In order to get gene coordinates we should query the database for gene coords for each gene belonging to the variant. Could be done but it will slow down a lot the variants download process for variants which contain many genes.

terestahl commented 2 years ago

Ok and it is not possible to re-sort the database by coordinates each time you add new information into it so the genes appear in the right order?

dnil commented 2 years ago

Genes are normally sorted on hgnc_id, and I would assume it's the same here. I do agree it would be a nice feature to show which are closest to the breakpoints for balanced events. We could probably retrieve any gene hit by a breakpoint with fewer queries, but it wouldn't be exactly the same.

northwestwitch commented 2 years ago

Reordering the genes saved in variants document should eventually be done around here in the code: https://github.com/Clinical-Genomics/scout/blob/54809c217a6ce997a3630dabb199f1b09570ef59/scout/build/variant/variant.py#L278

terestahl commented 2 years ago

ok thanks then sorting after genomic position is not an option? It would be more biological relevant anaway


From: Daniel Nilsson @.> Sent: Tuesday, November 9, 2021 4:42:52 PM To: Clinical-Genomics/scout @.> Cc: Teresita Díaz De Ståhl @.>; Author @.> Subject: Re: [Clinical-Genomics/scout] genes in structural variants (Issue #2969)

Genes are normally sorted on hgnc_id, and I would assume it's the same here. I do agree it would be a nice feature to show which are closest to the breakpoints for balanced events. We could probably retrieve any gene hit by a breakpoint with fewer queries, but it wouldn't be exactly the same.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FClinical-Genomics%2Fscout%2Fissues%2F2969%23issuecomment-964272674&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C1d295b27d96541fbea5008d9a397983d%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637720693754824843%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=G62rljtyAvmjzAJrOE6KTLaOCpDYC7EBtDqVW2e9Nlw%3D&reserved=0, or unsubscribehttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FARWZPH7JONEX6MV5B2EK4CTULE6PZANCNFSM5HVPPWHA&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C1d295b27d96541fbea5008d9a397983d%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637720693754834845%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=z%2B76ejubMP2s6b3NoHWzf8TN3wjKInJJuNmkI3oxxrw%3D&reserved=0. Triage notifications on the go with GitHub Mobile for iOShttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fapps.apple.com%2Fapp%2Fapple-store%2Fid1477376905%3Fct%3Dnotification-email%26mt%3D8%26pt%3D524675&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C1d295b27d96541fbea5008d9a397983d%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637720693754834845%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=ysR0oa1W1gf0zBfKp1WcnjRyQYe8hUHbslCQhmBb2Qg%3D&reserved=0 or Androidhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.github.android%26referrer%3Dutm_campaign%253Dnotification-email%2526utm_medium%253Demail%2526utm_source%253Dgithub&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C1d295b27d96541fbea5008d9a397983d%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637720693754844836%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=51aJrUfSL37l62jAYHa9x%2Fv7k7O524alOZxEI5yUSm8%3D&reserved=0.

När du skickar e-post till Karolinska Institutet (KI) innebär detta att KI kommer att behandla dina personuppgifter. Här finns information om hur KI behandlar personuppgifterhttps://ki.se/medarbetare/integritetsskyddspolicy.

Sending email to Karolinska Institutet (KI) will result in KI processing your personal data. You can read more about KI’s processing of personal data herehttps://ki.se/en/staff/data-protection-policy.

northwestwitch commented 2 years ago

We are working on some other things at the moment (previous customer requests and maintenance stuff). Let's leave this suggestion open so we can work on it when we have some time and especially if other users are interested in this feature.

terestahl commented 2 years ago

😃👍


From: Chiara Rasi @.> Sent: Wednesday, November 10, 2021 8:44:15 AM To: Clinical-Genomics/scout @.> Cc: Teresita Díaz De Ståhl @.>; Author @.> Subject: Re: [Clinical-Genomics/scout] genes in structural variants (Issue #2969)

We are working on some other things at the moment (previous customer requests and maintenance stuff). Let's leave this suggestion open so we can work on it when we have some time and especially if other users are interested to this features.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FClinical-Genomics%2Fscout%2Fissues%2F2969%23issuecomment-964862547&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C93fe44c1d9b3405ff31208d9a41de655%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637721270598903059%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=ppeAhJZVOYVCGAzH0nMCnEOGmaOhgNW4kAGtrHnTn2E%3D&reserved=0, or unsubscribehttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FARWZPHZTVB7PSJ3ZOU54JZLULIPE7ANCNFSM5HVPPWHA&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C93fe44c1d9b3405ff31208d9a41de655%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637721270598913053%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=KwY93MXTrr9%2FkpCAJSWaAWFqveqvaPcwya0h4Un%2Bl4U%3D&reserved=0. Triage notifications on the go with GitHub Mobile for iOShttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fapps.apple.com%2Fapp%2Fapple-store%2Fid1477376905%3Fct%3Dnotification-email%26mt%3D8%26pt%3D524675&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C93fe44c1d9b3405ff31208d9a41de655%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637721270598913053%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=LIJqySw55edctDChTRbehUpPYC7C2VVz%2FOmtSLzevy8%3D&reserved=0 or Androidhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.github.android%26referrer%3Dutm_campaign%253Dnotification-email%2526utm_medium%253Demail%2526utm_source%253Dgithub&data=04%7C01%7Cteresita.diaz.de.stahl%40ki.se%7C93fe44c1d9b3405ff31208d9a41de655%7Cbff7eef1cf4b4f32be3da1dda043c05d%7C0%7C0%7C637721270598923049%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=6NTnPnXMyFU5txP06TVhe%2B6SzMzg19e3AvvWZuGwjI0%3D&reserved=0.

När du skickar e-post till Karolinska Institutet (KI) innebär detta att KI kommer att behandla dina personuppgifter. Här finns information om hur KI behandlar personuppgifterhttps://ki.se/medarbetare/integritetsskyddspolicy.

Sending email to Karolinska Institutet (KI) will result in KI processing your personal data. You can read more about KI’s processing of personal data herehttps://ki.se/en/staff/data-protection-policy.