Open davmlaw opened 1 year ago
http://atlasgeneticsoncology.org/
I took some notes here: https://github.com/SACGF/shariant-admin/issues/142
Sent email summarising results 28 July
We'd need to support ISCN
There is a Python PIP package Git Repo
pip install ISCNSNAKE
Also HGVS uncertainty - need to modify biocommons to support it:
https://github.com/biocommons/hgvs/issues/331 - support uncertain offsets https://github.com/biocommons/hgvs/issues/225 - support uncertain ranges
Hopefully adding some null fields doesn't add that much space, but we should def do a lot of benchmarks to see what the overhead here is making such a large change to Variant table
Cyto have a lot of data. It's currently stored in NxClinical in ISCN format
They use microarray probes that tile the genome, so data has inherit uncertainty - ie between these 2 probes, though sometimes they will export the data to be ISCNs w/o uncertainty, using say midpoint between 2 probes
If we could bring in cyto data, it would be useful for SA Path and Shariant. To do that, we would need to support uncertainty.
I uncertainty is a fundamental part of Variant, ie the same start/end but with different uncertainty is different.
You need to be able to store no uncertainty and unknown (ie ?), maybe magic numbers to avoid another boolean field, ie: