LincolnSteinLab / gdc-viewer

A JBrowse plugin for viewing GDC Data
https://lincolnsteinlab.github.io/gdc-viewer/
MIT License
3 stars 2 forks source link

Investigate MAF files from the GDC #81

Open agduncan94 opened 3 years ago

agduncan94 commented 3 years ago

The GDC has both open and controlled MAF files. See below for more information

Colin made a MAFviewer for JBrowse, is this something we could build upon? Try out the plugin and see if it works for MAF files from the GDC. If it uses a URL, try with the open access MAF files first.

It may be that the MAF files require some processing.

GFJHogue commented 3 years ago

This was more tricky than expected. MAFviewer requires running commands to convert the MAF files, and the available command options don't support GDC's MAF specification: https://docs.gdc.cancer.gov/Data/File_Formats/MAF_Format/

Since MAF is BED-like I can use our BaseBEDLikeFeature and possibly see if we can still use the View class provided by MAFViewer.

GFJHogue commented 3 years ago

Okay, now I realize that a "UCSC MAF" (the type for use with MAFViewer) is Multiple Alignment Format, and a "GDC MAF" is Multiple Annotation Format.

These are 2 different genomics file-types with the same abbreviation & file extension... 🤦‍♂️