LincolnSteinLab / gdc-viewer

A JBrowse plugin for viewing GDC Data
https://lincolnsteinlab.github.io/gdc-viewer/
MIT License
3 stars 2 forks source link

Use GraphQL to determine some useful metrics regarding index files #91

Closed agduncan94 closed 3 years ago

agduncan94 commented 3 years ago

It would be very useful to know the following: BAM index

VCF index

VCF gzip

Each header should be achievable in a single request. Use the Chrome GraphIQL extension for playing around with these.

agduncan94 commented 3 years ago

Based on a discussion with GDC, all BAM files available through the portal should have index files.

GFJHogue commented 3 years ago

Figured out how to use filtering & aggregations in the GDC GraphQL endpoint. Documentation is sparse, but it's powerful once learned!

BAM index

GFJHogue commented 3 years ago

VCF index

GFJHogue commented 3 years ago

VCF gzip

This was more difficult to query. There is no functionality in the GDC GraphQL from what I could tell that would allow me to examine file-endings for .vcf vs .vcf.gz. The following findings are based on manual exploration of the GDC repository: