PharmGKB / PharmCAT

The Pharmacogenomic Clinical Annotation Tool
Mozilla Public License 2.0

Support for various array data in tabular (CSV/TSV/XLSX) format #177

Open whaleyr opened 5 months ago

whaleyr commented 5 months ago

We have received a few requests to support genotype input formats other than VCF, particularly array data from device manufacturers such as Thermo Fisher. This data is usually provided as CSV or TSV. We already read TSV data for other purposes (e.g. outside calls), so this is technically feasible. The problem is that the CSV output of array data is extremely heterogeneous across manufacturers, devices, and software packages.
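
To make the heterogeneity problem concrete, here is a minimal Python sketch of one possible normalization step: pick the delimiter, then map vendor-specific column headers onto a small common schema. Everything in it is an assumption for illustration only. The alias table, the canonical field names (`sample`, `rsid`, `genotype`), and the function names are hypothetical and are not taken from PharmCAT or from any vendor's documentation. It does not handle XLSX.

```python
import csv

# Hypothetical header aliases. Real exports differ by vendor, device,
# and software version, so this table would need one entry per platform.
COLUMN_ALIASES = {
    "sample": {"sample", "sample_id", "sample id"},
    "rsid": {"rsid", "snp", "dbsnp rs id"},
    "genotype": {"genotype", "call", "call codes"},
}


def normalize_header(name):
    """Map a raw column header to a canonical field name, or None if unknown."""
    key = name.strip().lower()
    for canonical, aliases in COLUMN_ALIASES.items():
        if key in aliases:
            return canonical
    return None


def read_array_calls(text):
    """Parse CSV or TSV text into a list of dicts keyed by canonical field names."""
    # Drop blank lines and '#' comment lines that some exports put before the header.
    lines = [ln for ln in text.splitlines() if ln.strip() and not ln.startswith("#")]
    if not lines:
        return []

    # Naive delimiter choice: a tab in the header row means TSV, otherwise CSV.
    delimiter = "\t" if "\t" in lines[0] else ","
    reader = csv.reader(lines, delimiter=delimiter)

    header = [normalize_header(h) for h in next(reader)]
    missing = set(COLUMN_ALIASES) - set(header)
    if missing:
        raise ValueError(f"unrecognized layout, missing columns: {sorted(missing)}")

    records = []
    for row in reader:
        # Keep only the columns that mapped to a canonical field.
        records.append({f: v.strip() for f, v in zip(header, row) if f})
    return records
```

With a table like this, two differently labeled exports of the same call would produce identical records. The cost is that every new platform needs its aliases added by hand, which is why sample files or format documentation are needed.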

We will use this issue to track which software and platforms people are using, and we will start experimenting with ways to support that data.

If you have a platform you want to see supported, please comment so we can add it to the list.

Please provide either documentation of the file format or synthetic/sample output data for us to examine.

Requested platforms: