RajLabMSSM / QTL-mapping-pipeline

A snakemake pipeline for QTL mapping. Based on the GTEX pipeline
4 stars 0 forks source link

add group ID and strand columns to expression.bed.gz and splicing.bed.gz #4

Closed jackhump closed 4 years ago

jackhump commented 4 years ago

GTEX pipeline produces BED files formatted for FASTQTL, not QTLtools. The only difference is that FASTQTL BED files have 6 columns before the per-sample columns:

Chromosome ID [string] Start genomic position of the phenotype (here the TSS of gene1) [integer, 0-based] End genomic position of the phenotype (here the TSS of gene1) [integer, 1-based] Phenotype ID (here the exon IDs) [string]. Phenotype group ID (here the gene IDs, multiple exons belong to the same gene) [string] Strand orientation [+/-]

These final two columns must be added in the eqtl_prepare_expression.py and the sqtl_prepare_expression.py

jackhump commented 4 years ago

for both sQTLs and eQTLs, group ID must be gene.