Open 1tilly opened 7 years ago
Hi Tobias,
I’ll update the document about how to prepare the phenotype file. Currently, the sample ID column should be named ‘iid’. I’ll also test the 1000genomes data you used and get back to you later.
Di
On Jul 12, 2017, at 10:31 AM, Tobias Tilly notifications@github.com wrote:
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/integrated_call_samples_v3.20130502.ALL.panel ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/integrated_call_samples_v3.20130502.ALL.panel
Hi all, I just tested your project but sadly got an exception[0]. For testing purposes I used the chromosome 9 of the 1000genome project(phase 3) [1] and the corresponding panel[2]. I renamed the vcf and panel for personal organization preferences. I have a spark(v2.1.1) and hdfs(v2.8.0) standalone on my workstation installed. Both work just fine (I'm currently using them via pyspark). Attached you'll find my config[3] and the command[4] I'm running on the cli.
Best, 1Tilly
[0]
[1] ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/ALL.chr9.phase3_shapeit2_mvncall_integrated_v5a.20130502.genotypes.vcf.gz
[2] ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/integrated_call_samples_v3.20130502.ALL.panel
[3]
[4]
./bin/seqspark SingleStudy conf/demo2.conf