Closed DafniG closed 3 years ago
Hi!
The _get_variants
function in SummaryInputation.py is only to standardize all your variants from the results file. The Attribute error
could be from the version of pyarrow you're using. Could you try running the code again with pyarrow=0.11.1 and numpy=1.18.1 ?
Hi Natasha. Thank you for your fast reply! That seemed to fix the problem! I had to use python 3.6 as on 3.8 i could not install pyarrow=0.11.1, at least not ussing pip.
I ran into the same issue, following the tutorial using the conda env as specified in master: https://github.com/hakyimlab/summary-gwas-imputation/blob/master/src/conda_env.yaml This has pyarrow=0.11.0. Upgrading to 0.11.1 did not fix. numpy=1.18.1
From debugging the script the error manifests on the line: https://github.com/hakyimlab/summary-gwas-imputation/blob/master/src/genomic_tools_lib/file_formats/Parquet.py#L211
which causes exception: AttributeError: 'pyarrow.lib.ChunkedArray' object has no attribute 'name'
A fix for the issue was committed in a separate branch: https://github.com/hakyimlab/summary-gwas-imputation/blob/ImageXcan_changes/src/genomic_tools_lib/file_formats/Parquet.py#L213-L216
The fix calls the '_name' attribute, instead of 'name' when building the dict.
I applied this change to master, and the gwas_summary_imputation.py script now works.
Is the 'ImageXcan_changes' branch more up to date than master? There look to be several other fixes mentioned in the branch.
Hi,
Thank you for the great documentation and scripts! I am struggling to run the gwas_summary_imputation.py script after having harmonised all of my GWAS. I used the standard files you provide to compile the reference parquet files.
Example of a log file:
Command used to run it:
I tried to narrow down where the issue might be and I think where the error is encoutered is at line 252 of SummaryInputation.py:
variants = _get_variants(context, ids)
I was a bit confused how get_variants gets defined and could not troubleshoot further. Any help would be appreciated!I am running the pipeline using python 3.8.3, pyarrow 3.0.0 and numpy 1.20.1.