projectglow / glow

An open-source toolkit for large-scale genomic analysis
https://projectglow.io
Apache License 2.0
272 stars 111 forks source link

Compatibility Issue: java.lang.NoSuchMethodError with glow.py on PySpark 3.5.1 #724

Open nickzren opened 2 weeks ago

nickzren commented 2 weeks ago

When using glow.py in a Conda environment with the following setup:

Attempting to load a VCF file results in the following error:

java.lang.NoSuchMethodError: 'java.lang.String org.apache.spark.sql.execution.datasources.PartitionedFile.filePath()'

Steps to Reproduce:

  1. Create a Conda environment with Python 3.10.
  2. Install PySpark 3.5.1 and glow.py via pip.
  3. Attempt to load a VCF file using Glow.

Workaround: Downgrading to PySpark 3.3.2 resolves the issue, and the VCF file loads as expected.

kermany commented 2 weeks ago

can you try building the artifacts based on the latest release (2.0.3) and try again?