Closed dacort closed 2 years ago
Glow is a popular toolkit for genomics analysis, but running it from PySpark requires packaging both a Python module and its JAR dependencies.
This PR demonstrates how to build the artifacts for EMR Serverless and run a simple Glow Spark job using 1000 Genomes data.
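As a rough illustration of how the two artifacts come together at submit time, the sketch below builds the `jobDriver` payload for an EMR Serverless Spark job. It is not taken from this PR: the function name `build_glow_job_driver`, the S3 paths, and the archive alias `environment` are hypothetical placeholders. It assumes the Python module ships as a packed virtual environment referenced through `spark.archives`, and that the Glow JAR is referenced through `spark.jars`.

```python
def build_glow_job_driver(entry_point, venv_archive, glow_jar):
    """Build a jobDriver dict for an EMR Serverless Spark job (illustrative helper)."""
    # Python interpreter inside the unpacked archive; "environment" is the
    # alias given to the archive after the '#' below (an assumed name).
    python_path = "./environment/bin/python"
    conf = {
        # Packed virtual environment containing the Glow Python module.
        "spark.archives": f"{venv_archive}#environment",
        # JAR file(s) that Glow needs on the Spark classpath.
        "spark.jars": glow_jar,
        # Point the driver and executors at the packaged interpreter.
        "spark.emr-serverless.driverEnv.PYSPARK_DRIVER_PYTHON": python_path,
        "spark.emr-serverless.driverEnv.PYSPARK_PYTHON": python_path,
        "spark.executorEnv.PYSPARK_PYTHON": python_path,
    }
    params = " ".join(f"--conf {key}={value}" for key, value in conf.items())
    return {
        "sparkSubmit": {
            "entryPoint": entry_point,
            "sparkSubmitParameters": params,
        }
    }


# Placeholder S3 locations, for illustration only.
job_driver = build_glow_job_driver(
    entry_point="s3://example-bucket/code/glow_demo.py",
    venv_archive="s3://example-bucket/artifacts/pyspark_glow.tar.gz",
    glow_jar="s3://example-bucket/artifacts/glow.jar",
)
```

The resulting dictionary could then be passed as the `jobDriver` argument to the `start_job_run` call of the boto3 `emr-serverless` client, alongside an application ID and execution role.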