projectglow / glow

An open-source toolkit for large-scale genomic analysis
https://projectglow.io
Apache License 2.0
266 stars 111 forks source link

Oss docker #503

Closed williambrandler closed 2 years ago

williambrandler commented 2 years ago

What changes are proposed in this pull request?

Open source version of the glow docker container that can be run outside of databricks locally, also includes connectors to microsoft azure data lake, google cloud storage, amazon web services s3, snowflake and delta lake

Contributed by @edg1983, with modifications to add the genomics layer into this container, which includes command line tools (e.g. VEP, samtools) alongside R packages (e.g. bigsnpr) and python packages (e.g. qqman)

Relates to https://github.com/projectglow/glow/issues/494 & https://github.com/projectglow/glow/issues/465

How is this patch tested?

(Details)

codecov[bot] commented 2 years ago

Codecov Report

Merging #503 (058532b) into master (56ef6e4) will not change coverage. The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #503   +/-   ##
=======================================
  Coverage   93.66%   93.66%           
=======================================
  Files          95       95           
  Lines        4875     4875           
  Branches      457      457           
=======================================
  Hits         4566     4566           
  Misses        309      309           

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 56ef6e4...058532b. Read the comment docs.

williambrandler commented 2 years ago

going to merge this into Glow, then update once we have a glow release on Spark 3.2