projectglow / glow

An open-source toolkit for large-scale genomic analysis
https://projectglow.io
Apache License 2.0
263 stars 110 forks source link

notebook continuous integration 11/18 #450

Closed williambrandler closed 2 years ago

williambrandler commented 2 years ago

Signed-off-by: William Brandler William.Brandler@databricks.com

What changes are proposed in this pull request?

use the new log_metadata function for ingest delta to vcf

explicitly set arrow configuration in notebooks for regression steps: spark.conf.set("spark.sql.execution.arrow.maxRecordsPerBatch", 100) (to stop hitting the arrow limit given that each row has a large number of genotypes in an array)

How is this patch tested?

(Details)

codecov[bot] commented 2 years ago

Codecov Report

Merging #450 (805cf7f) into master (a606fbe) will not change coverage. The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #450   +/-   ##
=======================================
  Coverage   93.63%   93.63%           
=======================================
  Files          95       95           
  Lines        4824     4824           
  Branches      466      466           
=======================================
  Hits         4517     4517           
  Misses        307      307           

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update a606fbe...805cf7f. Read the comment docs.