theiagen / public_health_bioinformatics

Bioinformatics workflows for genomic characterization, submission preparation, and genomic epidemiology of pathogens of public health concern.
GNU General Public License v3.0
33 stars 15 forks source link

[TheiaProk] upgrade mlst docker image to 2024-06-01 staphb build; reduced runtime parameters; enable preemptible #516

Closed kapsakcj closed 1 week ago

kapsakcj commented 1 week ago

This PR closes #510

🗑️ This dev branch should be deleted after merging to main.

:brain: Aim, Context and Functionality

:hammer_and_wrench: Impacted Workflows/Tasks & Changes Being Made

This will affect the behavior of the workflow(s) even if users don’t change any workflow inputs relative to the last version : Yes, if the mlst scheme has changed for the given organism

Running this workflow on different occasions could result in different results, e.g. due to use of a live database, "latest" docker image, or stochastic data processing : No

Impacted workflows:

:clipboard: Workflow/Task Step Changes

🔄 Data Processing

Docker/software or software versions changed: upgraded to copy of StaPH-B's mlst docker image built 2024-06-01 staphb/mlst:2.23.0-2024-06-01. previously used build from 2024-03-11

Databases or database versions changed: mlst database built on 2024-06-01

Data processing/commands changed: N/A

File processing changed: N/A

Compute resources changed: reduced cpu to 1, memory to 2, disk_size to 50; enabled preemptible VMs

➡️ Inputs

⬅️ Outputs

N/A

:test_tube: Testing

Test Dataset

Testing on Terra with 2 A. baummannii samples. No obligate need to test other species, but would be beneficial if the reviewer can test more species

Commandline Testing with MiniWDL or Cromwell (optional)

Tested successfully w miniwdl on a Shigella sample

Terra Testing

Suggested Scenarios for Reviewer to Test

Diverse set of bacterial species if you have the data readily available

Theiagen Version Release Testing (optional)

:microscope: Final Developer Checklist

🎯 Reviewer Checklist

🗂️ Associated Documentation (to be completed by Theiagen developer)

kapsakcj commented 1 week ago

waiting for test wfs to finish running but I expect them to succeed. marking as ready-for-review

michellescribner commented 1 week ago

Tested dataset of 88 bacterial isolates representing diverse taxa (set 3 from GAMBIT publication): https://app.terra.bio/#workspaces/theiagen-validations/Theiagen_Scribner_Sandbox/job_history/18a97207-b6d8-4374-891a-49b61b155240

image