atlasxomics / clean_latch

Latch.bio workflow for remediating lane artifacts in spatial ATAC-seq experiments
MIT License
0 stars 0 forks source link

clean

data



clean is a latch.bio workflow for remediating microfluidic artifacts in spatial epigenomic data generated via DBiT-seq. Provided outputs from a single-cell ATAC-seq preprocessing and alignment workflow (ie. Chromap, Cell Ranger ATAC), and a 'spatial' folder from our AtlasXBrowser app, clean returns a fragments.tsv.gz without artifacts and summary statistics from the cleaning.

The workflow identifies outlier rows and columns in which fragment counts are above x (user provided variable) standard deviations of the mean row or column fragment counts. Fragments of tixels in outlier rows/columns are randomly downsampled to the average fragment count of rows/columns. The 'cleaned' fragment table is then sorted for continuous chromosome blocks and compressed with gbzip. The cleaned fragment.tsv.gz file can be used for analysis with ArchR, Seurat, and other scATAC-seq packages.

Inputs

All input files for clean must be on the latch.bio file system.

Running the workflow

The clean workflow can be found in the Workflows module in your latch.bio workspace. For access to an ATX-collaborator workspace, please contact your AtlasXomics Support Scientist or email support@atlasxomics.com. See here for general instructions for running workflows in latch.bio.

  1. Navigate to the clean workflow in the Workflows module in your latch.bio workspace. Ensure you are on the 'Parameters' tab of the workflow.
first_wf
  1. Add values to the input parameters fields, according to the descriptions provided above.
first_wf
  1. Click the 'Launch Workflow' button on the bottom-right of the parameters page. This will automatically navigate you to the Executions tab of the workflow.

  2. From the Executions tab, you can view the status of the launched workflow. Once the workflow has completed running, the status will change to 'Succeeded'; if the workflow has the status 'Failed', please contact an AtlasXomics Support Scientist. You can click on the workflow execution to view a more granular workflow status and see output logs.

  3. Workflow outputs are loaded into the latch.bio data module in the cleaned directory.

Outputs

Outputs from clean are loaded into latch.bio data module in the cleaned directory.

Next Steps

Cleaned fragment files can be used a input in downstream analysis (ArchR, Signac, Seuratm etc.). Analysis can be performed locally or in a latch.bio Pod. For access to ATX-specific Pods, please contact your AtlasXomics Support Scientist.

Further analysis can also be performed in latch.bio with the optimize archr (returns QC data and tests various input parameters on ArchR clustering), create ArchRProject (returns ArchRProject with peak and motif calling) and atlasShiny (returns inputs for the ATX ATAC-seq R Shiny App). For access to these workflows, please contact your AtlasXomics Support Scientist.

Support

Questions? Comments? Contact support@atlasxomics.com or post in AtlasXomics Discord.