Workflows
What is a Workflow?Filters
workflow-denovo-stacks
These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.
Galaxy Australia: https://usegalaxy.org.au/
Stacks: http://catchenlab.life.illinois.edu/stacks/
Inputs
- demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
- population map in text format
Steps and outputs
ustacks:
- input reads go to ustacks.
- ustacks assembles the reads into matching ...
workflow-ref-guided-stacks
These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.
Galaxy Australia: https://usegalaxy.org.au/
Stacks: http://catchenlab.life.illinois.edu/stacks/
Inputs
- demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
- population map in text format
- reference genome in fasta format
Steps and outputs
BWA MEM 2:
- The reads are mapped to the ...
workflow-qc-of-radseq-reads
These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.
Galaxy Australia: https://usegalaxy.org.au/
Stacks: http://catchenlab.life.illinois.edu/stacks/
Inputs
- demultiplexed reads in fastq format, in a collection
- two adapter sequences in fasta format, for input into cutadapt
Steps and outputs
The workflow can be modified to suit your own parameters.
The workflow steps are:
- Run ...
GermlineShortV_biovalidation
Type: Shell Script
Creators: Georgina Samaha, Tracy Chew, Cali Willet, Nandan Deshpande
Submitter: Georgina Samaha
Shotgun Metagenomics Analysis
Analysis of metagenomic shotgun sequences including assembly, speciation, ARG discovery and more
Description
The input for this analysis is paired end next generation sequencing data from metagenomic samples. The workflow is designed to be modular, so that individual modules can be run depending on the nature of the metagenomics project at hand. More modules will be added as we develop them - this repo is a work in progress!
These scripts have been written ...
Type: Shell Script
Creators: Cali Willet, Rosemarie Sadsad, Tracy Chew, Smitha Sukumar, Elena Martinez, Christina Adler, Henry Lydecker, Fang Wang
Submitter: Tracy Chew
This is a genomics pipeline to do a single germline sample variant-calling, adapted from GATK Best Practice Workflow.
This workflow is a reference pipeline for using the Janis Python framework (pipelines assistant).
- Alignment: bwa-mem
- Variant-Calling: GATK HaplotypeCaller
- Outputs the final variants in the VCF format.
Resources
This pipeline has been tested using the HG38 reference set, available on Google Cloud Storage through:
Type: Janis
Creators: Richard Lupat, Michael Franklin; Jiaan Yu; Juny Kesumadewi
Submitter: Richard Lupat
Combined workflow for large genome assembly
The tutorial document for this workflow is here: https://doi.org/10.5281/zenodo.5655813
What it does: A workflow for genome assembly, containing subworkflows:
- Data QC
- Kmer counting
- Trim and filter reads
- Assembly with Flye
- Assembly polishing
- Assess genome quality
Inputs:
- long reads and short reads in fastq format
- reference genome for Quast
Outputs:
- Data information - QC, kmers
- Filtered, trimmed reads
- Genome assembly, assembly graph, ...
MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
Assess genome quality; can run alone or as part of a combined workflow for large genome assembly.
- What it does: Assesses the quality of the genome assembly: generate some statistics and determine if expected genes are present; align contigs to a reference genome.
- Inputs: polished assembly; reference_genome.fasta (e.g. of a closely-related species, if available).
- Outputs: Busco table of genes found; Quast HTML report, and link to Icarus contigs browser, showing contigs aligned to a reference ...