Workflows
What is a Workflow?Filters
Somatic-ShortV @ NCI-Gadi is a variant calling pipeline that calls somatic short variants (SNPs and indels) from tumour and matched normal BAM files following GATK's Best Practice Workflow. This workflow is designed for the National Computational Infrastructure's (NCI) Gadi supercompter, leveraging multiple nodes on NCI Gadi to run all stages of the workflow in parallel. ...
Germline-ShortV @ NCI-Gadi is an implementation of the BROAD Institute's best practice workflow for germline short variant discovery. This implementation is optimised for the National Compute Infrastucture's Gadi HPC, utilising scatter-gather parallelism to enable use of multiple nodes with high CPU or memory efficiency. This workflow requires sample BAM files, which can be generated using the Fastq-to-bam @ NCI-Gadi pipeline. Germline-ShortV can be applied ...
Type: Shell Script
Creators: Tracy Chew, Cali Willet, Georgina Samaha, Rosemarie Sadsad
Submitter: Tracy Chew
Bootstrapping-for-BQSR @ NCI-Gadi is a pipeline for bootstrapping a variant resource to enable GATK base quality score recalibration (BQSR) for non-model organisms that lack a publicly available variant resource. This implementation is optimised for the National Compute Infrastucture's Gadi HPC. Multiple rounds of bootstrapping can be performed. Users can use Fastq-to-bam @ NCI-Gadi and Germline-ShortV @ NCI-Gadi to ...
qcif/taxodactyl is a modular, reproducible Nextflow workflow for the conservative taxonomy assignment to DNA sequences, designed for high-confidence, auditable results in biosecurity and biodiversity contexts. The workflow integrates multiple bioinformatics tools and databases, automates best-practice analysis steps, and produces detailed reports with supporting evidence for each taxonomic assignment.
Workflow Overview
The pipeline orchestrates a series of analytical steps, each encapsulated ...
Type: Nextflow
Creators: Magdalena Antczak, Cameron Hyde, Lanxi (Daisy) Li, Valentine Murigneux, Sarah Williams, Michael Thang, Bradley Pease, Shaun Bochow, Grace Sun
Submitter: Magdalena Antczak
A rapid and portable workflow for pond-side sequencing of bacterial pathogens for sustainable aquaculture using ONT long-read sequencing.
Type: Nextflow
Creators: Georgina Samaha, Francisca Samsing, Mitchell O'Brien, Frederick Jaya
Submitter: Georgina Samaha
ONTViSc (ONT-based Viral Screening for Biosecurity)
Introduction
eresearchqut/ontvisc is a Nextflow-based bioinformatics pipeline designed to help diagnostics of viruses and viroid pathogens for biosecurity. It takes fastq files generated from either amplicon or whole-genome sequencing using Oxford Nanopore Technologies as input.
The pipeline can either: 1) perform a direct search on the sequenced reads, 2) generate clusters, 3) assemble the reads to generate longer contigs or 4) directly ...
Type: Nextflow
Creators: Marie-Emilie Gauthier, Craig Windell, Magdalena Antczak, Roberto Barrero
Submitter: Magdalena Antczak
The aim of this workflow is to handle the routine part of shotgun metagenomics data processing. The workflow is using the tools Kraken2 and Bracken for taxonomy classification and the KrakenTools to evaluate diversity metrics. This workflow was tested on Galaxy Australia. A How-to guide for the workflow can be found at: https://github.com/vmurigneu/kraken_howto_ga_workflows/blob/main/pages/taxonomy_kraken2_wf_guide.md
Nextflow Pipeline for DeepVariant
This repository contains a Nextflow pipeline for Google’s DeepVariant, optimised for execution on NCI Gadi.
Quickstart Guide
- Edit the
pipeline_params.yml
file to include:
samples
: a list of samples, where each sample includes the sample name, BAM file path (ensure corresponding .bai is in the same directory), path to an optional regions-of-interest BED file (set to''
if not required), and the model type.ref
: path to the reference FASTA (ensure ...
Post-genome assembly quality control workflow using Quast, BUSCO, Meryl, Merqury and Fasta Statistics, with updates November 2024.
Workflow inputs: reads as fastqsanger.gz (not fastq.gz), and primary assembly.fasta. (To change reads format: click on the pencil icon next to the file in the Galaxy history, then "Datatypes", then set "New type" as fastqsanger.gz). Note: the reads should be those that were used for the assembly (i.e., the filtered/cleaned reads), not the raw reads.
What it does: ...
Type: Galaxy
Creators: Kate Farquharson, Gareth Price, Simon Tang, Anna Syme
Submitters: Johan Gustafsson, Anna Syme
This is part of a series of workflows to annotate a genome, tagged with TSI-annotation
.
These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.
The workflows can be run in this order:
- Repeat masking
- RNAseq QC and read trimming
- Find transcripts
- Combine transcripts
- Extract transcripts
- Convert formats
- Fgenesh annotation
Inputs required: assembled-genome.fasta, hard-repeat-masked-genome.fasta, and (because this workflow maps known mRNA ...