nf-core/bacass
Simple bacterial assembly and annotation pipeline
Define where the pipeline should find input data and save output data.
Path to tab-separated sample sheet
string
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Parameters for QC and trim short-reads
save trimmed files
boolean
save files that failed to pass trimming thresholds ending in *.fail.fastq.gz
boolean
Skip FastQC
boolean
Skip FastP
boolean
Path to Kraken2 database.
string
Path to the Kmerfinder bacteria database (ie: ‘/path_to/kmerfinder/bacteria/ or ‘/path_to/kmerfinder/bacteria.tar.gz’). Only taxonomic group ‘bacteria’ is allowed. See database installation (guidelines)[https://bitbucket.org/genomicepidemiology/kmerfinder_db/src/master/] for details. The latest version can be downloaded from the KmerFinder Database FTP site or directly during pipeline execution with. Example:
--kmerfinder_db 'ftp://ftp.cbs.dtu.dk/public/CGE/databases/KmerFinder/version/latest/bacteria.tar.gz'
(Note: Downloading the KmerFinder database (~30GB) via FTP can be time-consuming. It’s recommended to pre-download the database and provide the *.tar.gz file using this parameter instead of URL site.).
An older version is available on (Zenodo)[https://zenodo.org/records/13447056/files/20190108_kmerfinder_stable_dirs.tar.gz].
string
Reference FASTA file.
string
Reference GFF file.
string
Master file (*.txt) containing a summary of assemblies available in GeneBank or RefSeq. See: https://ftp.ncbi.nlm.nih.gov/genomes/ASSEMBLY_REPORTS/assembly_summary_refseq.txt
string
Parameters for the assembly
The assembler to use for assembly. Use the appropriate assembler according to the chosen assembly_type. Refer to the README.md for further clarification.
string
Which type of assembly to perform.
string
Extra arguments for Unicycler
string
Allowed technologies for long read assembly.
string
This can be used to supply extra options to the Canu assembler. Will be ignored when other assemblers are used.
string
Which assembly polishing method to use.
string
Parameters for the annotation
The annotation method to annotate the final assembly.
string
Extra arguments for prokka annotation tool.
string
Path to Bakta database
string
Download Bakta database
boolean
Specifies a configuration file for the DFAST annotation method.
string
assets/test_config_dfast.py
Skip running Kraken2 classifier on reads.
boolean
Skip annotating the assembly with Prokka /DFAST.
boolean
Skip running PycoQC
on long read input.
boolean
Skip polishing the long-read assembly with fast5 input. Will not affect short/hybrid assemblies.
boolean
Skip MultiQC
boolean
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Custom config file to supply to MultiQC.
string
Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file
string
Custom MultiQC yaml file containing HTML including a methods description.
string
Boolean whether to validate parameters against the schema at runtime
boolean
true
A comma separated string of inputs the schema validation should ignore
string
modules,igenomes_base
Base URL or local path to location of pipeline test dataset files
string
https://raw.githubusercontent.com/nf-core/test-datasets/