nf-core/isoseq
Genome annotation with PacBio Iso-Seq. Takes raw subreads as input, generate Full Length Non Chemiric (FLNC) sequences and produce a bed annotation.
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the samples in the experiment.
string
^\S+\.csv$
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Fasta file of primers sequences
string
Genome annotation file
string
Run complete pipeline or TAMA only?
string
If entrypoint ‘isoseq’, ccs —chunk option, define the number of batches to run in parallel. If entrypoint ‘map’, split fasta in files of ‘chunk’ sequences.
integer
40
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Use these options to set CCS parameters
ccs —rq option, define the minimum read quality for CCS selection
number
0.9
ccs —min-passes option, define the minimum number of passes to select a CCS
integer
3
ccs —min-snr option, minimum SNR of subreads to use for generating CCS
number
2.5
ccs —min-length option, minimum CCS length for CCS selection
integer
10
ccs —max-length option, maximum CCS length for CCS selection
integer
50000
ccs —top-passes option, maximum number of passes to use for CCS generation
integer
60
Aligner selection
Aligner to use for mapping: minimap2 or ultra
string
Use these options to set TAMA collapse and TAMA merge parameters
TAMA collapse: Capped RNA?
boolean
TAMA collapse: 5 prime wobble threshold
integer
100
TAMA collapse: Splice junction / exon wobble threshold
integer
10
TAMA collapse: 3 prime wobble threshold
integer
100
Reference genome related files and options required for the workflow.
Name of iGenomes reference.
string
Path to FASTA genome file.
string
^\S+\.fn?a(sta)?(\.gz)?$
Do not load the iGenomes reference config.
boolean
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Maximum amount of memory that can be requested for any single job.
string
128.GB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Maximum amount of time that can be requested for any single job.
string
240.h
^(\d+\.?\s*(s|m|h|d|day)\s*)+$
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Custom config file to supply to MultiQC.
string
Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file
string
Custom MultiQC yaml file containing HTML including a methods description.
string
Boolean whether to validate parameters against the schema at runtime
boolean
true
Show all params when using --help
boolean
Validation of parameters fails when an unrecognised parameter is found.
boolean
Validation of parameters in lenient more.
boolean
Base URL or local path to location of pipeline test dataset files
string
https://raw.githubusercontent.com/nf-core/test-datasets/