MetaCerberus Options

Important

If the MetaCerberus environment is not used, make sure the dependencies are in PATH or specified in the config file.

  • Run metacerberus.py with the options required for your project.

Usage of metacerberus.py:

Note

The following are different options/arguments to modify the execution of MetaCerberus.

Setup arguments:

Argument/Option

Function [Default]

Usage Format

Accepted format

Example (Type as one line)

--setup

Setup additional dependencies [False]

--setup

N/A

metacerberus.py --setup

--update

Update downloaded databases [False]

--update

N/A

metacerberus.py --update

--list-db

List available and downloaded databases [False]

--list-db

N/A

metacerberus.py --list-db

--download

Downloads selected HMMs. Use the option --list-db for a list of available databases, default is to download all available databases

--download [DOWNLOAD ...]

--download [.HMM FILE]

--download path/to/example/directory.hmm

--uninstall

Remove downloaded databases and FragGeneScan+ [False]

--uninstall

N/A

metacerberus.py --uninstall

Input File Arguments:

Important

At least one sequence is required.

Accepted formats: [.fastq, .fq, .fasta, .fa, .fna, .ffn, .faa]

Example:

  • metacerberus.py --prodigal file1.fasta

  • metacerberus.py --config file.config

If a sequence is given in [.fastq, .fq] format, one of --nanopore, --illumina, or --pacbio is required.:

Option format interpretation:

  • --setup = accepts no additional options

  • --download DOWNLOAD = accepts one option, (represented by capitalized command ‘DOWNLOAD’)

  • --fraggenescan FRAGGENESCAN [FRAGGENESCAN...] = accepts one or greater options (represented by capitalized commands)

Argument/Option

Function

Usage Format

Accepted format

# Options Accepted

Example (Type as one line)

-c or --config

Path to config file, command line takes priority

-c CONFIG or --config CONFIG

Path to config file

1

metacerberus.py -c path/to/config/file

--prodigal

Prokaryote nucleotide sequence (includes microbes, bacteriophage)

--prodigal PRODIGAL [PRODIGAL ...]

Sequence file

=>1

metacerberus.py --prodigal FILE1 FILE2...

--fraggenescan

Eukaryote nucleotide sequence (includes other viruses, works all around for everything)

--fraggenescan FRAGGENESCAN [FRAGGENESCAN ...]

Sequence file

=>1

metacerberus.py --fraggenescan FILE1 FILE2...

--super

Runs sequence in both –prodigal and –fraggenescan modes

--super SUPER [SUPER ...]

Sequence file

=>1

metacerberus.py --super FILE1 FILE2...

--prodigalgv

Giant virus nucleotide sequence

--prodigalgv PRODIGALGV [PRODIGALGV ...]

Sequence file

=>1

metacerberus.py --prodigalgv FILE1 FILE2...

--phanotate

Phage sequence

--phanotate PHANOTATE [PHANOTATE ...]

Sequence file

=>1

metacerberus.py --phanotate  FILE1 FILE2...

--protein or --amino

Protein Amino Acid sequence

--protein PROTEIN [PROTEIN ...] or --amino PROTEIN [PROTEIN ...]

Sequence file

=>1

metacerberus.py --protein FILE1 FILE2... or metacerberus.py --amino FILE1 FILE2...

--hmmer-tsv

Annotations tsv file from HMMER (experimental)

--hmmer-tsv HMMER_TSV [HMMER_TSV ...]

Sequence file

=>1

metacerberus.py --hmmer-tsv FILE1 FILE2...

--class

path to a tsv file which has class information for the samples. If this file is included, scripts will be included to run Pathview in R

--class CLASS

Path to TSV file

1

metacerberus.py --class TSV_FILE1

--illumina

Specifies that the given FASTQ files are from Illumina

--illumina

N/A

N/A

metacerberus.py --illumina

--nanopore

Specifies that the given FASTQ files are from Nanopore

--nanopore

N/A

N/A

metacerberus.py --nanopore

--pacbio

Specifies that the given FASTQ files are from PacBio

--pacbio

N/A

N/A

metacerberus.py --pacbio

Output options:

Argument/Option

Function [DEFAULT]

Usage Format

Accepted format

# Options Accepted

Example (Type as one line)

--dir-out

path to output directory, defaults to “results-metacerberus” in current directory. [./results-`]

--dir-out DIR_OUT

output file path

1

--dir-out path/to/output/file

--replace

Flag to replace existing files. [False]

--replace

metacerberus.py option

N/A

metacerberus.py --replace

--keep

Flag to keep temporary files. [False]

--keep

metacerberus.py option

N/A

metacerberus.py --keep

--tmpdir

Temp directory for RAY (experimental) [system tmp dir]

--tmpdir TMPDIR

metacerberus.py option

1

metacerberus.py --tmpdir TEMPFILE1

Database options:

Argument/Option

Function [DEFAULT]

Usage Format

Accepted format

# Options Accepted

Example (Type as one line)

--hmm

A list of databases for HMMER. Use the option --list-db for a list of available databases [KOFam_all]

--hmm HMM [HMM ...]

metacerberus.py option

=>1

metacerberus.py --hmm DATABASE1 DATABASE2...

--db-path

Path to folder of databases [Default: under the library path of metacerberus]

--db-path DB_PATH

path to databases folder

1

--db-path path/to/databases/folder

Optional Arguments:

Argument/Option

Function [DEFAULT]

Usage Format

Accepted format

# Options Accepted

Example (Type as one line)

--meta

Metagenomic nucleotide sequences (for prodigal) [False]

--meta

metacerberus.py option

N/A

metacerberus.py --meta

--scaffolds

Sequences are treated as scaffolds [False]

--scaffolds

metacerberus.py option

N/A

metacerberus.py --scaffolds

--minscore

Score cutoff for parsing HMMER results [60]

--minscore MINSCORE

whole integer value

1

metacerberus.py --minscore 50

--evalue

E-value cutoff for parsing HMMER results [1e-09]

--evalue EVALUE

E-value

1

metacerberus.py --evalue [E-value]

--skip-decon

Skip decontamination step. [False]

--skip-decon

metacerberus.py option

N/A

metacerberus.py --skip-decon

--skip-pca

Skip PCA. [False]

--skip-pca

metacerberus.py option

N/A

metacerberus.py --skip-pca

--cpus

Number of CPUs to use per task. System will try to detect available CPUs if not specified [Auto Detect]

--cpus CPUS

whole integer value

1

metacerberus.py --cpus 16

--chunker

Split files into smaller chunks, in Megabytes [Disabled by default]

--chunker CHUNKER

whole integer value

1

metacerberus.py --chunker 300

--grouped

Group multiple fasta files into a single file before processing. When used with –chunker (see above) can improve speed

--grouped

metacerberus.py option

N/A

metacerberus.py --grouped

--version or -v

show the version number and exit

--version or -v

metacerberus.py option

N/A

metacerberus.py --version

-h or --help

show this help message and exit

-h or --help

metacerberus.py option

N/A

metacerberus.py -h

--adapters

FASTA File containing adapter sequences for trimming

--adapters ADAPTERS

FASTA file

1

metacerberus.py --adapters /path/to/FASTA/file

--qc_seq

FASTA File containing control sequences for decontamination

--qc_seq QC_SEQ

FASTA file

1

metacerberus.py --qc_seq /path/to/FASTA/file

Note

Arguments/options that start with -- can also be set in a config file (specified via -c). Config file syntax allows: key=value, flag=true, stuff=[a,b,c] (for details, see syntax. In general, command-line values override config file values which override defaults.