DIAMOND(1) | User Commands | DIAMOND(1) |
NAME¶
diamond - accelerated BLAST compatible local sequence aligner
SYNOPSIS¶
diamond COMMAND [OPTIONS]
DESCRIPTION¶
DIAMOND is a sequence aligner for protein and translated DNA
searches
and functions as a drop-in replacement for the NCBI BLAST software
tools. It is suitable for protein-protein search as well as DNA-protein
search on short reads and longer sequences including contigs and
assemblies, providing a speedup of BLAST ranging up to x20,000.
COMMANDS¶
- makedb
- Build DIAMOND database from a FASTA file
- blastp
- Align amino acid query sequences against a protein reference database
- blastx
- Align DNA query sequences against a protein reference database
- view
- View DIAMOND alignment archive (DAA) formatted file
- help
- Produce help message
- version
- Display version information
- getseq
- Retrieve sequences from a DIAMOND database file
OPTIONS¶
General options:¶
- --threads (-p)
- number of CPU threads
- --db (-d)
- database file
- --out (-o)
- output file
- --outfmt (-f)
- output format
- 0 = BLAST pairwise
- 5 = BLAST XML
- 6 = BLAST tabular
- 100 = DIAMOND alignment archive (DAA)
- 101 = SAM
- Value 6 may be followed by a space-separated list of these keywords:
- qseqid means Query Seq - id
- qlen means Query sequence length
- sseqid means Subject Seq - id
- sallseqid means All subject Seq - id(s), separated by a ';'
- slen means Subject sequence length
- qstart means Start of alignment in query
- qend means End of alignment in query
- sstart means Start of alignment in subject
- send means End of alignment in subject
- qseq means Aligned part of query sequence
- sseq means Aligned part of subject sequence
- evalue means Expect value
- bitscore means Bit score
- score means Raw score
- length means Alignment length
- pident means Percentage of identical matches
- nident means Number of identical matches
- mismatch means Number of mismatches
- positive means Number of positive - scoring matches
- gapopen means Number of gap openings
- gaps means Total number of gaps
- ppos means Percentage of positive - scoring matches
- qframe means Query frame
- btop means Blast traceback operations(BTOP)
- stitle means Subject Title
- salltitles means All Subject Title(s), separated by a '<>'
- qcovhsp means Query Coverage Per HSP
- qtitle means Query title
- Default: qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
- --verbose (-v)
- verbose console output
- --log
- enable debug log
- --quiet
- disable console output
Makedb options:¶
- --in
- input reference file in FASTA format
Aligner options:¶
- --query (-q)
- input query file
- --un
- file for unaligned queries
- --unal
- report unaligned queries (0=no, 1=yes)
- --max-target-seqs (-k)
- maximum number of target sequences to report alignments for
- --top
- report alignments within this percentage range of top alignment score (overrides --max-target-seqs)
- --compress
- compression for output files (0=none, 1=gzip)
- --evalue (-e)
- maximum e-value to report alignments
- --min-score
- minimum bit score to report alignments (overrides e-value setting)
- --id
- minimum identity% to report an alignment
- --query-cover
- minimum query cover% to report an alignment
- --subject-cover
- minimum subject cover% to report an alignment
- --sensitive
- enable sensitive mode (default: fast)
- --more-sensitive
- enable more sensitive mode (default: fast)
- --block-size (-b)
- sequence block size in billions of letters (default=2.0)
- --index-chunks (-c)
- number of chunks for index processing
- --tmpdir (-t)
- directory for temporary files
- --gapopen
- gap open penalty (default=11 for protein)
- --gapextend
- gap extension penalty (default=1 for protein)
- --matrix
- score matrix for protein alignment (default=BLOSUM62)
- --custom-matrix
- file containing custom scoring matrix
- --lambda
- lambda parameter for custom matrix
- --K
- K parameter for custom matrix
- --comp-based-stats
- enable composition based statistics (0/1=default)
- --seg
- enable SEG masking of queries (yes/no)
- --query-gencode
- genetic code to use to translate query (see user manual)
- --salltitles
- print full subject titles in output files
- --no-self-hits
- suppress reporting of identical self hits
Advanced options:¶
- --min-orf (-l)
- ignore translated sequences without an open reading frame of at least this length
- --freq-sd
- number of standard deviations for ignoring frequent seeds
- --id2
- minimum number of identities for stage 1 hit
- --window (-w)
- window size for local hit search
- --xdrop (-x)
- xdrop for ungapped alignment
- --ungapped-score
- minimum alignment score to continue local extension
- --hit-band
- band for hit verification
- --hit-score
- minimum score to keep a tentative alignment
- --gapped-xdrop (-X)
- xdrop for gapped alignment in bits
- --band
- band for dynamic programming computation
- --shapes (-s)
- number of seed shapes (0 = all available)
- --shape-mask
- seed shapes
- --index-mode
- index mode (0=4x12, 1=16x9)
- --fetch-size
- trace point fetch size
- --rank-factor
- include subjects within this range of max-target-seqs
- --rank-ratio
- include subjects within this ratio of last hit
- --max-hsps
- maximum number of HSPs per subject sequence to save for each query
- --dbsize
- effective database size (in letters)
- --no-auto-append
- disable auto appending of DAA and DMND file extensions
- --target-fetch-size
- number of target sequences to fetch for seed extension
View options¶
- --daa (-a)
- DIAMOND alignment archive (DAA) file
- --forwardonly
- only show alignments of forward strand
Getseq options¶
- --seq
- Sequence numbers to display.
AUTHOR¶
This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.
January 2017 | diamond 0.8.31 |