Scroll to navigation

HILIVE(1) User Commands HILIVE(1)

NAME

hilive - Realtime Alignment of Illumina Reads

SYNOPSIS

hilive [options]

DESCRIPTION

HiLive is a read mapping tool that maps Illumina HiSeq (or comparable) reads to a reference genome right in the moment when they are produced. This means, read mapping is finished as soon as the sequencer is finished generating the data.

OPTIONS

REQUIRED OPTIONS:

Illumina's BaseCalls directory which contains the sequence information of the reads.
Path to the HiLive index.
Length and types of the read segments.

Required options might be specified either on the command line or in the config file.

GENERAL OPTIONS:

Print this help message and exit.
Print license information and exit.
Path to a config file. Config file is in .ini format. Duplicate keys are not permitted. Instead, use comma-separated lists. Parameters obtained from the command line are prioritized over settings made in the config file.
bcl-dir=./BaseCalls lanes=1 out-cycle=50,100
Path to Illumina's runInfo.xml file. If specified, read lengths, lane count and tile count are automatically set in accordance with the sequencing run. Parameters obtained from the command line or config file are prioritized over settings obtained from the runInfo.xml.
Continue an interrupted HiLive run from a specified cycle. We strongly recommend to load the config file that was automatically created for the original run to continue with identical settings. This config file (hilive_config.ini) can be found in the temporary directory specified with --temp-dir.

SEQUENCING OPTIONS:

Illumina's BaseCalls directory which contains the sequence information of the reads.
Specify the lanes to be considered for read alignment. [Default: 1-8]
Specify the tiles to be considered for read alignment. [Default: [1-2][1-3][01-16] (96 tiles)]
Specify the highest tile number. The tile numbers will be computed by this number, considering the correct surface count, swath count and tile count for Illumina sequencing. This parameter serves as a shortcut for --tiles.
--max-tile 2216
[1-2][1-2][01-16].
Length and types of the read segments. Each segment is either a read ('R') or a barcode ('B'). Please give the segments in the correct order as they are produced by the sequencing machine. [REQUIRED]
--reads 101R,8B,8B,101R
2x101bp reads and 2x8bp barcodes.
Barcode(s) of the sample(s) to be considered for read alignment. Barcodes must match the barcode length(s) as specified with --reads. Delimit different segments of the same barcodes by '-' and different barcodes by ','. [Default: All barcodes]
-b ACCG-ATTG,ATGT-TGAC
for two different barcodes of length 2x4bp.
ID of the sequencing run. Should be obtained from runInfo.xml.
ID of the flowcell. Should be obtained from runInfo.xml.
ID of the sequencing machine. Should be obtained from runInfo.xml.

REPORT OPTIONS:

Path to the directory that is used for the output files. The directory will be created if it does not exist. [Default: ./out]
Format of the output files. Currently, SAM and BAM format are supported. [Default: BAM]
Cycles for that alignment output is written. The respective temporary files are kept. [Default: write only after the last cycle]
The output mode. [Default: ANYBEST] [ALL|A]: Report all found alignments. [BESTN#|N#]: Report the # best found alignments. [ALLBEST|H]: Report all found alignments with the best score. [ANYBEST|B]: Report one best alignment. [UNIQUE|U]: Report only unique alignments.
Activate reporting unmapped reads. [Default: false]
Activate extended CIGAR format for the alignment output files ('=' for matches and 'X' for mismatches instead of using 'M' for both). [Default: false]
Always sort temporary alignment files before writing output. Existing sorted align files are overwritten. This is only necessary if the temp directory is used more than once for new alignments. In general, this is not recommended for most applications. [Default: false (only sort if no sorted files exist)]
Maximal relative length of the front softclip (only relevant during output) [Default: 0.2]
HiLive uses an approach that requires one exact match of a k-mer at the beginning of an alignment. This can lead to unaligned regions at the beginning of the read which we report as 'softclips'. With this parameter, you can control the maximal length of this region.

ALIGNMENT OPTIONS:

Path to the HiLive index. Please use the executable 'hilive-build' to create a new HiLive index that is delivered with this program. The index consists of several files with the same prefix. Please include the file prefix when specifying the index location.
Alignment mode to balance speed and accuracy [very-fast|fast|balanced|accurate| very-accurate]. This selected mode automatically sets other parameters. Individually configured parameters are prioritized over settings made by selecting an alignment mode. [Default: balanced]
Length of the alignment anchor (or initial seed) [Default: set by the selected alignment mode]
The interval to tolerate more errors during alignment (low=accurate; great=fast). [Default: 'anchor-length'/2]
The interval to create new seeds (low=accurate; great=fast). [Default: 'anchor-length'/2]
The maximum length of a front softclip when creating new seeds. In contrast to --max-softclip-ratio, this parameter may have effects on runtime and mapping accuracy. [Default: 'readlength/2]
The number of errors that are tolerated for the barcode segments. A single value can be provided to be applied for all barcode segments. Alternatively, the value can be set for each segment individually. [Default: 1]
--barcode-errors 2 [2 errors for all barcode segments]
--barcode-errors 2,1 [2 errors for the first, 1 error for the second segment]
Align all barcodes. Reads with a barcode that don't match one of the barcodes specified with '--barcodes' will be reported as undetermined. [Default: false]
Minimum basecall quality for a nucleotide to be considered as a match [Default: 1 (everything but N-calls)]
Keep sequences of invalid reads, i.e. with unconsidered barcode or filtered by the sequencer. This option must be activated to report unmapped reads. [Default: false]

SCORING OPTIONS:

Minimum alignment score. [Default: Set automatically based on the alignment mode and match/mismatch scores]
Score for a match. [Default: 0]
Penalty for a mismatch. [Default: 6]
Penalty for insertion opening. [Default: 5]
Penalty for deletion opening. [Default: 5]
Penalty for deletion extension. [Default: 3]
Maximal permitted consecutive gap length. Increasing this parameter may lead to highly increased runtime! [Default: 3]
Penalty for softclip opening (only relevant during output). [Default: 'mismatch-penalty']
Penalty for softclip extension (only relevant during output). [Default: 'mismatch-penalty'/'anchor-length']

TECHNICAL OPTIONS:

Temporary directory to store the alignment files and hilive_config.ini. [Default: ./temp]
Keep intermediate alignment files for these cycles. The last cycle is always kept. [Default: Keep files of output cycles]
HiLive comes with a separated executable 'hilive-out'. This executable can be used to produce alignment files in SAM or BAM format from existing temporary files. Thus, output can only be created for cycles for that keeping the temporary alignment files is activated. Temporary alignment files are also needed if an interrupted run is continued with the '--continue' parameter.
Keep all intermediate alignment files. This option may lead to huge disk space requirements. [Default: false]
Block size for the alignment input/output stream in Bytes. Append 'K' or 'M' to specify in Kilobytes or Megabytes, respectively. [Default: 64M]
--block-size 1024 [1024 bytes] --block-size 64K [64 Kilobytes] --block-size 64M [64 Megabytes]
Compression of temporary alignment files. [Default: LZ4] 0: no compression. 1: Deflate (smaller). 2: LZ4 (faster).
Number of threads to spawn (including output threads). [Default: 1]
Maximum number of threads to use for output. More threads may be used for output automatically if threads are idle. [Default: 'num-threads'/2]

EXAMPLE

hilive --bcl-dir ./BaseCalls --index ./reference/hg19 --reads 101R

AUTHOR

This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.

October 2019 hilive 2.0a