table of contents
BIO-RAINBOW(1) | User Commands | BIO-RAINBOW(1) |
NAME¶
bio-rainbow - clustering and assembling short reads for bioinformatics
SYNOPSIS¶
rainbow <cmd> [options]
DESCRIPTION¶
rainbow 2.0.4 -- <ruanjue@gmail.com, chongzechen@gmail.com>
- cluster
Input File Format: paired fasta/fastq file(s) Output File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>
-1 <string> Input fasta/fastq file, supports multiple '-1'
-2 <string> Input fasta/fastq file, supports multiple '-2' [null]
- -l <int>
- Read length, default: 0 variable
- -m <int>
- Maximum mismatches [4]
- -e <int>
- Exactly matching threshold [2000]
- -L
- Low level of polymorphism
- div
Input File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string> Output File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>[\t<pre_cluster_id:int>]
-i <string> Input file [stdin]
-o <string> Output file [stdout]
- -k <int>
- K_allele, min variants to create a new group [2]
- -K <int>
- K_allele, divide regardless of frequency when num of variants exceed this value [50]
- -f <float>
- Frequency, min variant frequency to create a new group [0.2]
- merge
Input File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>[\t<pre_cluster_id:int>]
-i <string> Input rbasm output file [stdin]
- -a
- output assembly
-o <string> Output file for merged contigs, one line per cluster [stdout]
- -N <int>
- Maximum number of divided clusters to merge [300]
- -l <int>
- Minimum overlap when assemble two reads (valid only when '-a' is opened) [5]
- -f <float>
- Minimum fraction of similarity when assembly (valid only when '-a' is opened) [0.90]
- -r <int>
- Minimum number of reads to assemble (valid only when '-a' is opened) [5]
- -R <int>
- Maximum number of reads to assemble (valid only when '-a' is opened) [300]
AUTHOR¶
This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.
July 2018 | bio-rainbow 2.0.4 |