table of contents
other versions
- bookworm 0.11.1-1+b1
- testing 0.12.0-1+b1
- unstable 0.12.0-1+b2
- experimental 0.12.0-2+b1
SHASTA(1) | User Commands | SHASTA(1) |
NAME¶
shasta - nanopore whole genome assembly tool
DESCRIPTION¶
Options allowed only on the command line:¶
- -h [ --help ]
- Write a help message.
- -v [ --version ]
- Identify the Shasta version.
- --config arg
- Configuration file name.
- --input arg
- Names of input files containing reads.Specify at least one.
- --assemblyDirectory arg (=ShastaRun)
- Name of the output directory. Ifcommand is assemble, this directorymust not exist.
- --command arg (=assemble)
- Command to run. Must be one of:assemble, saveBinaryData,cleanupBinaryData, explore,createBashCompletionScript
- --memoryMode arg (=anonymous)
- Specify whether allocated memory isanonymous or backed by a filesystem.Allowed values: anonymous, filesystem.
- --memoryBacking arg (=4K)
- Specify the type of pages used to backmemory.Allowed values: disk, 4K , 2M (for bestperformance). All combinations(memoryMode, memoryBacking) are allowedexcept for (anonymous, disk).Some combinations require rootprivilege, which is obtained using sudoand may result in a password promptingdepending on your sudo set up.
- --threads arg (=0)
- Number of threads, or 0 to use onethread per virtual processor.
- --exploreAccess arg (=user)
- Specify allowed access for --command explore. Allowed values: user, local, unrestricted. DO NOT CHANGE FROM DEFAULT VALUE WITHOUT UNDERSTANDING THE SECURITY IMPLICATIONS.
- --port arg (=17100)
- Port to be used by the http server(command --explore).
Options allowed on the command line and in the config file:¶
- --Reads.minReadLength arg (=10000)
- Read length cutoff. Shorter reads arediscarded.
- --Reads.noCache
- If set, skip the Linux cache whenloading reads. This is done byspecifying the O_DIRECT flag whenopening input files containing reads.
- --Reads.palindromicReads.skipFlagging
- Skip flagging palindromic reads. OxfordNanopore reads should be flagged forbetter results.
- --Reads.palindromicReads.maxSkip arg (=100)
- Used for palindromic read detection.
- --Reads.palindromicReads.maxDrift arg (=100)
- Used for palindromic read detection.
- --Reads.palindromicReads.maxMarkerFrequency arg (=10)
- Used for palindromic read detection.
- --Reads.palindromicReads.alignedFractionThreshold arg (=0.1)
- Used for palindromic read detection.
- --Reads.palindromicReads.nearDiagonalFractionThreshold arg (=0.1)
- Used for palindromic read detection.
- --Reads.palindromicReads.deltaThreshold arg (=100)
- Used for palindromic read detection.
- --Kmers.generationMethod arg (=0)
- Method to generate marker k-mers: 0 =random, 1 = random, excluding globallyoverenriched,2 = random, excludingoverenriched even in a single read,3 =read from file.
- --Kmers.k arg (=10)
- Length of marker k-mers (in run-lengthspace).
- --Kmers.probability arg (=0.1)
- Fraction k-mers used as a marker.
- --Kmers.enrichmentThreshold arg (=100.)
- Enrichment threshold forKmers.generationMethod 1 and 2.
- --Kmers.file arg
- The absolute path of a file containingthe k-mers to be used as markers, oneper line. A relative path is notaccepted. Only used ifKmers.generationMethod is 3.
- --MinHash.version arg (=0)
- Controls the version of the LowHashalgorithm to use. Can be 0 (default) or1.(experimental).
- --MinHash.m arg (=4)
- The number of consecutive markers thatdefine a MinHash/LowHash feature.
- --MinHash.hashFraction arg (=0.01)
- Defines how low a hash has to be to beused with the LowHash algorithm.
- --MinHash.minHashIterationCount arg (=10)
- The number of MinHash/LowHashiterations, or 0 to let--MinHash.alignmentCandidatesPerRead control the number of iterations.
- --MinHash.alignmentCandidatesPerRead arg (=20)
- If --MinHash.minHashIterationCount is 0, MinHash iteration is stopped when the average number of alignment candidates that each read is involved in reaches this value. If --MinHash.minHashIterationCount is not 0, this is not used.
- --MinHash.minBucketSize arg (=0)
- The minimum bucket size to be used bythe LowHash algorithm.
- --MinHash.maxBucketSize arg (=10)
- The maximum bucket size to be used bythe LowHash algorithm.
- --MinHash.minFrequency arg (=2)
- The minimum number of times a pair ofreads must be found by theMinHash/LowHash algorithm in order tobe considered a candidate alignment.
- --MinHash.allPairs
- Skip the MinHash algorithm and mark allpairs of reads as alignmentcandidateswith both orientation. This should onlybe used for experimentation on verysmall runs because it is very timeconsuming.
- --Align.alignMethod arg (=3)
- The alignment method to be used tocreate the read graph & the markergraph. 0 = old Shasta method, 1 = SeqAn(slow), 3 = banded SeqAn.
- --Align.maxSkip arg (=30)
- The maximum number of markers that analignment is allowed to skip.
- --Align.maxDrift arg (=30)
- The maximum amount of marker drift thatan alignment is allowed to toleratebetween successive markers.
- --Align.maxTrim arg (=30)
- The maximum number of unaligned markerstolerated at the beginning and end ofan alignment.
- --Align.maxMarkerFrequency arg (=10)
- Marker frequency threshold. Markersmore frequent than this value in eitherof two oriented reads being aligned arediscarded and not used to compute thealignment.
- --Align.minAlignedMarkerCount arg (=100)
- The minimum number of aligned markersfor an alignment to be used.
- --Align.minAlignedFraction arg (=0)
- The minimum fraction of aligned markersfor an alignment to be used.
- --Align.matchScore arg (=6)
- Match score for marker alignments (onlyused for alignment methods 1 and 3).
- --Align.mismatchScore arg (=-1)
- Mismatch score for marker alignments(only used for alignment methods 1 and3).
- --Align.gapScore arg (=-1)
- Gap score for marker alignments (onlyused for alignment methods 1 and 3).
- --Align.downsamplingFactor arg (=0.10000000000000001)
- Downsampling factor (only used foralignment method 3).
- --Align.bandExtend arg (=10)
- Amount to extend the downsampled band(only used for alignment method 3).
- --Align.sameChannelReadAlignment.suppressDeltaThreshold arg (=0)
- If not zero, alignments between readsfrom the same nanopore channel andclose in time are suppressed. The"read" meta data fields from the FASTAor FASTQ header are checked. If theirdifference, in absolute value, is lessthan the value of this option, thealignment is suppressed. This can helpavoid assembly artifact. This check isonly done if the two reads haveidentical meta data fields "runid","sampleid", and "ch". If any of thesemeta data fields are missing, thischeck is suppressed and this option hasno effect.
- --Align.suppressContainments
- Suppress containment alignments, thatis alignments in which one read isentirely contained in another read,except possibly for up to maxTrimmarkers at the beginning and end.
- --ReadGraph.creationMethod arg (=0)
- The method used to create the readgraph (0 = undirected, default, 1 =directed, experimental).
- --ReadGraph.maxAlignmentCount arg (=6)
- The maximum number of alignments to bekept for each read.
- --ReadGraph.minComponentSize arg (=100)
- The minimum size (number of orientedreads) of a connected component of theread graph to be kept. This iscurrently ignored.
- --ReadGraph.maxChimericReadDistance arg (=2)
- Used for chimeric read detection.
- --ReadGraph.crossStrandMaxDistance arg (=6)
- Maximum distance (edges) forflagCrossStrandReadGraphEdges. Set thisto zero to entirely suppressflagCrossStrandReadGraphEdges.
- --ReadGraph.containedNeighborCount arg (=6)
- Maximum number of alignments to be keptfor each contained read (only used whencreationMethod is 1).
- --ReadGraph.uncontainedNeighborCountPerDirection arg (=3)
- Maximum number of alignments to be keptin each direction (forward, backward)for each uncontained read (only usedwhen creationMethod is 1).
- --ReadGraph.removeConflicts
- Remove conflicts from the read graph.Experimental - do not use.
- --MarkerGraph.minCoverage arg (=10)
- Minimum number of markers for a markergraph vertex.
- --MarkerGraph.maxCoverage arg (=100)
- Maximum number of markers for a markergraph vertex.
- --MarkerGraph.lowCoverageThreshold arg (=0)
- Used during approximate transitivereduction. Marker graph edges withcoverage lower than this value arealways marked as removed regardless ofreachability.
- --MarkerGraph.highCoverageThreshold arg (=256)
- Used during approximate transitivereduction. Marker graph edges withcoverage higher than this value arenever marked as removed regardless ofreachability.
- --MarkerGraph.maxDistance arg (=30)
- Used during approximate transitivereduction.
- --MarkerGraph.edgeMarkerSkipThreshold arg (=100)
- Used during approximate transitivereduction.
- --MarkerGraph.pruneIterationCount arg (=6)
- Number of prune iterations.
- --MarkerGraph.simplifyMaxLength arg (=10,100,1000)
- Maximum lengths (in markers) used ateach iteration of simplifyMarkerGraph.
- --MarkerGraph.crossEdgeCoverageThreshold arg (=0)
- Experimental. Cross edge coveragethreshold. If this is not zero,assembly graph cross-edges with averageedge coverage less than this value areremoved, together with thecorresponding marker graph edges. Across edge is defined as an edge v0->v1with out-degree(v0)>1, in-degree(v1)>1.
- --MarkerGraph.refineThreshold arg (=0)
- Experimental. Length threshold, inmarkers, for the marker graphrefinement step, or 0 to turn off therefinement step.
- --MarkerGraph.reverseTransitiveReduction
- Perform approximate reverse transitivereduction of the marker graph.
- --Assembly.crossEdgeCoverageThreshold arg (=3)
- Maximum average edge coverage for across edge of the assembly graph to beremoved.
- --Assembly.markerGraphEdgeLengthThresholdForConsensus arg (=1000)
- Controls assembly of long marker graphedges.
- --Assembly.consensusCaller arg (=Bayesian:guppy-2.3.5-a)
- Selects the consensus caller for repeatcounts. See the documentation foravailable choices.
- --Assembly.storeCoverageData
- Used to request storing coverage datain binary format.
- --Assembly.storeCoverageDataCsvLengthThreshold arg (=0)
- Used to specify the minimum length ofan assembled segment for which coveragedata in csv format should be stored. If0, no coverage data in csv format isstored.
- --Assembly.writeReadsByAssembledSegment
- Used to request writing the reads thatcontributed to assembling each segment.
- --Assembly.detangleMethod arg (=0)
- Experimental. Specify the method usedto detangle the assembly graph. 0 = nodetangling, 1 = basic detangling.
August 2020 | shasta |