.\" Automatically generated by Pod::Man 4.11 (Pod::Simple 3.35) .\" .\" Standard preamble: .\" ======================================================================== .de Sp \" Vertical space (when we can't use .PP) .if t .sp .5v .if n .sp .. .de Vb \" Begin verbatim text .ft CW .nf .ne \\$1 .. .de Ve \" End verbatim text .ft R .fi .. .\" Set up some character translations and predefined strings. \*(-- will .\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left .\" double quote, and \*(R" will give a right double quote. \*(C+ will .\" give a nicer C++. Capital omega is used to do unbreakable dashes and .\" therefore won't be available. \*(C` and \*(C' expand to `' in nroff, .\" nothing in troff, for use with C<>. .tr \(*W- .ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p' .ie n \{\ . ds -- \(*W- . ds PI pi . if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch . if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\" diablo 12 pitch . ds L" "" . ds R" "" . ds C` "" . ds C' "" 'br\} .el\{\ . ds -- \|\(em\| . ds PI \(*p . ds L" `` . ds R" '' . ds C` . ds C' 'br\} .\" .\" Escape single quotes in literal strings from groff's Unicode transform. .ie \n(.g .ds Aq \(aq .el .ds Aq ' .\" .\" If the F register is >0, we'll generate index entries on stderr for .\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index .\" entries marked with X<> in POD. Of course, you'll have to process the .\" output yourself in some meaningful fashion. .\" .\" Avoid warning from groff about undefined register 'F'. .de IX .. .nr rF 0 .if \n(.g .if rF .nr rF 1 .if (\n(rF:(\n(.g==0)) \{\ . if \nF \{\ . de IX . tm Index:\\$1\t\\n%\t"\\$2" .. . if !\nF==2 \{\ . nr % 0 . nr F 2 . \} . \} .\} .rr rF .\" .\" Accent mark definitions (@(#)ms.acc 1.5 88/02/08 SMI; from UCB 4.2). .\" Fear. Run. Save yourself. No user-serviceable parts. . \" fudge factors for nroff and troff .if n \{\ . ds #H 0 . ds #V .8m . ds #F .3m . ds #[ \f1 . ds #] \fP .\} .if t \{\ . ds #H ((1u-(\\\\n(.fu%2u))*.13m) . ds #V .6m . ds #F 0 . ds #[ \& . ds #] \& .\} . \" simple accents for nroff and troff .if n \{\ . ds ' \& . ds ` \& . ds ^ \& . ds , \& . ds ~ ~ . ds / .\} .if t \{\ . ds ' \\k:\h'-(\\n(.wu*8/10-\*(#H)'\'\h"|\\n:u" . ds ` \\k:\h'-(\\n(.wu*8/10-\*(#H)'\`\h'|\\n:u' . ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'^\h'|\\n:u' . ds , \\k:\h'-(\\n(.wu*8/10)',\h'|\\n:u' . ds ~ \\k:\h'-(\\n(.wu-\*(#H-.1m)'~\h'|\\n:u' . ds / \\k:\h'-(\\n(.wu*8/10-\*(#H)'\z\(sl\h'|\\n:u' .\} . \" troff and (daisy-wheel) nroff accents .ds : \\k:\h'-(\\n(.wu*8/10-\*(#H+.1m+\*(#F)'\v'-\*(#V'\z.\h'.2m+\*(#F'.\h'|\\n:u'\v'\*(#V' .ds 8 \h'\*(#H'\(*b\h'-\*(#H' .ds o \\k:\h'-(\\n(.wu+\w'\(de'u-\*(#H)/2u'\v'-.3n'\*(#[\z\(de\v'.3n'\h'|\\n:u'\*(#] .ds d- \h'\*(#H'\(pd\h'-\w'~'u'\v'-.25m'\f2\(hy\fP\v'.25m'\h'-\*(#H' .ds D- D\\k:\h'-\w'D'u'\v'-.11m'\z\(hy\v'.11m'\h'|\\n:u' .ds th \*(#[\v'.3m'\s+1I\s-1\v'-.3m'\h'-(\w'I'u*2/3)'\s-1o\s+1\*(#] .ds Th \*(#[\s+2I\s-2\h'-\w'I'u*3/5'\v'-.3m'o\v'.3m'\*(#] .ds ae a\h'-(\w'a'u*4/10)'e .ds Ae A\h'-(\w'A'u*4/10)'E . \" corrections for vroff .if v .ds ~ \\k:\h'-(\\n(.wu*9/10-\*(#H)'\s-2\u~\d\s+2\h'|\\n:u' .if v .ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'\v'-.4m'^\v'.4m'\h'|\\n:u' . \" for low resolution devices (crt and lpr) .if \n(.H>23 .if \n(.V>19 \ \{\ . ds : e . ds 8 ss . ds o a . ds d- d\h'-1'\(ga . ds D- D\h'-1'\(hy . ds th \o'bp' . ds Th \o'LP' . ds ae ae . ds Ae AE .\} .rm #[ #] #H #V #F C .\" ======================================================================== .\" .IX Title "DEBIAN/SCORE_CONSERVATION 1" .TH DEBIAN/SCORE_CONSERVATION 1 "2019-12-15" "20110309.0" "User Commands" .\" For nroff, turn off justification. Always turn off hyphenation; it makes .\" way too many mistakes in technical documents. .if n .ad l .nh .SH "NAME" score_conservation \- score protein sequence conservation .SH "SYNOPSIS" .IX Header "SYNOPSIS" score_conservation [options] \s-1ALIGNFILE\s0 .SH "DESCRIPTION" .IX Header "DESCRIPTION" Score protein sequence conservation in \fB\s-1ALIGNFILE\s0\fR. \fB\s-1ALIGNFILE\s0\fR must be in \s-1FASTA, CLUSTAL\s0 or Stockholm format. .PP The following conservation scoring methods are implemented: * sum of pairs * weighted sum of pairs * Shannon entropy * Shannon entropy with property groupings (Mirny and Shakhnovich 1995, Valdar and Thornton 2001) * relative entropy with property groupings (Williamson 1995) * von Neumann entropy (Caffrey et al 2004) * relative entropy (Samudrala and Wang 2006) * Jensen-Shannon divergence (Capra and Singh 2007) .PP A window-based extension that incorporates the estimated conservation of sequentially adjacent residues into the score for each column is also given. This window approach can be applied to any of the conservation scoring methods. .PP With default parameters \fBscore_conservation\fR\|(1) computes the conservation scores for the alignment using the Jensen-Shannon divergence and a window \fB\-w\fR of \fI3\fR. .PP The sequence-specific output can be used as the conservation input for \&\fBconcavity\fR\|(1). .PP Conservation is highly predictive in identifying catalytic sites and residues near bound ligands. .SH "REFERENCES" .IX Header "REFERENCES" .IP "Capra \s-1JA\s0 and Singh M. Predicting functionally important residues from sequence conservation. Bioinformatics, 23(15):1875\-82, 2007." 4 .IX Item "Capra JA and Singh M. Predicting functionally important residues from sequence conservation. Bioinformatics, 23(15):1875-82, 2007." .SH "OPTIONS" .IX Header "OPTIONS" .PD 0 .IP "\-a [\s-1NAME\s0]" 4 .IX Item "-a [NAME]" .PD Reference sequence. Print scores in reference to the named sequence (ignoring gaps). Default prints the entire column. .IP "\-b [0\-1]" 4 .IX Item "-b [0-1]" Lambda for window heuristic linear combination. Default=\fI.5\fR. .Sp Equation: .Sp \&\f(CW\*(C`score = (1 \- lambda) * average_score_over_window_around_middle + lambda * score_of_middle\*(C'\fR .IP "\-d [\s-1FILE\s0]" 4 .IX Item "-d [FILE]" Background distribution file, e.g. \fIdistributions/swissprot.distribution\fR. Default=built\-in \s-1BLOSUM62.\s0 .IP "\-g [0\-1)]" 4 .IX Item "-g [0-1)]" Gap cutoff. Do not score columns that contain more than gap cutoff fraction gaps. Default=\fI.3\fR. .IP "\-h" 4 .IX Item "-h" Print help. .IP "\-l [true|false]" 4 .IX Item "-l [true|false]" Use sequence weighting. Default=\fItrue\fR. .IP "\-m [\s-1FILE\s0]" 4 .IX Item "-m [FILE]" Similarity matrix file, e.g. \fImatrix/blosum62.bla\fR or .qij. Default=\fImatrix/blosum62.bla\fR. .Sp Some methods, e.g. \fIjs_divergence\fR, do not use this. .IP "\-n [true|false]" 4 .IX Item "-n [true|false]" Normalize scores. Print the z\-score (over the alignment) of each column raw score. Default=\fIfalse\fR. .IP "\-o \s-1FILE\s0" 4 .IX Item "-o FILE" Output file. Default: standard output stream. .IP "\-p [true|false]" 4 .IX Item "-p [true|false]" Use gap penalty. Lower the score of columns that contain gaps, proportionally to the sum weight of the gapped sequences. Default=\fItrue\fR. .IP "\-s [\s-1METHOD\s0]" 4 .IX Item "-s [METHOD]" Conservation estimation method, one of \fIshannon_entropy property_entropy property_relative_entropy vn_entropy relative_entropy js_divergence sum_of_pairs\fR. Default=\fIjs_divergence\fR. .IP "\-w [0\-INT]" 4 .IX Item "-w [0-INT]" Window size. Number of residues on either side included in the window. Default=\fI3\fR. .SH "EXAMPLES" .IX Header "EXAMPLES" Note: you may have to copy and uncompress the example data files before running the following examples. .IP "Compute conservation scores for the alignment using the Jensen-Shannon divergence with default settings and print out the scores:" 4 .IX Item "Compute conservation scores for the alignment using the Jensen-Shannon divergence with default settings and print out the scores:" .Vb 1 \& score_conservation /usr/share/doc/conservation\-code/examples/2plc_\|_hssp\-filtered.aln .Ve .IP "Score an alignment using Jensen-Shannon divergence, a window of size 3 (on either side of the residue), and the swissprot background distribution:" 4 .IX Item "Score an alignment using Jensen-Shannon divergence, a window of size 3 (on either side of the residue), and the swissprot background distribution:" .Vb 3 \& score_conservation \-s js_divergence \-w 3 \-d \e \& /usr/share/conservation\-code/distributions/swissprot.distribution \e \& /usr/share/doc/conservation\-code/examples/2plc_\|_hssp\-filtered.aln .Ve .SH "FILES" .IX Header "FILES" .IP "Distributions" 4 .IX Item "Distributions" \&\fI/usr/share/conservation\-code/distributions\fR .IP "Matrices" 4 .IX Item "Matrices" \&\fI/usr/share/conservation\-code/matrix\fR .SH "SEE ALSO" .IX Header "SEE ALSO" .IP "Homepage " 4 .IX Item "Homepage " .PD 0 .IP "Publication " 4 .IX Item "Publication " .IP "\fBconcavity\fR\|(1)" 4 .IX Item "concavity"