table of contents
TRANSMUTE(1) | NCBI Entrez Direct User's Manual | TRANSMUTE(1) |
NAME¶
transmute - transform data, particularly within NCBI Entrez Direct
SYNOPSIS¶
transmute -x2p|-j2p|-f2p
transmute -align [-a codes] [-g N] [-h N] [-w N]
transmute -j2x (json2xml) [-set tag] [-rec tag] [-nest flat|recurse|plural|singular|depth|element]
transmute -a2x [-set tag] [-rec tag]
transmute -t2x|-c2x|-s2x (tbl2xml / csv2xml / scn2xml) [-set tag] [-rec tag] [-skip N] [-header] [-lower|-upper] [-indent|-flush] columnName1 ...
transmute -i2x (ini2xml)
transmute -m2x (toml2xml)
transmute -y2x (yaml2xml)
transmute -txf (filter-record) [-pattern str] [-exclude str] [-require str] [-min N] [-max N]
transmute -f2x (fsa2xml)
transmute -g2x (gbf2xml)
transmute -g2r (gbf2ref)
transmute -r2p (ref2pmid) [-options confirm|verbose|fast|slow|exact ...]
transmute -gbf (filter-genbank) [-accession acc|-accessions file] [-taxid id|-taxids file] [-organism name] [-exclude str] [-require str] [-truncate]
transmute -revcomp
transmute -remove [-first N] [-last N]
transmute -retain -leading N-trailing N
transmute -replace -offset N|-column N [-delete N] [-insert seq] [-lower]
transmute -extract [-1-based|-0-based] [-lower] feat_loc
transmute -cds2prot [-gcode N] [-frame N|-all] [-stop] [-trim] [-part5] [-part3] [-every] [-between str] [-circular] [-orf] [-max N]
transmute -molwt [-met|-fmet]
transmute -hgvs
transmute -counts
transmute -diff
transmute -codons -nuc seq -prot seq [-frame N] [-three]
transmute -search [-protein] [-circular] [-top] pattern ...
transmute -find [-relaxed] [-sensitive] [-whole] pattern ...
transmute -encodeXML|-decodeXML|-plainXML
transmute -encodeURL|-decodeURL
transmute -encode64|-decode64
transmute -plain
transmute -upper|-lower
transmute -aa1to3|-aa3to1
transmute -relax
transmute -format [fmt] [-xml declaration] [-doctype declaration] [-comment] [-cdata] [-combine] [-self] [-unicode style] [-script style] [-mathml terse]
transmute -filter element action target
transmute -normalize database
DESCRIPTION¶
transmute reads data from standard input, transforms it according to the specified mode, and writes the transformed data to standard output.
OPTIONS¶
Pretty-Printing¶
- -a codes
- Column alignment codes:
Data Conversion¶
- -j2x
- Convert JSON stream to XML suitable for -path navigation.
- -set tag
- Replace set wrapper tag.
- -rec tag
- Replace record wrapper tag.
- -nest flat|recurse|plural|singular|depth|element
- Nested array naming policy.
- -a2x
- Convert text ASN.1 stream to XML suitable for -path navigation.
- -t2x, -c2x, -s2x
- Convert tab-delimited table, comma-separated values file, or semicolon-delimited table, respectively, to XML.
- -set tag
- Replace set wrapper tag.
- -rec tag
- Replace record wrapper tag.
- -skip N
- Skip the first N lines.
- -header
- Use fields from first row for column names.
- -lower
- Convert text to lowercase.
- -upper
- Convert text to uppercase.
- -indent
- Indent XML output.
- -flush
- Do not indent XML output.
- columnName1 ...
- XML object names per column.
- -i2x
- Convert .ini configuration file format to XML.
- -m2x
- Convert TOML configuration file format to XML.
- -y2x
- Convert YAML configuration file format to XML.
- -txf
- Text file filtering.
- -pattern str
- Pattern at start of record.
- -exclude str
- Reject if string is present.
- -require str
- Require presence of string.
- -min N
- Minimum record number.
- -max N
- Maximum record number.
- -f2x
- Convert a FASTA stream to corresponding XML.
- -g2x
- Convert GenBank/GenPept flatfile format to INSDSeq XML.
- -g2r
- Convert GenBank/GenPept flatfile format to Reference XML.
- -r2p [-options option ...]
- Reference Index XML lookup to find PMIDs. Supported option values:
- -gbf
- GenBank/GenPept filtering.
- -accession acc
- Single accession.
- -accessions file
- File of accessions.
- -taxid id
- Single taxon identifier.
- -taxids file
- File of taxon identifiers.
- -organism name
- Organism scientific name.
- -exclude str
- Reject if string is present.
- -require str
- Require presence of string.
- -truncate
- Remove features and sequence.
Sequence Editing¶
- -retain
- Save either end of sequence.
- -leading N
- Keep first N bases or residues.
- -trailing N
- Keep last N bases or residues.
- -replace
- Apply base or residue substition.
- -offset N
- Skip ahead by 0-based count (SPDI), or
- -column N
- Move just before 1-based position (HGVS).
- -delete N
- Delete N bases or residues.
- -insert seq
- Insert given sequence.
- -lower
- Lower-case original sequence.
- -extract
- Use xtract -insd ... feat_location instructions.
Sequence Processing¶
- -cds2prot
- Translate coding region into protein.
- -gcode N
- Genetic code (1 by default).
- -frame N
- Offset in sequence (0-based).
- -stop
- Include stop residue.
- -trim
- Remove trailing Xs and *s.
- -part5
- CDS partial at 5' end.
- -part3
- CDS extends past 3' end.
- -every
- Translate all codons.
- -between str
- Optional string between residues.
- -all
- Simultaneous six-frame translations.
- -circular
- Reprocess first two priming bases at end.
- -orf
- Only capitalize residues at start states.
- -max N
- Number of residues per line.
- -molwt
- Calculate molecular weight of peptide.
Variation Processing¶
- -hgvs
- Convert Human Genome Variation Society variation format to XML.
Sequence Comparison¶
Sequence Searching¶
- -search
- Search for one or more patterns in a sequence, skipping any FASTA definition line (with a leading >). Each pattern can have an optional alias, e.g., GGATCC:BamHI.
Text Searching¶
- -find
- Find one or more patterns in text, allowing digits, spaces, punctuation, and phrases, e.g., "double, double toil and trouble".
- -relaxed
- Match on words with letters and digits, ignoring spacing and punctuation.
- -sensitive
- Case-sensitive match, distinguishing upper-case and lower-case letters.
- -whole
- Match on whole words or multi-word phrases; implies -relaxed.
- pattern
- Pattern to search for.
String Transformations¶
XML¶
- -encodeXML
- XML-encode <, >, &, ", and ' characters.
- -decodeXML
- Decode XML entity references.
- -plainXML
- Remove embedded mixed-content tags and compress runs of spaces.
URL¶
- -encodeURL
- Compress runs of spaces, and URI-escape the result.
- -decodeURL
- URI-unescape the input.
Base64¶
Accent¶
- -plain
- Strip accents from the input.
Case¶
Protein¶
Letters plus Digits¶
- -relax
- Remove all punctuation and compress whitespace.
Customized XML Reformatting¶
- compact
- Compress runs of spaces.
- flush
- Suppress line indentation.
- indent
- Indent according to nesting depth.
- expand
- Place each attribute on a separate line.
- -xml declaration
- Use the given XML declaration.
- -doctype declaration
- Use the given document type declaration.
- -comment
- Preserve comments.
- -cdata
- Preserve cdata blocks.
- -combine
- If the input contains multiple top-level documents, combine them.
- -self
- Keep empty self-closing tags.
- -unicode style
- How to handle Unicode superscript and subscript digits (first converted to ASCII form in all cases).
- fuse
- Run them all together, with no additional markup.
- space
- Add spaces between digits in different positions.
- period
- Add periods between digits in different positions.
- brackets
- Surround superscripts by square brackets and subscripts by parentheses.
- markdown
- Surround superscripts with carets and subscripts with tildes.
- slash
- Add backslashes when going up in height and forward slashes when going down.
- tag
- Put superscripts in XML sup elements and subscripts in sub elements.
- -script style
- How to handle XML sup and sub elements (denoting superscripts and subscripts, respectively).
- -mathml terse
- Flatten MathML markup tersely.
XML Modification¶
- -filter element action target
- Actions:
EFetch XML Normalization¶
- -normalize database
- Adjust XML fields to conform to common conventions.
SEE ALSO¶
align-columns(1), blst2tkns(1), bsmp2info(1), cit2pmid(1), csv2xml(1), disambiguate-nucleotides(1), download-flatfile(1), download-pubmed(1), ds2pme(1), efetch(1), filter-columns(1), filter-genbank(1), gbf2info(1), gbf2tbl(1), gff2xml(1), gff-sort(1), jsonl2xml(1), pma2apa(1), pma2pme(1), pmc2bioc(1), pmc2info(1), print-columns(1), ref2pmid(1), snp2tbl(1), sort-table(1), spdi2tbl(1), systematic-mutations(1), tbl2prod(1), uniq-table(1), xml2fsa(1), xml2tbl(1), xfetch(1), xtract(1).
2025-05-26 | NCBI |