SEQUITUR-G2P(1) User Commands SEQUITUR-G2P(1)

NAME

sequitur-g2p - grapheme-to-phoneme conversion tool

SYNOPSIS

sequitur-g2p [OPTION]... FILE...

DESCRIPTION

Grapheme-to-Phoneme Conversion

Samples can be either in plain format (one word per line, followed by its phonetic transcription) or in Bliss XML Lexicon format.
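To make the plain format concrete, here is a minimal Python sketch of reading such samples. It is not part of sequitur-g2p. The function name `parse_plain_lexicon`, the whitespace-separated layout, and the example phoneme symbols are assumptions made for illustration only.

```python
def parse_plain_lexicon(lines):
    """Parse plain-format samples: one word per line, then its phonemes.

    Assumption: fields are separated by whitespace, the first field is the
    word, and every remaining field is one phoneme symbol.
    """
    samples = []
    for raw in lines:
        fields = raw.split()
        if not fields:
            continue  # skip blank lines
        word = fields[0]
        phonemes = tuple(fields[1:])
        samples.append((word, phonemes))
    return samples


# Hypothetical sample lines; the phoneme inventory is made up for the example.
example_lines = [
    "hello h @ l oU",
    "",
    "world w er l d",
]
samples = parse_plain_lexicon(example_lines)
for word, phonemes in samples:
    print(word, "->", " ".join(phonemes))
```

Keeping the phonemes as a tuple of symbols rather than a single string preserves multi-character symbols such as `oU` as one unit.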

OPTIONS

show program's version number and exit
show this help message and exit
profile execution time and store result in FILE
report resource usage and execution time
use Psyco to speed up execution
store temporary files in PATH
read training sample from FILE
read held-out training sample from FILE or use N% of the training data
read test sample from FILE
save state of training at regular time intervals. The name of the checkpoint file is derived from --writemodel.
load checkpoint FILE and continue training
transpose model, i.e. do phoneme-to-grapheme conversion
read model from FILE
write model to FILE
report error rates on development and test sets in each iteration
apply model to development set and report error rates
multigrams must have l1 ... l2 left-symbols and r1 ... r2 right-symbols
do not allow new joint-multigrams to be added to the model
estimate model using maximum approximation rather than true EM
ramp up the model
wipe out probabilities, retain only model structure
initialize probability estimation by counting how many times every graphone occurs in the training set, disregarding possible overlaps
minimum number of EM iterations during training
maximum number of EM iterations during training
re-adjust discounts in each iteration
set discount to D and keep it fixed
use character set encoding ENC
train/apply a phoneme-to-phoneme converter
evaluate only at segmental level, i.e. do not count syllable boundaries and stress marks
store test result in table FILE (for use with bootlog or R)
apply grapheme-to-phoneme conversion to words read from FILE
generate pronunciation variants until \sum_i p(var_i) >= Q (only effective with --apply)
generate up to N pronunciation variants (only effective with --apply)
use a translation memory (read from sample FILE) instead of a genuine model (use in combination with -x to evaluate two files against each other)
limit size of search stack to N elements
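The two variant-generation options above (stop once the accumulated probability reaches Q, or after at most N variants) can be pictured with the following Python sketch. It is not sequitur-g2p's own code. The function `select_variants`, its parameters `mass` and `limit`, and the candidate scores are hypothetical, and the sketch assumes the candidates are already available with their posterior probabilities.

```python
def select_variants(scored, mass=None, limit=None):
    """Pick pronunciation variants from (pronunciation, probability) pairs.

    Candidates are taken in order of decreasing probability. Selection stops
    when the summed probability reaches `mass` (the Q threshold) or when
    `limit` variants (the N bound) have been chosen, whichever comes first.
    """
    ranked = sorted(scored, key=lambda item: item[1], reverse=True)
    chosen = []
    total = 0.0
    for pronunciation, probability in ranked:
        if limit is not None and len(chosen) >= limit:
            break
        chosen.append((pronunciation, probability))
        total += probability
        if mass is not None and total >= mass:
            break
    return chosen


# Hypothetical candidates for one word, with made-up probabilities.
candidates = [
    ("t @ m A t oU", 0.3),
    ("t @ m eI t oU", 0.5),
    ("t oU m eI t oU", 0.2),
]
by_mass = select_variants(candidates, mass=0.7)
by_count = select_variants(candidates, limit=1)
print(by_mass)
print(by_count)
```

With a mass threshold, the number of variants adapts to how confident the model is: a word with one dominant pronunciation yields a single variant, while an ambiguous word yields several.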
May 2016 sequitur-g2p 0+r1668