NAME¶
open_jtalk — Japanese TTS system
SYNOPSIS¶
open_jtalk [
options] [
infile]
DESCRIPTION¶
This manual page documents briefly the 
open_jtalk command.
This manual page was written for the 
Debian distribution because the
  original program does not have a manual page. Instead, it has documentation in
  the GNU 
Info format; see below.
open_jtalk is a program that synthesize speech waveform from Japanese
  texts. It uses HMMs trained by the HMM-based speech synthesis system (HTS).
OPTIONS¶
A summary of options is included below.
 
  - -x dir
 
  - dictionary directory
 
  - -td tree
 
  - decision tree files for state duration
 
  - -tm tree
 
  - Show version of program.
 
  - -tf tree
 
  - decision tree files for Log F0
 
  - -tl tree
 
  - decision tree files for low-pass filter
 
  - -md pdf
 
  - model files for state duration
 
  - -mm pdf
 
  - model files for spectrum
 
  - -mf pdf
 
  - model files for Log F0
 
  - -ml pdf
 
  - model files for low-pass filter
 
  - -dm win
 
  - window files for calculation delta of spectrum
 
  - -df win
 
  - window files for calculation delta of Log F0
 
  - -dl win
 
  - window files for calculation delta of low-pass filter
 
  - -ow s
 
  - filename of output wav audio (generated speech)
 
  - -ot s
 
  - filename of output trace information
 
  - -s i
 
  - sampling frequency [16000][1--48000]
 
  - -p i
 
  - frame period (point) [80][1--]
 
  - -a f
 
  - all-pass constant [0.42][0.0--1.0]
 
  - -g i
 
  - gamma = -1 / i (if i=0 then gamma=0) [0][0--]
 
  - -b f
 
  - postfiltering coefficient [0.0][-0.8--8.0]
 
  - -l
 
  - regard input as log gain and output linear one (LSP)
 
  - -u f
 
  - voiced/unvoiced threshold[0.5][0.0--1.0]
 
  - -em tree
 
  - decision tree files for GV of spectrum
 
  - -ef tree
 
  - decision tree files for GV of Log F0
 
  - -el tree
 
  - decision tree files for GV of low-pass filter
 
  - -cm pdf
 
  - filenames of GV for spectrum
 
  - -cf pdf
 
  - filenames of GV for Log F0
 
  - -cl pdf
 
  - filenames of GV for low-pass filter
 
  - -jm f
 
  - weight of GV for spectrum [1.0][0.0--2.0]
 
  - -jf f
 
  - weight of GV for Log F0 [1.0][0.0--2.0]
 
  - -jl f
 
  - weight of GV for low-pass filter [1.0][0.0--2.0]
 
  - -k tree
 
  - GV switch
 
  - -z i
 
  - audio buffer size [1600][0--48000]
 
  - infile
 
  - text file
 
option '-d' may be repeated to use multiple delta parameters. generated
  spectrum, log F0, and low-pass filter coefficient sequences are saved in
  natural endian, binary (float) format.
 
EXAMPLE¶
If you installed hts-voice-nitech-jp-atr503-m001 in the current directory, the
  following command let you make a voice file from input.txt:
 
% open_jtalk -s 48000 -p 240 -a 0.55 \ 
-td tree-dur.inf    -tm tree-mgc.inf    -tf tree-lf0.inf \ 
-tl tree-lpf.inf    -md dur.pdf         -mm mgc.pdf \ 
-mf lf0.pdf         -ml lpf.pdf         -dm mgc.win1 \ 
-dm mgc.win2        -dm mgc.win3        -df lf0.win1 \ 
-df lf0.win2        -df lf0.win3        -dl lpf.win1 \ 
-em tree-gv-mgc.inf -ef tree-gv-lf0.inf -cm gv-mgc.pdf \ 
-cf gv-lf0.pdf      -k  gv-switch.inf   -ow output.wav \ 
-x  dic_dir input.txt 
 
 
 
AUTHOR¶
This manual page was written by Koichi Akabe vbkaisetsu@gmail.com for the
  
Debian system (and may be used by others). Permission is granted to
  copy, distribute and/or modify this document under the terms of the GNU
  General Public License, Version 2 any later version published by the Free
  Software Foundation.
 
On Debian systems, the complete text of the GNU General Public License can be
  found in /usr/share/common-licenses/GPL.