Scroll to navigation

Text::Kakasi::JP(3pm) User Contributed Perl Documentation Text::Kakasi::JP(3pm)

NAME

Text::Kakasi::JP - Text::Kakasi ueEeUeXiXEXXaaXaXoXE

SYNOPSIS

  use Text::Kakasi;
  # functional
  $res = Text::Kakasi::getopt_argv('kakasi', '-ieuc', '-w');
  $str = Text::Kakasi::do_kakasi($japanese_text);
  # object-oriented
  $obj = Text::Kakasi->new('-ieuc','-w');
  $str = $obj->get($japanese_text);

DESCRIPTION

EeUeXaXXXaaXXXeeXIekakasi (kanji kana simple inverter)XOXIXXXoXXXXXO~XXXXXX XoAeoXXXXXPXXXXkakasiXXAeoXXXXXeeXIXIeXXinput methodXEXO~XXXXXEXieXAXXXEXAXu XPXXXeEXXoXOXeXXXEXXXXXXXXXEXXXiXXXPXuXEXEXEeXO~EN~XXXXXeeXXCXXCXXXXkakasi XIeXPXXEXXn~XIEXXXXAXn~XXEeXaIoIN~XXXeeXXXEXXXCXXPXXXXXUeXXXXXIe <http://kakasi.namazu.org/> XoXaeIXXXXXXXXX
Text::Kakasi XIeXXAaAAayAaXXEXOXoXUXXXXXXXeXOXXXOeXXXXXXXEXOXpXIAD‐EyXIXXXo XXXXXO~XXXXXXXoAeoXXXXXPXXXXXOXoXUXXXXXIEyXIe ver. 1.05 XE 100% XssXXXCXX XXXXPerl 5.8 XEXssXIXXXXCXXoXEXXXEeXIeXXXOeXXXXXXXEXOXpXXXoXXXXXO~XXXXXXXo XEXXEXIXXXXXXeXPXXXX

Functional APIs

A~iXO~:XEXXXIAaIAXCXIe "Text::Kakasi::" XIeXEIXXXXXXXPXXXXXCXO~XXXeeXEXCXIe Text::Kakasi XIeXXXa export XXXPXXXoXXXEXXXIXOXoXo import XXXeeXEeXIeXXIa~ XXXD‐XEXXXIXeXXXEeXXXPXXXX
  use Text::Kakasi qw/getopt_argv do_kakasi/;
$err = getopt_argv($arg1, $arg2, ...)
kakasiXoXeXueXXXXXPXXXXXuXoXIekakasiXIXXXXXXXcXoXEXAauXCXXXXXEXXXIe kakasi 2.3.4 XEeXXXXXeeXXXXXXXcXoXIXXIXXCXXXX
 
  -a[jE] -j[aE] -g[ajE] -k[ajKH]
  -E[aj] -K[ajkH] -H[ajkK] -J[ajkKH]
  -i{oldjis,newjis,dec,euc,sjis}
   -o{oldjis,newjis,dec,euc,sjis}
  -r{hepburn,kunrei} -p -s -f -c"chars" 
   [jisyo1, jisyo2,,,]
  Character Sets:
       a: ascii  j: jisroman  g: graphic  k: kana 
       (j,k     defined in jisx0201)
       E: kigou  K: katakana  H: hiragana J: kanji
       (E,K,H,J defined in jisx0208)
  Options:
    -i: input coding system    -o: output coding system
    -r: romaji conversion system
    -p: list all readings (with -J option)
    -s: insert separate characters (with -J option)
    -f: furigana mode (with -J option)
    -c: skip chars within jukugo
        (with -J option: default TAB CR LF BLANK)
    -C: romaji Capitalize (with -Ja or -Jj option)
    -U: romaji Upcase     (with -Ja or -Jj option)
    -u: call fflush() after 1 character output
    -w: wakatigaki mode
    
 
EeUeXOXoXIeXXAXXuXpXEe0XoXXXXCOXpXEeXIeXXXiXEXXXIA~IXoEOeXXXPXXXX
$processed_str = do_kakasi($str)
$strXEeAD‐XXXkakasiXoXAXOXXXXXeeXIXoEOeXXXPXXXXXXXeXXXIIaeAeXXEXAXXXXX XiXcXX"undef"XoEOeXXXPXXXX
 
Ver. 1.x XEXUXEXeXXXuXoXIXCXeXEe"kakasi"XoXAeXXXeeEXIXXIeXXXeXPXXXoXXXaXA XiXoXXXPXCXEXXXeXAeXXXXaXXXXXPXXXoXX
close_kanwadic()
XAaXXAaXXEeopenXXXiXXXXn~XO~XXXXXeeXocloseXXXPXXXXEeUeXOXoXIeXaXiXIXssXXAXIXss XIXXXaXEeAXXssXXXXXXXssXCXIeIoIN~XXXeeEXIXXIeXPXXXEXXXCXXXcXXXX

Object-Oriented APIs

Version 2.0 XeXeXXEeUeXaXXXaaXXXeeXIeXXXOeXXXXXXXEXOXp(OOP)XIAPIXaAeoXXXXXPXXXX
$k = Text::Kakasi->new($args ...)
XXXOeXXXXXXXEXoAXAXXXXPXXXXXuXoXXAXXssXXXeeXiXcXIeXX "Text::Kakasi->new->set($args ...)" XXXXXIXEXXXXeeXIXEXEXeXPXXXX
$k->set($args ...)
"getopt_argv" XOXIOOPXXXoXXXXXO~XXXXXXXX
 
  my $k = Text::Kakasi->new;
  $k->set('-w'); # Text::Kakasi::getopt_argv('-w');
    
 
XXXXXXXXXXoEOeXX "getopt_argv()" XEXIeXUXEXeXX "set" XIeXieXEeXXXOeXXXXXX XEXXXEXoEOeXXXIXCXXXEXXXIXeXXXEEXXXXXAeCXXCXXXX
 
  my $tokenized = $k->set('-w')->get($raw_japanese);
    
 
XXXXXXXXXXXIssXXXXXiXcXEeXIeXX "$k->error" XoXEXXXPXXXX
$k->error
XCXaaXEeXXD‐XiXXXaXXXA~XEXIXXXXXXXXXXoEOeXXXPXXXX
$processed = $k->get($raw_japanese);
"do_kakasi" XOXIOOPXXXoXXXXXO~XXXXXXXXXEXXXIoXAeXIXXXXXEXIeAauXAXCXXXX
 
  # Functional
  getopt_argv('-w'); $processed = do_kakasi($raw_japanese);
  # OOP
  $k->set('-w')->get($raw_japanese);
    

Perl 5.8 Features

Perl 5.8 XCXXEXXuXXXXXEEN~XXXIEXXaAaXXEIeEAEXssXXEncodeXaXXXaaXXXeeXEeXeXA~ XAeoXXXXXiXPXXXXXXEeUeXaXXXaaXXXeeXIe perl 5.8 XEXssXEeXXXXIXXCXXoIoIN~XXXeeXe XXXEeXEXA~XXXXPXXXX
"$k->set($args ...)"XOXIXuXoXEeXX"-i encoding" XXXeXO "-o encoding"XXXXXPXPXiXeeXiXcXXEXXuXXXXXEEN~XXXIeEncodeXXXOXXXPXXXX XuXoXEeXIekakasiXXXXXYXXXEXXXXXXeeXaXIXEXXXEeXaXXEncodeXXXXXYXXXEXXX-1AE XXXeeXaXIXIeAXXXEXXXPXXXX Ia~XXXD‐XX
  $result = $k->set(qw/-iutf8 -outf8 -w/)->get($utf8);
XIeXEXXXEXUXUeXXAXCXXXX
  $euc = encode('eucjp' => $utf8);
  getopt_argv('-w');
  $tmp = do_kakasi($euc);
  $result = decode('eucjp' => $tmp);
"-outf8",XXXPXPXiXXXXeeXiXcXIeXX "$k->get" XIeutf8XO~XeXXEO~XXIEXXu IoXoEOeXXXPXXXX
XXXiXeXIXXCXXIeXX $Text::Kakasi::HAS_ENCODEXEe0XoAssAeeXXXeeXXXEXCIoIN~XX XEXXXeXXXEeXXXeeXXXEXaXAeCXXCXXXXAeIXieEN~XoXIeXXEeUeXaXXXaaXXXeeXouseXXXXXYXEeXX XAaXXEeAssAeeXXXiXPXXXX

SEE ALSO

kakasi(1), <http://kakasi.namazu.org/>,Encode,perlunicode

COPYRIGHT

  (C) 1998, 1999, 2000 NOKUBI Takatsugu <knok@daionet.gr.jp>
  (C) 2003 Dan Kogai <dankogai@dan.co.jp>
There is no warranty for this free software. Anyone can modify and/or redistribute this module under GNU GENERAL PUBLIC LICENSE. See COPYING file that is included in the archive for more details.
2003-05-26 perl v5.14.2