[freewnn:00542] Re: Manuals
G'day,
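Sorry this is a little late, but here are English translations of several of the man pages.
Comments, corrections, and incorporation into freewnn would be appreciated.
So, here are atod.1, atof.1, dtoa.1, pubdic.5, and usr_dic.5.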
========================================================================
.\"
.\" $Id: ./atod.man $
.\"
.\"
.\" FreeWnn is a network-extensible Kana-to-Kanji conversion system.
.\" This file is part of FreeWnn.
.\"
.\" Copyright Kyoto University Research Institute for Mathematical Sciences
.\" 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright OMRON Corporation. 1987, 1988, 1989, 1990, 1991, 1992, 1999
.\" Copyright ASTEC, Inc. 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright FreeWnn Project 1999, 2000
.\"
.\" Maintainer: FreeWnn Project <freewnn@tomo.gr.jp>
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
.\"
.TH ATOD 1 "30 April 2001"
.SH NAME
.sv 1
.nf
.ta 0.1i 2i
atod EUC Text to Binary Dictionary Converter
.fi
.SH SYNOPSIS
.sv 1
.nf
.ta 0.1i 3i
atod [-s \fIno_of_entries\fR] [-R] [-S] [-U] [-r] [-N] [-n]
[-P \fIpassword_file\fR] [-p \fIfrequency_password_file\fR]
<\fIbinary_dictionary\fR>
.fi
.SH DESCRIPTION
.HP 0
.IP
atod converts an appropriately formatted EUC-encoded file to the
binary dictionary format used by Wnn. The default EUC encoding is
UJIS. To use another EUC encoding, set the environment variable
CSWIDTH as described below:
.br
.br
CSWIDTH=b1[:c1][,b2[:c2][,b3[:c3]]]
.br
.br
"b1-b3" give the number of bytes per character in each code-set (not counting SS2 and SS3).
.br
"c1-c3" give the number of display columns per character in each code-set.
.br
"b1" and "c1" are for code-set 1.
.br
"b2" and "c2" are for code-set 2.
.br
"b3" and "c3" are for code-set 3.
.br
The value of CSWIDTH for UJIS is 2,1,2.
.br
atod accepts the following options:
.TP 8
\fB\-s\fR \fIno_of_entries\fR
Specify how much memory to allocate, as a number of entries. This should
be a number slightly larger than the number of entries in the
dictionary. The default is 70000. It only needs to be set if atod exits
with an error message saying that it does not have enough memory; in
that case, rerun atod with a larger value.
.TP 8
\fB\-R\fR
Convert the dictionary into a reversible format. (default)
.TP 8
\fB\-S\fR
Convert the dictionary into a fixed format.
.TP 8
\fB\-U\fR
Convert the dictionary into an updatable format.
.TP 8
\fB\-r\fR
Reverse the pronunciation and kanji in the EUC dictionary.
.TP 8
\fB\-P\fR \fIpassword_file\fR
Specify the password file for the dictionary.
.TP 8
\fB\-p\fR \fIfrequency_password_file\fR
Specify the password file for the frequency file.
.TP 8
\fB\-N\fR
Set the password of the dictionary to "*".
.TP 8
\fB\-n\fR
Set the password of the frequency file to "*".
.SH FILES
.HP 0
.IP
Because it is used infrequently, atod is not normally installed into
/usr/local/bin or /usr/bin. Instead it can be found at:
.br
.PD 0
.B /usr/local/bin/Wnn4/atod (default)
.br
.B /usr/bin/Wnn4/atod (Debian)
.PD
.SH "SEE ALSO"
.sv 1
.nf
jserverrc(4)
========================================================================
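A quick aside on CSWIDTH before the next page: the sketch below is only
an illustration (it is not FreeWnn code) of how a value in the
b1[:c1][,b2[:c2][,b3[:c3]]] form described in atod(1) above could be
parsed. The function name parse_cswidth is made up for this example, and
the rule that an omitted column count defaults to the byte count is an
assumption, not something the man page states.

```python
def parse_cswidth(value):
    """Parse 'b1[:c1][,b2[:c2][,b3[:c3]]]' into (bytes, columns) pairs.

    One pair is returned per code-set, in the order code-set 1, 2, 3.
    """
    fields = value.split(",")
    if len(fields) > 3:
        raise ValueError("CSWIDTH describes at most three code-sets")

    pairs = []
    for field in fields:
        byte_part, has_columns, column_part = field.partition(":")
        nbytes = int(byte_part)  # raises ValueError on an empty or bad field
        # Assumption: when ':cN' is omitted, columns default to the byte count.
        ncolumns = int(column_part) if has_columns else nbytes
        pairs.append((nbytes, ncolumns))
    return pairs


# The UJIS value given in atod(1).
print(parse_cswidth("2,1,2"))
```

Under that assumption the UJIS value 2,1,2 would be equivalent to
2:2,1:1,2:2.
========================================================================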
.\"
.\" $Id: ./atof.man $
.\"
.\"
.\" FreeWnn is a network-extensible Kana-to-Kanji conversion system.
.\" This file is part of FreeWnn.
.\"
.\" Copyright Kyoto University Research Institute for Mathematical Sciences
.\" 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright OMRON Corporation. 1987, 1988, 1989, 1990, 1991, 1992, 1999
.\" Copyright ASTEC, Inc. 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright FreeWnn Project 1999, 2000
.\"
.\" Maintainer: FreeWnn Project <freewnn@tomo.gr.jp>
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
.\"
.TH ATOF 1 "30 April 2001"
.SH NAME
.sv 1
.nf
.ta 0.1i 2i
atof Convert Function Word Information to Special Format
.fi
.SH SYNOPSIS
.sv 1
.nf
.ta 0.1i 3i
atof <\fIfzk.data_filename\fR>
.fi
.SH DESCRIPTION
.HP 0
.IP
atof reads UJIS-style EUC-encoded information about function words
(fuzokugo; see fzk(4)) from standard input and writes it out in a
special format (see fzk.data(4)).
If the input contains separate entries with the same pronunciation and
part of speech, atof combines them into a single function word and
prints a message to stderr. The adjacency information of the combined
function word is the disjunction of the adjacency information of the
separate entries. (This happens, for example, when the stem and the
inflecting part of an inflecting word are defined separately.)
.SH FILES
.HP 0
.IP
Because it is used infrequently, atof is not normally installed into
/usr/local/bin or /usr/bin. Instead it can be found at:
.br
.PD 0
.B /usr/local/bin/Wnn4/atof (default)
.br
.B /usr/bin/Wnn4/atof (Debian)
.PD
.SH "SEE ALSO"
.sv 1
.nf
fzk.u(4), fzk.data(4)
========================================================================
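On the merging behaviour described in atof(1) above: the following is a
rough sketch, not atof's actual implementation, of combining entries
that share a pronunciation and part of speech by taking the disjunction
of their adjacency information. It assumes the adjacency information can
be modelled as an integer bit mask; the tuple layout, the function name
merge_function_words, and the sample data are all hypothetical. The real
input format is the one described in fzk(4).

```python
import sys


def merge_function_words(entries):
    """Merge (pronunciation, pos, adjacency_bits) tuples.

    Entries with the same pronunciation and part of speech are combined,
    and their adjacency bit masks are OR-ed together (the disjunction).
    """
    merged = {}
    for pronunciation, pos, adjacency in entries:
        key = (pronunciation, pos)
        if key in merged:
            # Report the merge on stderr, as the man page says atof does.
            print(f"merged duplicate entry: {pronunciation} ({pos})",
                  file=sys.stderr)
            merged[key] |= adjacency
        else:
            merged[key] = adjacency
    return merged


# Hypothetical sample data, for illustration only.
entries = [
    ("ta", "jodoushi", 0b0101),
    ("ta", "jodoushi", 0b0011),  # same reading and POS: merged with the first
    ("ta", "setsubi", 0b1000),   # same reading, different POS: kept separate
]
merged = merge_function_words(entries)
print(merged[("ta", "jodoushi")] == 0b0111)
```

A bit mask is used here only because OR-ing masks is the simplest way to
show a disjunction; a set of adjacent classes with set union would work
equally well.
========================================================================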
.\"
.\" $Id: ./dtoa.man $
.\"
.\"
.\" FreeWnn is a network-extensible Kana-to-Kanji conversion system.
.\" This file is part of FreeWnn.
.\"
.\" Copyright Kyoto University Research Institute for Mathematical Sciences
.\" 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright OMRON Corporation. 1987, 1988, 1989, 1990, 1991, 1992, 1999
.\" Copyright ASTEC, Inc. 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright FreeWnn Project 1999, 2000
.\"
.\" Maintainer: FreeWnn Project <freewnn@tomo.gr.jp>
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
.\"
.TH DTOA 1 "30 April 2001"
.SH NAME
.sv 1
.nf
.ta 0.1i 2i
dtoa Binary Dictionary to EUC Text Converter
.fi
.SH SYNOPSIS
.sv 1
.nf
.ta 0.1i 3i
.B dtoa [-n] [-s] [-e|-E] [-h \fIPOS data_file\fR]
<\fIbinary_dictionary\fR> [<\fIfrequency_data\fR> ...]
.fi
.SH DESCRIPTION
.HP 0
.IP
dtoa converts the specified Wnn binary dictionary to an EUC-encoded text
file and writes it to stdout. The default EUC encoding is UJIS. To
output in another EUC encoding, set the environment variable
CSWIDTH as described in atod(1).
.br
The basic output format is:
.br
Pronunciation Kanji POS Frequency
.br
One or more frequency files can also be given as input; they are
used to calculate the frequency.
.TP
\fB\-n\fR
Sort the entries of the EUC-encoded dictionary by pronunciation: the
long vowel marker sorts first, followed by hiragana, then ASCII.
.TP
\fB\-s\fR
Add serial numbers to each entry.
.TP
\fB\-e\fR
Expand special expressions (default). With this option, spaces, tabs
and so on will be expanded into 8-bit expressions.
.TP
\fB\-E\fR
Don't expand special expressions. With this option, spaces, tabs
and so on will not be expanded into 8-bit expressions.
.TP
\fB\-h\fR \fIPOS data_file\fR
Specify the POS data file. The default is
\fI/usr/local/lib/wnn/ja_JP/hinsi.data\fR.
.SH FILES
.HP 0
.IP
Because it is used infrequently, dtoa is not normally installed into
/usr/local/bin or /usr/bin. Instead it can be found at:
.br
.PD 0
.B /usr/local/bin/Wnn4/dtoa (default)
.br
.B /usr/bin/Wnn4/dtoa (Debian)
.PD
.SH "SEE ALSO"
.sv 1
.nf
atod(1), wnntouch(1)
========================================================================
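On the dtoa(1) output format above: here is a small sketch of reading
one line of the basic "Pronunciation Kanji POS Frequency" format. It
assumes the four fields are separated by whitespace and that the
frequency is a plain integer, neither of which the man page spells out.
The Entry type, the parse_entry function, and the sample line are
invented for illustration, and the sketch does not try to handle the
serial numbers added by -s.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    pronunciation: str
    kanji: str
    pos: str
    frequency: int


def parse_entry(line):
    """Parse one 'Pronunciation Kanji POS Frequency' line.

    Assumption: fields are whitespace-separated and there are exactly four.
    """
    fields = line.split()
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}")
    pronunciation, kanji, pos, frequency = fields
    return Entry(pronunciation, kanji, pos, int(frequency))


# Hypothetical sample line, not taken from a real dictionary.
sample = parse_entry("かんじ 漢字 名詞 5")
print(sample.frequency)
```

Going the other way, joining the four fields with single spaces gives
back a line that parse_entry accepts.
========================================================================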
.\"
.\" $Id: ./pubdic.man $
.\"
.\"
.\" FreeWnn is a network-extensible Kana-to-Kanji conversion system.
.\" This file is part of FreeWnn.
.\"
.\" Copyright Kyoto University Research Institute for Mathematical Sciences
.\" 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright OMRON Corporation. 1987, 1988, 1989, 1990, 1991, 1992, 1999
.\" Copyright ASTEC, Inc. 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright FreeWnn Project 1999, 2000
.\"
.\" Maintainer: FreeWnn Project <freewnn@tomo.gr.jp>
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
.\"
.TH PUBDIC 5 "15 April 2001"
.SH NAME
.sv 1
.nf
.ta 0.1i 2i
PUBDIC Fixed Format Dictionaries (Japanese)
.fi
.SH SYNOPSIS
.sv 1
.nf
.ta 0.1i 3i
/usr/local/lib/wnn/ja_JP/dic/pubdic/*.dic (Default)
/var/lib/wnn/ja_JP/dic/pubdic/*.dic (Debian)
.fi
.SH DESCRIPTION
.HP 0
.IP
These are the fixed format kana-kanji conversion dictionaries used by
uum(1). Dictionary configuration is described in wnnenvrc(4).
Each user's frequency files are created by uum at startup.
The default location is
.br
JSERVER_DIR/usr/user_name/freq_file_name.h
.br
but the path can be changed in uumrc(4). JSERVER_DIR is the value of
jserver_dir in jserverrc(4) (in Debian it is /var/lib/wnn/ja_JP/dic/).
.br
There are 10 fixed format dictionaries supplied with the system
(pubdic/*.dic). These are based on the set known as pubdic+.
.nf
.ta 0.1i 2i 5i
	File	Description	Size
	kihon	Basic (level 1)	28,340
	tankan	Single Character (level 1)	2,920
	chimei	Place Name (level 1)	4,730
	jinmei	Person Name (level 1)	3,480
	setsuji	Affix (level 1)	1,080
	computer	Computer (level 1)	1,000
	bio	Life Sciences (level 1)	580
	koyuu	Other Proper Names (level 1)	300
	symbol	Symbols (named) (level 1)	190
	special	Special words (level 1)	30
.fi
The dictionaries can be converted to plain text (EUC-JP encoded) using
dtoa(1), and are built using atod(1).
.br
The FreeWnn project also includes some contributed dictionaries:
gerodic (g-jinmei: 23,346 names) and the Wnn Consortium's (wnncons)
single-character dictionaries (tankan2: 4,265 kanji; tankan3: 12,361 kanji).
.SH "SEE ALSO"
.sv 1
.nf
uum(1), jserver(1), wnnenvrc(4), jserverrc(4), dtoa(1), atod(1).
========================================================================
.\"
.\" $Id: ./usr_dic.man $
.\"
.\"
.\" FreeWnn is a network-extensible Kana-to-Kanji conversion system.
.\" This file is part of FreeWnn.
.\"
.\" Copyright Kyoto University Research Institute for Mathematical Sciences
.\" 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright OMRON Corporation. 1987, 1988, 1989, 1990, 1991, 1992, 1999
.\" Copyright ASTEC, Inc. 1987, 1988, 1989, 1990, 1991, 1992
.\" Copyright FreeWnn Project 1999, 2000
.\"
.\" Maintainer: FreeWnn Project <freewnn@tomo.gr.jp>
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
.\"
.TH USR_DIC 5 "15 April 2001"
.SH NAME
.sv 1
.nf
.ta 0.1i 2i
USR_DIC Updatable Dictionaries
.fi
.SH SYNOPSIS
.sv 1
.nf
.ta 0.1i 3i
JSERVER_DIR/usr/user_name/ud
.fi
.SH DESCRIPTION
.HP 0
.IP
This is the updatable kana-kanji conversion dictionary used by
uum(1). By default it is created by uum at startup, but its location
and file name can be changed in uumrc(4). JSERVER_DIR is the value of
jserver_dir in jserverrc(4) (in Debian it is /var/lib/wnn/ja_JP/dic/).
.SH "SEE ALSO"
.sv 1
.nf
uum(1), jserver(1), wnnenvrc(4), jserverrc(4), dtoa(1), atod(1).
========================================================================
--
Francis Bond <bond@ieee.org>