Subject: Re: Japanese documents needed for testing
To: Masao Uebayashi <uebayasi@soum.co.jp>
From: Bang Jun-Young <bjy@mogua.org>
List: tech-kern
Date: 02/09/2001 13:09:19
Masao Uebayashi wrote:
> I appended some.

Thanks for your quick reply. :)

> 
> If you want to create/convert Japanese text yourself, using "Jcode"
> module with Perl is easy.

is Jcode part of the standard Perl distribution? Or, can it be found
in the package collection?

> 
> If you want ISO-2022-JP text, do
> 
>   % perl -MJcode -e 'my $j=Jcode->new(); while (<>) { print $j->set($_)->jis; }' <japanese.txt >japanese.txt.iso-2022-jp

I've been a little confused about what exactly JIS is. Is ISO-2022-JP 
current JIS? Is Unicode not? In Korea, we have two standards (KS), one is 
based on EUC and the other on Unicode (in fact, there's another standard
but nobody is using it nowadays).

> 
> If UTF-8,
> 
>   % perl -MJcode -e 'my $j=Jcode->new(); while (<>) { print $j->set($_)->utf8; }' <japanese.txt >japanese.txt.utf8
> 
> No idea for UTF-16.

UTF-16 won't be supported right now, since it seriously breaks 
compatibility with many of existing apps. It's just for testing purpose.

Jun-Young

--
Bang Jun-Young <bjy@mogua.org>