Subject: lib/25795: iconv doesn't support all of CP932
To: None <gnats-bugs@gnats.NetBSD.org>
From: None <khym@azeotrope.org>
List: netbsd-bugs
Date: 06/02/2004 16:16:16
>Number:         25795
>Category:       lib
>Synopsis:       iconv doesn't support all of CP932
>Confidential:   no
>Severity:       non-critical
>Priority:       medium
>Responsible:    lib-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Wed Jun 02 21:42:00 UTC 2004
>Closed-Date:
>Last-Modified:
>Originator:     Dave Huang
>Release:        NetBSD 2.0_BETA
>Organization:
Name: Dave Huang         |  Mammal, mammal / their names are called /
INet: khym@azeotrope.org |  they raise a paw / the bat, the cat /
FurryMUCK: Dahan         |  dolphin and dog / koala bear and hog -- TMBG
Dahan: Hani G Y+C 28 Y++ L+++ W- C++ T++ A+ E+ S++ V++ F- Q+++ P+ B+ PA+ PL++
>Environment:
	
	
System: NetBSD yerfable.azeotrope.org 2.0_BETA NetBSD 2.0_BETA (YERFABLE) #206: Mon May 31 05:53:47 CDT 2004 khym@yerfable.azeotrope.org:/usr2/obj.alpha/sys/arch/alpha/compile/YERFABLE alpha
Architecture: alpha
Machine: alpha
>Description:
	iconv -l claims that Microsoft codepage 932 (cp932) is a
supported character set, but it doesn't support all of the characters,
such as the circled numbers and roman numerals:
http://www.microsoft.com/globaldev/reference/dbcs/932/932_87.htm

It looks like NetBSD has a mapping table for those characters in
src/share/i18n/csmapper/JIS/JISX0208VDC@NEC%UCS.src, but it's unused.

>How-To-Repeat:
% printf '\x87\x40' | iconv -f cp932 -t utf-16 | hexdump -C
iconv: iconv(): Illegal byte sequence

The same command on a MacOS X system (with GNU libiconv) produces:
00000000  fe ff 24 60                                       |..$`|
00000004

>Fix:
>Release-Note:
>Audit-Trail:
>Unformatted: