tech-userlevel archive


Re: Unicode programming



> One of the _other_ bugaboos with UTF-16 is that you need to keep
> track of the byte order and/or insert a BOM to disambiguate what kind
> of stream you're generating.

Strictly speaking, not really.  UTF-16 serializes a stream of Unicode
code points to a stream of 16-bit entities (code units).  There is no
byte order issue there.

Byte order is relevant only if you insist on converting between that
stream of 16-bit entities and a stream of 8-bit entities.  (This can
happen in various ways, most likely by using a machine whose addressing
quantum is 8 bits and reading the memory that holds the 16-bit entities
as a sequence of bytes instead.)
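
To make the distinction concrete, here is a minimal sketch in Python
(my choice of language; the helper names utf16_code_units and to_bytes
are made up for illustration).  The first function produces 16-bit
values only, and no byte order is involved; a byte order has to be
picked only in the second step, when those values are split into octets.

```python
def utf16_code_units(text):
    """Map code points to a list of 16-bit integers (UTF-16 code units).

    Nothing here depends on byte order: the result is a sequence of
    numbers in the range 0..0xFFFF, not a sequence of bytes.
    """
    units = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x10000:
            # BMP code point: one code unit, same value.
            units.append(cp)
        else:
            # Supplementary code point: split into a surrogate pair.
            cp -= 0x10000
            units.append(0xD800 | (cp >> 10))    # high surrogate
            units.append(0xDC00 | (cp & 0x3FF))  # low surrogate
    return units


def to_bytes(units, byteorder):
    """Split each 16-bit unit into two octets; only here does order matter."""
    return b"".join(u.to_bytes(2, byteorder) for u in units)


units = utf16_code_units("A\U0001F600")
big = to_bytes(units, "big")
little = to_bytes(units, "little")

print([hex(u) for u in units])
print(big.hex(" "))
print(little.hex(" "))
```

The list of code units is the same whichever byte order is used later;
only the two octet streams differ, and a BOM (or an out-of-band label
such as UTF-16BE/UTF-16LE) is needed only to tell those two apart.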

/~\ The ASCII                             Mouse
\ / Ribbon Campaign
 X  Against HTML                mouse%rodents-montreal.org@localhost
/ \ Email!           7D C8 61 52 5D E7 2D 39  4E F1 31 3E E8 B3 27 4B

