tech-userlevel archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: Proposal: _ctype_ table bitwidth change

>>> 0xa0 as Unicode Code Point is not representable as unsigned char
>>> with UTF-8 encoding.
>> [...]
> There is no valid UTF-8 encoding of 0xA0 using a single octet.

Agreed.  Indeed, s/a single octet/other than two octets/.

> Period.  I haven't said anything else.

If that's what you meant, then I just misunderstood you.

This might be a not-too-inaccurate way to put it: I see a distinction
between an octet and a length-1 sequence of octets, just as I see a
distinction between codepoints and encodings of codepoints.

/~\ The ASCII                             Mouse
\ / Ribbon Campaign
 X  Against HTML      
/ \ Email!           7D C8 61 52 5D E7 2D 39  4E F1 31 3E E8 B3 27 4B

Home | Main Index | Thread Index | Old Index