tech-userlevel archive


Re: A draft for a multibyte and multi-codepoint C string interface



On Wed, Apr 03, 2013 at 09:06:11AM -0400, Mouse wrote:
> > Non-NUL UTF8 sequences can contain bytes with value 0,
> 
> How?  As far as I can see, the only way to get a 0 octet into a
> UTF-8-encoded string is to encode Unicode codepoint 0.  RFC3629 seems
> to think so too:

Not at all.  I misunderstood something long ago, and reading the RFC
has made it clear to me that I was wrong.
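
For anyone who wants to check this mechanically, here is a minimal sketch
of an RFC 3629 style encoder in C. The names utf8_encode and
utf8_has_zero_octet are hypothetical, made up for this illustration, and
are not part of the draft interface under discussion. Every lead byte of a
multi-byte sequence has at least 0xC0 set and every continuation byte has
0x80 set, so the only code point whose encoding contains a 0 octet is
U+0000.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Encode one code point as UTF-8 (RFC 3629).
 * Returns the number of bytes written (1..4), or 0 if cp is a
 * surrogate (U+D800..U+DFFF) or lies above U+10FFFF.
 * Hypothetical helper, for illustration only.
 */
static size_t utf8_encode(uint32_t cp, unsigned char out[4])
{
    if (cp < 0x80) {
        /* One byte: 0xxxxxxx. This is the only form that can be 0. */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {
        /* Two bytes: 110xxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0;   /* surrogates are not encodable */
        /* Three bytes: 1110xxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {
        /* Four bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;
}

/*
 * Returns 1 if the UTF-8 encoding of cp contains a 0 octet, else 0.
 * Unencodable values produce no bytes and therefore return 0.
 */
static int utf8_has_zero_octet(uint32_t cp)
{
    unsigned char buf[4];
    size_t n = utf8_encode(cp, buf);

    for (size_t i = 0; i < n; i++)
        if (buf[i] == 0)
            return 1;
    return 0;
}
```

Looping utf8_has_zero_octet over 0 through 0x10FFFF should report a 0
octet for code point 0 only, which is what Mouse describes above.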

Thor

