tech-userlevel archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: UTF8 (Was: [PATCH] replace 0xA0 to whitespace in plain text files (part 2))



der Mouse wrote:-

> Even things like the C compiler don't really work with characters.  The
> text "if" in a source file is not really "character i" "character f";
> rather, it's "octet 0x69" "octet 0x66" (well, typically - that's
> assuming ASCII was used when the compiler was built) - as it has to be,
> since there is no way to declare what character set the input uses.

This isn't true; the C standard says it is determined by the current
locale.  If you have a UTF-8 locale, your C compiler should expect
all source to be UTF-8.

I've written a C front end that does this just fine.

Neil.


Home | Main Index | Thread Index | Old Index