tech-repository archive


Re: The first step away from CVS



On 2010-01-12 05:14 +0000 (Tue), David Holland wrote:

> A good first step would be identifying all the commit messages that
> have non-ASCII text....

Just as a quick note on this: looking for characters with the 8th bit
set is not enough. For example, ISO-2022-JP is a 7-bit encoding. I'd say
that anything with control characters (c < ' ') other than tabs and
line endings is suspicious, though it's probably adequate to look just
for messages containing any of SI, SO and ESC.
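
As a rough illustration of that check, here is a minimal Python sketch.
It assumes each commit message is already available as raw bytes; the
function name classify_message and the reason labels are hypothetical,
and treating tab, LF and CR as harmless is my assumption rather than
anything the repository tooling dictates.

```python
# Control bytes assumed to be normal in plain-text commit messages.
ALLOWED_CONTROLS = {0x09, 0x0A, 0x0D}  # tab, LF, CR

# Bytes that ISO-2022 style encodings use for shifts and escape sequences.
ISO2022_MARKERS = {0x0E: "SO", 0x0F: "SI", 0x1B: "ESC"}


def classify_message(raw):
    """Return the set of reasons a raw commit message looks non-ASCII.

    An empty set means the message contains only printable ASCII plus
    the allowed control bytes (tab, LF, CR).
    """
    reasons = set()
    for b in raw:  # iterating over bytes yields integers
        if b >= 0x80:
            reasons.add("8-bit")            # high bit set
        elif b in ISO2022_MARKERS:
            reasons.add(ISO2022_MARKERS[b])  # SI, SO or ESC
        elif b < 0x20 and b not in ALLOWED_CONTROLS:
            reasons.add("control")           # any other control byte
    return reasons


if __name__ == "__main__":
    samples = [
        b"Fix typo in README\n",
        "caf\u00e9".encode("latin-1"),
        "\u65e5\u672c\u8a9e".encode("iso2022_jp"),
    ]
    for raw in samples:
        print(sorted(classify_message(raw)), raw)
```

The ISO-2022-JP sample is the interesting one: every byte in it is below
0x80, so an 8th-bit test alone would pass it, and only the ESC check
flags it.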

cjs
-- 
Curt Sampson         <cjs%cynic.net@localhost>         +81 90 7737 2974
             http://www.starling-software.com
The power of accurate observation is commonly called cynicism
by those who have not got it.    --George Bernard Shaw

