Re: The first step away from CVS

To: tech-repository%NetBSD.org@localhost
Subject: Re: The first step away from CVS
From: Lloyd Parkes <Lloyd.Parkes%ird.govt.nz@localhost>
Date: Mon, 11 Jan 2010 08:46:49 +1300

Joerg Sonnenberger wrote:

On Thu, Jan 07, 2010 at 08:44:58AM +1300, Lloyd Parkes wrote:

I've been having a look at Git and Mercurial and I've worked out
that the first thing we need to do (regardless of what we do second)
is to convert all the CVS commit messages to UTF-8.


The far majority of all commit messages are either ASCII or Latin1.

Certainly the majority of commit messages are US-ASCII, but I expect that theremainder contain a good variety of character sets. Our Japanese colleagues canbe quite prolific. The bulk of each message is US-ASCII, but the committers seemto write their names using their local character sets from time to time.

Ignoring the few remaining cases is not that problematic.

Ignored? How? Some versioning systems require that the character set beidentified and the two character sets you just mentioned do not cover the whole8 bit range, so not only would we get some things incorrectly encoded asLatin-1, but we will also get commit messages that cannot be encoded (values0x7f to 9f are not valid Latin-1 characters).

IMHO the versioning systems that do require that the character set be identifiedare better designed and less hackish than the others.

I have spent maybe 15 years running IMAP servers, and I used to run somemoderately large ones and this problem came up there from time to time beforeMIME became endemic. In my experience there is no substitute for gettingcharacter sets right from the beginning and I think we now have an opportunityto sort this out while we still have a repository that is amenable to dirty tricks.


Cheers,
Lloyd

Follow-Ups:
- Re: The first step away from CVS
  - From: S.P.Zeidler

References:
- The first step away from CVS
  - From: Lloyd Parkes
- Re: The first step away from CVS
  - From: Joerg Sonnenberger

Prev by Date: Re: git copies of cvs modules available
Next by Date: Re: git copies of cvs modules available
Previous by Thread: Re: The first step away from CVS
Next by Thread: Re: The first step away from CVS
Indexes:

Home | Main Index | Thread Index | Old Index