Re: preliminary version control requirements

To: "Perry E. Metzger" <perry%piermont.com@localhost>
Subject: Re: preliminary version control requirements
From: Gavan Fantom <gavan%coolfactor.org@localhost>
Date: Fri, 01 Aug 2008 00:19:41 +0100

Perry E. Metzger wrote:

Preserving the old version numbers is important because they're
recorded all over everywhere (PRs, mailing lists, etc.) and losing the
ability to correlate all those reports with the source tree would be a
problem.


I think there isn't a single VCS that will do that right now. If that
were really a requirement, we could never upgrade from CVS. I suspect
we're just going to have to manage old file version information by
keeping the CVS tree around for a while.

I don't see why you couldn't do that as custom metadata. For example,you could do this in monotone by adding certificates to provide themapping between old CVS version numbers and monotone versions.

Meanwhile, there are real reasons why hash codes are bad; they're hard
for humans to use compared to small integers, and they're not ordered
so you can't tell if the version you've got is newer or older than
some version someone's posting about wihout digging around.


The lack of ordering is certainly a problem. The length might not be a
problem in practice -- git and the other systems that use them seem to
allow shortest unambiguous prefix, and that's generally only five or
six characters.

I don't think that the lack of ordering is as big a problem as somepeople make out.

The main thing that we seem to use ordering for is to answer thequestion, "Does this version include fix X?". Serialising all theversions is certainly one way to make it easy to answer that question,but providing that the tools exist to make it easy to answer thatquestion, I don't see the problem.

Again taking monotone as an example, and there's probably a better wayto do this, you can do:


mtn automate ancestors `mtn automate get_base_revision_id` | grep <id>

to discover if that ID is in the ancestry of the currently checked outrevision. (this could of course be put into a wrapper script with a morefriendly name)

Regarding the length of the revision ID - despite monotone offering theability to use short versions of the IDs, I've never felt the need touse them. When using a revision ID, I am either refering to itsymbolically via a tag or a selector (such as, "the head of thebranch"), or from previous output on my screen from where I cantrivially cut&paste.

What the hash-as-a-version-ID scheme buys you is not having to serialisecommits at a central server. In particular, it buys you offline commits.It also buys you the ability to do your merges *after* commiting code,so that you have a record of your changes as you originally made them,and if you screw up your merge then at least you can roll back and startthe merge again.

I do not know of any VCS which allows true offline commits but doesn'thave nonsequential version IDs.

Complete offline operation makes the world of difference to anybodyworking without a decent net connection, such as people hacking on codewhile on planes. Having been in that position myself recently, I can'tstress the benefits of this enough.

 > >    - supports rcsids/keyword expansion

>> Most of them don't per se do this, in that there aren't version

 > numbers for individual files in any modern system so there aren't
 > expandable RCSID equivalents with a file's individual version number.

Why would it have to be a file's individual version number? It just
has to indicate what stuff a binary file was built with. But we do
need something that does that.


That particular requirement can be met. However, there is an implicit
requirement here that I want to mention is not going to be met.

If you wanted to put an identifier of the source file into the binary,you could do a lot worse than to ask the VCS for the version number ofthe tree you're building from. Most distributed VCSs should be able togive you the actual version number that would result if you were toimmediately commit, and also the base version which was checked out fromthe repository before any local (not checked in) changes were applied.

We could conceivably get rid of all the existing rcsid strings and
arrange something that didn't require keyword expansion directly in
files, but it would be a lot of work and a big hassle.


I wouldn't mind having to replace them across the whole tree with
different names, that would be fairly straightforward work. I agree
that having to set up a rube goldberg mechanism would be a big issue.

I don't see a huge amount of value in maintaining this information on aper-file basis if the VCS deals with versioning whole trees. If you wantto embed a version number, embed the version number of the whole tree.Let's get rid of per-file RCSIDs if we move away from per-file versionnumbers.

Follow-Ups:
- Re: preliminary version control requirements
  - From: Dieter Baron
- Re: preliminary version control requirements
  - From: Martin Husemann

References:
- preliminary version control requirements
  - From: David Holland
- Re: preliminary version control requirements
  - From: Perry E. Metzger
- Re: preliminary version control requirements
  - From: David Holland
- Re: preliminary version control requirements
  - From: Perry E. Metzger

Prev by Date: Re: Some preliminary cvs2git conversion statistics
Next by Date: Re: is the proof in the pudding?
Previous by Thread: Re: preliminary version control requirements
Next by Thread: Re: preliminary version control requirements
Indexes:

Home | Main Index | Thread Index | Old Index