Re: using the interfaces in ctype.h

To: der Mouse <mouse%Rodents.Montreal.QC.CA@localhost>
Subject: Re: using the interfaces in ctype.h
From: Terry Moore <tmm%mcci.com@localhost>
Date: Mon, 21 Apr 2008 01:15:49 -0400

At 10:18 PM 4/20/2008 -0400, der Mouse wrote:

> There are two cases one has to consider for use of isspace() etc.

> 1) if x is an int (wider than a char), and is the result of getc(),
> then x will be in the range [-1, UCHAR_MAX].

No; x will be in the range [0..UCHAR_MAX] or be EOF, which latter
happens to be -1 in our implementation but may be different.  (Unless
you were speaking from a strictly NetBSD perspective, rather than a
correct-use-of-ctype perspective.  Your mention of 1's-complement
machines makes me think not.)

Thank you for pointing that out. I apologize. I had overlooked thatEOF is a negative integer, but is not required to be -1.

However, I think that it's still true to say that if x is EOF,isprint(x & UCHAR_MAX) will not (generally) be the same asisprint(x), even though isprint(x & UCHAR_MAX) is always valid. Thiswas my point. (My point was not that (x & UCHAR_MAX) has anyparticular value.)

I am (per C99) assuming that UCHAR_MAX is one less than a power oftwo, so that x & UCHAR_MAX is valid and equivalent to (x % (UCHAR_MAX+1)).


So both cases still apply, I think.

> The phrase
>          .. if (isspace((unsigned char) buf[0])) ...
> won't work if isspace() is in-line and there's not enough casting in
> the macro.

I can't see how it could fail.  Could you give an example?

It will fail by generating the warning which prevents compilationwith -Werror on some machines. See Greg's other messages -- that'swhat started this discussion. (Apparently there are some compilersthat complain about indexing using (unsigned chars) -- probably thosemachines on which char is identical to unsigned char, but I'm guessing.)

> I'm running 3.1, so I may have the wrong header files; but this would
> imply that (for example) isspace() should change from
>    ((int)((_ctype_ + 1)[(c)] & _S)
> to
>    ((int)((_ctype_ + 1)[(int)(c)] & _S)

I think this would be a very bad idea.  The existing code draws
warnings from some compiler versions about "array subscript has type
char", which let a coder catch such sloppy code; while this doesn't
apply to 3.1's compiler in my experience, doing it for 3.1 leads to the
idea of doing it for later versions, for which it *does* matter.

It happens for 3.1 and gcc for x86. My point was that I don't havethe 4.0 or more recent header files to hand. My point also was thatthis makes the <ctypeh.h> is...() macros formally at leastinconsistent with the C99 definitions (which require int, and forwhich a (char) argument will silently be widened).

I think we can agree (by looking at C99) that the standard definitionof isspace() is 'int isspace(int)'. NetBSD's definition of macros isconvenient, but is not mandated by the standard (in fact, thestandard does not give special discussion to any of the <ctype.h>functions if implemented as macros).

I can't find a place where C99 requires that any implementation of afunction-like macro for a library function be "warning-equivalent" tocalling the library function. In other words, C99 does not requirethat isspace(x) be "warning-equivalent" to (isspace)(x). But Ihappen to think that it's in the spirit of the specification forisspace(x), even though I agree that doing so may beinconvenient. However, it's more portable, because (isspace)(x) isnot likely to give a warning -- and if it does, the warning will bemuch more like what Coverity might give, e.g. "x is not in {EOF,0..UCHAR_MAX}", rather than the rather inscrutable gcc message.

As far as I can tell, ultimately it comes down to an implementationchoice, as C99 does not give clear guidance.

--Terry

Follow-Ups:
- Re: using the interfaces in ctype.h
  - From: der Mouse

References:
- Re: using the interfaces in ctype.h
  - From: Christos Zoulas
- Re: using the interfaces in ctype.h
  - From: Greg A. Woods; Planix, Inc.
- Re: using the interfaces in ctype.h
  - From: Terry Moore
- Re: using the interfaces in ctype.h
  - From: Greg A. Woods; Planix, Inc.
- Re: using the interfaces in ctype.h
  - From: Terry Moore
- Re: using the interfaces in ctype.h
  - From: der Mouse

Prev by Date: Re: constification ?
Next by Date: Re: using the interfaces in ctype.h
Previous by Thread: Re: using the interfaces in ctype.h
Next by Thread: Re: using the interfaces in ctype.h
Indexes:

Home | Main Index | Thread Index | Old Index