Subject: Re: PR/33392 CVS commit: src/dist/nawk
To: None <gnats-admin@netbsd.org, netbsd-bugs@netbsd.org,>
From: Aleksey Cheusov <cheusov@tut.by>
List: netbsd-bugs
Date: 07/03/2006 10:40:02
The following reply was made to PR bin/33392; it has been noted by GNATS.

From: Aleksey Cheusov <cheusov@tut.by>
To: gnats-bugs@NetBSD.org
Cc: gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: PR/33392 CVS commit: src/dist/nawk
Date: Mon, 03 Jul 2006 13:41:08 +0300

 >  | >  I think int is too wide. I made it unsigned short.
 >  | 
 >  | "640k is anough for everyone" ;)
 >  | Seriuosly, I often use awk with very large regexps for my work.
 >  | AFAIR, according to theory NFA for regexp
 >  | that looks like (a|b)*(a|b)^n has equivalent DFA with 2^N states, so 65536
 >  | states of DFA may correspond to NFA with only 16 (!!!) terminal
 >  | symbols.  IMHO this kind of internal limits is bad. I read
 >  | NetBSD philosophy but reality is that hardware changes fast.
 >  | My 5 years old Athlon-800/384Mb RAM is capable of propressing
 >  | DFAs including more than 2^16 states.
 >  | So, I personally would prefer 'int' type for the states.
 >  
 >  I thought that this is limited by NCHARS+3. I will change it.
 
 Changes you commited to the HEAD related to this PR seems good to me,
 everything works correctly and much faster than gawk (for huge
 regexps) that i used for years.
 
 1) have you a plan to notify Brian about bug found?
 2) have you a plan to add an additional regression test for awk?
 
 >  | P.S.
 >  | I saw HEAD changes in awk code and was surprized that
 >  | lots of snprintf functions was changed to sprintf,
 >  | and strlcpy to strcpy. Is this really ok?
 >  
 >  They were not done carefully so bugs were introduced and we decided
 >  to back them out until someone does them carefully.
 >  
 >  | P.P.S
 >  | Where is nawk upstream? Who maintains that YYYYMMDD versions?
 >  
 >  from /usr/src/doc/3RDPARTY.
 >  Package:        nawk
 >  Version:        2005-04-24
 >  Current Vers:   2005-04-24
 >  Maintainer:     Brian Kernighan <bwk@bell-labs.com> (Lucent Technologies)
 >  Archive Site:   http://cm.bell-labs.com/who/bwk/
 >  Home Page:      http://cm.bell-labs.com/who/bwk/
 >  
 
 
 -- 
 Best regards, Aleksey Cheusov.