Subject: Re: WWW query engine bug (was Query-PR)
To: der Mouse <mouse@Collatz.McRCIM.McGill.EDU>
From: None <Chris_G_Demetriou@NIAGARA.NECTAR.CS.CMU.EDU>
List: current-users
Date: 02/21/1996 17:06:29
> > 	(3) if the user does a 'long-range' <html>, perhaps one which
> > 	    is never closed, how does the scanner deal with that?  some
> > 	    of the PRs are gigantic, and i think it's unreasonable to
> > 	    have to have it parse them completely before it processes
> > 	    any of them.
> 
> I don't see why there's any need to.  Your scanner just has to keep a
> bit saying whether it's inside an unclosed <HTML>...</HTML>, and if
> it's not, just do mindless mapping of < to &lt;, etc.

But how do you check whether the <html> is ever closed without parsing
the entire file?  (e.g. do you say that an <html>...</html> span may
only be 1, 2, or N lines long?)

if the scanner has to check for the close before it emits anything, it
becomes much harder to implement the program as a filter...
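
For concreteness, here is a minimal sketch (in Python) of the one-bit
scanner described in the quoted text.  It is not the actual query
engine code.  The name escape_filter is made up for illustration, and
the sketch assumes that the markers are literally <html> and </html>
(matched case-insensitively) and that the markers themselves are
dropped from the output.

```python
import html
import re

# Assumed marker syntax; the real PR markup may differ.
OPEN = re.compile(r"<html>", re.IGNORECASE)
CLOSE = re.compile(r"</html>", re.IGNORECASE)


def escape_filter(lines):
    """Yield each input line with &, <, > escaped, except for text
    that falls between an <html> marker and the next </html>."""
    inside = False  # the single bit of state carried across lines
    for line in lines:
        out = []
        pos = 0
        while pos < len(line):
            if inside:
                m = CLOSE.search(line, pos)
                if m is None:
                    # No close on this line: pass the rest through raw.
                    out.append(line[pos:])
                    pos = len(line)
                else:
                    out.append(line[pos:m.start()])
                    pos = m.end()  # skip the marker itself
                    inside = False
            else:
                m = OPEN.search(line, pos)
                if m is None:
                    # Mindless mapping of < to &lt;, etc.
                    out.append(html.escape(line[pos:], quote=False))
                    pos = len(line)
                else:
                    out.append(html.escape(line[pos:m.start()], quote=False))
                    pos = m.end()  # skip the marker itself
                    inside = True
        yield "".join(out)


if __name__ == "__main__":
    import sys
    # Works as a plain stdin-to-stdout filter, one line at a time.
    sys.stdout.writelines(escape_filter(sys.stdin))
```

This never looks ahead, so it runs as a filter with constant memory.
The cost is what happens with an <html> that is never closed: the bit
never clears, and everything up to the end of the input goes out raw.
Rejecting such spans, or limiting them to N lines, would mean holding
output back until the close shows up, which is the part that gets in
the way of a simple filter.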


cgd