Subject: CVS commit: pkgsrc/net/wget
To: None <pkgsrc-changes@netbsd.org>
From: Thomas Klausner <wiz@netbsd.org>
List: pkgsrc-changes
Date: 12/16/2002 13:39:02
Module Name:	pkgsrc
Committed By:	wiz
Date:		Mon Dec 16 11:39:02 UTC 2002

Modified Files:
	pkgsrc/net/wget: Makefile PLIST distinfo
	pkgsrc/net/wget/patches: patch-ad patch-ae patch-af patch-ag
Added Files:
	pkgsrc/net/wget/patches: patch-ac

Log Message:
Update to 1.8.2, with the recently discovered security problem
already fixed (patches from RedHat RPM).

Note: this package does not support IPv6 any longer, since no
updated IPv6 patch for wget-1.8.2 exists (to my knowledge).
The next major wget release is expected to contain IPv6 support.

Uservisible changes since 1.7:
** A new progress indicator is now available and used by default.
You can choose the progress bar type with `--progress=TYPE'.  Two
types are available, "bar" (the new default), and "dot" (the old
dotted indicator).  You can permanently revert to the old progress
indicator by putting `progress = dot' in your `.wgetrc'.

** You can limit the download rate of the retrieval using the
`--limit-rate' option.  For example, `wget --limit-rate=15k URL' will
tell Wget not to download the body of the URL faster than 15 kilobytes
per second.

** Recursive retrieval and link conversion have been revamped:

*** Wget now traverses links breadth-first.  This makes the
calculation of depth much more reliable than before.  Also, recursive
downloads are faster and consume *significantly* less memory than
before.

*** Links are converted only when the entire retrieval is complete.
This is the only safe thing to do, as only then is it known what URLs
have been downloaded.

*** BASE tags are handled correctly when converting links.  Since Wget
already resolves <base href="..."> when resolving handling URLs, link
conversion now makes the BASE tags point to an empty string.

*** HTML anchors are now handled correctly.  Links to an anchor in the
same document (<a href="#anchorname">), which used to confuse Wget,
are now converted correctly.

*** When in page-requisites (-p) mode, no-parent (-np) is ignored when
retrieving for inline images, stylesheets, and other documents needed
to display the page.

*** Page-requisites (-p) mode now works with frames.  In other words,
`wget -p URL-THAT-USES-FRAMES' will now download the frame HTML files,
and all the files that they need to be displayed properly.

** `--base' now works conjunction with `--input-file', providing a
base for each URL and thereby allowing the URLs in the file to be
relative.

** If a host has more than one IP address, Wget uses the other
addresses when accessing the first one fails.

** Host directories now contain port information if the URL is at a
non-standard port.

** Wget now supports the robots.txt directives specified in
<http://www.robotstxt.org/wc/norobots-rfc.txt>.

** URL parser has been fixed, especially the infamous overzealous
quoting.  Wget no longer dequotes reserved characters, e.g. `%3F' is
no longer translated to `?', nor `%2B' to `+'.  Unsafe characters
which are not reserved are still escaped, of course.

** No more than 20 successive redirections are allowed.


To generate a diff of this commit:
cvs rdiff -r1.53 -r1.54 pkgsrc/net/wget/Makefile
cvs rdiff -r1.5 -r1.6 pkgsrc/net/wget/PLIST
cvs rdiff -r1.14 -r1.15 pkgsrc/net/wget/distinfo
cvs rdiff -r0 -r1.5 pkgsrc/net/wget/patches/patch-ac
cvs rdiff -r1.5 -r1.6 pkgsrc/net/wget/patches/patch-ad \
    pkgsrc/net/wget/patches/patch-ae
cvs rdiff -r1.3 -r1.4 pkgsrc/net/wget/patches/patch-af \
    pkgsrc/net/wget/patches/patch-ag

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.