Subject: bin/24929: sed(1) has problems with non-ASCII chars
To: None <gnats-bugs@gnats.netbsd.org>
From: None <wiz@NetBSD.org>
List: netbsd-bugs
Date: 03/26/2004 22:15:19
>Number:         24929
>Category:       bin
>Synopsis:       sed(1) has problems with non-ASCII chars
>Confidential:   no
>Severity:       critical
>Priority:       medium
>Responsible:    bin-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Fri Mar 26 21:16:00 UTC 2004
>Closed-Date:
>Last-Modified:
>Originator:     Thomas Klausner
>Release:        NetBSD 1.6ZL/20040325
>Organization:
	
>Environment:
	
	
>Description:
sed's '.' (any character) does not match umlauts or other non-ASCII characters.
>How-To-Repeat:
Try to build emulators/wine.
In dlls/shell32, the build runs something similar to:
sed -E -e 's/(.*)/  \"\1\",/' < AUTHORS
where AUTHORS contains lines like:
Sami Aario
Karl Backström
Maxime Bellengé
Dennis Björklund

This results in:
  "Sami Aario",
  "Karl Backstr",öm
  "Maxime Belleng",é
  "Dennis Bj",örklund
instead of
  "Sami Aario",
  "Karl Backström",
  "Maxime Bellengé",
  "Dennis Björklund",
as intended; the malformed output causes a compilation failure later on.
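
The problem can probably be checked without building wine. The script below
is only a sketch under assumptions not stated above: it assumes AUTHORS is
encoded in ISO-8859-1 (so o-umlaut is byte 0xF6 and e-acute is byte 0xE9),
and the names quote_lines, tmp and result are made up for illustration. The
escaped quotes from the original command are dropped, since a plain '"' is
already literal inside the single-quoted sed script.

```shell
#!/bin/sh
# Reproduction sketch for the sed '.' problem described above.
# ASSUMPTION: AUTHORS is ISO-8859-1; \366 is o-umlaut, \351 is e-acute.

# Hypothetical helper: wrap every input line in quotes plus a trailing comma.
quote_lines() {
    sed -E -e 's/(.*)/  "\1",/'
}

# Scratch directory, removed again when the script exits.
tmp=$(mktemp -d) || exit 1
trap 'rm -rf "$tmp"' EXIT

# Sample input, written byte for byte with octal escapes.
printf 'Sami Aario\nKarl Backstr\366m\nMaxime Belleng\351\nDennis Bj\366rklund\n' \
    > "$tmp/AUTHORS"

# Output the wine build expects.
printf '  "Sami Aario",\n  "Karl Backstr\366m",\n  "Maxime Belleng\351",\n  "Dennis Bj\366rklund",\n' \
    > "$tmp/expected"

quote_lines < "$tmp/AUTHORS" > "$tmp/actual"

# Compare byte for byte; report the outcome instead of aborting.
if cmp -s "$tmp/expected" "$tmp/actual"; then
    result="ok"
else
    result="mismatch"
fi
echo "$result"
```

A "mismatch" here would indicate that '.' stopped at a non-ASCII byte as
shown above. The outcome may depend on the locale settings (LC_ALL, LC_CTYPE)
and on the sed implementation in use, so it is worth noting both when
reporting a result.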
>Fix:
Not provided, sorry.
>Release-Note:
>Audit-Trail:
>Unformatted: