NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: bin/57616: sed(1) is unable to process multibyte unicode characters properly



The following reply was made to PR bin/57616; it has been noted by GNATS.

From: mlelstv%serpens.de@localhost (Michael van Elst)
To: gnats-bugs%netbsd.org@localhost
Cc: 
Subject: Re: bin/57616: sed(1) is unable to process multibyte unicode characters properly
Date: Mon, 11 Sep 2023 15:03:24 -0000 (UTC)

 marc.fege%uni-bonn.de@localhost writes:
 
 >NetBSD rpi 9.3 NetBSD 9.3 (RPI) #0: Thu Aug  4 15:30:37 UTC 2022  mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/evbarm/compile/RPI evbarm
 
 >sed(1) has a problem processing multibyte unicode characters properly.
 
 >     echo "abcÄÖÜxyz" | sed 's/./& /g'
 >I expect the following output format for further processing:
 >     "a b c Ä Ö Ü x y z "
 
 
 It's not actually about sed failing but what the underlying regexp
 library can do.
 
 Wide char support ("NLS") from FreeBSD was integrated in 2021 and
 will be in NetBSD-10.
 


Home | Main Index | Thread Index | Old Index