GNU bug report logs -
#28038
multibyte: expand: expand(1) lacks MBC support
Previous Next
To reply to this bug, email your comments to 28038 AT debbugs.gnu.org.
Toggle the display of automated, internal messages from the tracker.
Report forwarded
to
bug-coreutils <at> gnu.org
:
bug#28038
; Package
coreutils
.
(Thu, 10 Aug 2017 16:11:01 GMT)
Full text and
rfc822 format available.
Acknowledgement sent
to
Tilman Schmidt <tschmidt <at> cardtech.de>
:
New bug report received and forwarded. Copy sent to
bug-coreutils <at> gnu.org
.
(Thu, 10 Aug 2017 16:11:01 GMT)
Full text and
rfc822 format available.
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
[Message part 1 (text/plain, inline)]
Hi,
it seems the expand(1) command does not properly support multi-byte
characters.
tschmidt <at> sl-vm-redmine01:~$ echo $LANG
de_DE.UTF-8
tschmidt <at> sl-vm-redmine01:~$ cat test.txt
Text ohne Umlaute
Täxt müt Umläuten
tschmidt <at> sl-vm-redmine01:~$ expand test.txt
Text ohne Umlaute
Täxt müt Umläuten
Using Ubuntu 14.04.5 LTS with coreutils 8.21-1ubuntu.
Regards,
Tilman
--
Tilman Schmidt
Teamleiter Systemadministration
Tel. 0221 / 95 64 95 . 417
Fax 0221 / 95 64 95 . 699
eMail tschmidt <at> cardtech.de
cardtech
Card & POS Service GmbH
Richard-Byrd-Straße 37
50829 Köln
www.cardtech.de
AG Köln, HRB 20164
Geschäftsführer: Dr. Dietrich Gottwald, Christof Kohns
[signature.asc (application/pgp-signature, attachment)]
Information forwarded
to
bug-coreutils <at> gnu.org
:
bug#28038
; Package
coreutils
.
(Sat, 12 Aug 2017 00:00:02 GMT)
Full text and
rfc822 format available.
Message #8 received at 28038 <at> debbugs.gnu.org (full text, mbox):
Hello Tilman,
On 10/08/17 10:10 AM, Tilman Schmidt wrote:
> it seems the expand(1) command does not properly support multi-byte
> characters.
That is correct.
> tschmidt <at> sl-vm-redmine01:~$ cat test.txt
> Text ohne Umlaute
> Täxt müt Umläuten
> tschmidt <at> sl-vm-redmine01:~$ expand test.txt
> Text ohne Umlaute
> Täxt müt Umläuten
>
> Using Ubuntu 14.04.5 LTS with coreutils 8.21-1ubuntu.
Multibyte support is not available yet (neither in version 8.21 which is
4 years old, nor in the current version 8.27).
However, there is an on-going effort to add multibyte support
to all coreutils programs, including 'expand'.
You can read more technical details about it here:
http://crashcourse.housegordon.org/coreutils-multibyte-support.html
In the current (work-in-progress) internationalization patch,
the 'expand' program does support multibyte locales, and expands
your input correctly:
multibyte locale:
$ ./src/expand bug28038.txt
Text ohne Umlaute
Täxt müt Umläuten
versus forcing single-byte locale:
$ LC_ALL=C ./src/expand bug28038.txt
Text ohne Umlaute
Täxt müt Umläuten
The latest version of the patch is available for download and
experimentation here:
http://lists.gnu.org/archive/html/coreutils/2017-04/msg00009.html
However it should not be considered stable.
regards,
- assaf
Severity set to 'wishlist' from 'normal'
Request was from
Assaf Gordon <assafgordon <at> gmail.com>
to
control <at> debbugs.gnu.org
.
(Tue, 30 Oct 2018 01:06:02 GMT)
Full text and
rfc822 format available.
Changed bug title to 'multibyte: expand: expand(1) lacks MBC support' from 'expand(1) lacks MBC support'
Request was from
Assaf Gordon <assafgordon <at> gmail.com>
to
control <at> debbugs.gnu.org
.
(Tue, 30 Oct 2018 01:06:02 GMT)
Full text and
rfc822 format available.
This bug report was last modified 5 years and 178 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.