GNU bug report logs - #28038
multibyte: expand: expand(1) lacks MBC support

Previous Next

Package: coreutils;

Reported by: Tilman Schmidt <tschmidt <at> cardtech.de>

Date: Thu, 10 Aug 2017 16:11:01 UTC

Severity: wishlist

To reply to this bug, email your comments to 28038 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#28038; Package coreutils. (Thu, 10 Aug 2017 16:11:01 GMT) Full text and rfc822 format available.

Acknowledgement sent to Tilman Schmidt <tschmidt <at> cardtech.de>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Thu, 10 Aug 2017 16:11:01 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Tilman Schmidt <tschmidt <at> cardtech.de>
To: bug-coreutils <at> gnu.org
Subject: expand(1) lacks MBC support
Date: Thu, 10 Aug 2017 18:10:02 +0200
[Message part 1 (text/plain, inline)]
Hi,

it seems the expand(1) command does not properly support multi-byte
characters.

tschmidt <at> sl-vm-redmine01:~$ echo $LANG
de_DE.UTF-8
tschmidt <at> sl-vm-redmine01:~$ cat test.txt
Text	ohne	Umlaute
Täxt	müt	Umläuten
tschmidt <at> sl-vm-redmine01:~$ expand test.txt
Text    ohne    Umlaute
Täxt   müt    Umläuten

Using Ubuntu 14.04.5 LTS with coreutils 8.21-1ubuntu.

Regards,
Tilman

-- 
Tilman Schmidt
Teamleiter Systemadministration

Tel. 0221 / 95 64 95 . 417
Fax  0221 / 95 64 95 . 699

eMail tschmidt <at> cardtech.de

cardtech
Card & POS Service GmbH
Richard-Byrd-Straße 37
50829 Köln
www.cardtech.de

AG Köln, HRB 20164
Geschäftsführer: Dr. Dietrich Gottwald, Christof Kohns

[signature.asc (application/pgp-signature, attachment)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#28038; Package coreutils. (Sat, 12 Aug 2017 00:00:02 GMT) Full text and rfc822 format available.

Message #8 received at 28038 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: Tilman Schmidt <tschmidt <at> cardtech.de>, 28038 <at> debbugs.gnu.org
Subject: Re: bug#28038: expand(1) lacks MBC support
Date: Fri, 11 Aug 2017 17:58:48 -0600
Hello Tilman,

On 10/08/17 10:10 AM, Tilman Schmidt wrote:
> it seems the expand(1) command does not properly support multi-byte
> characters.

That is correct.

> tschmidt <at> sl-vm-redmine01:~$ cat test.txt
> Text	ohne	Umlaute
> Täxt	müt	Umläuten
> tschmidt <at> sl-vm-redmine01:~$ expand test.txt
> Text    ohne    Umlaute
> Täxt   müt    Umläuten
> 
> Using Ubuntu 14.04.5 LTS with coreutils 8.21-1ubuntu.

Multibyte support is not available yet (neither in version 8.21 which is
4 years old, nor in the current version 8.27).

However, there is an on-going effort to add multibyte support
to all coreutils programs, including 'expand'.

You can read more technical details about it here:
  http://crashcourse.housegordon.org/coreutils-multibyte-support.html

In the current (work-in-progress) internationalization patch,
the 'expand' program does support multibyte locales, and expands
your input correctly:

multibyte locale:

   $ ./src/expand bug28038.txt
   Text    ohne    Umlaute
   Täxt    müt     Umläuten

versus forcing single-byte locale:

   $ LC_ALL=C ./src/expand bug28038.txt
   Text    ohne    Umlaute
   Täxt   müt    Umläuten


The latest version of the patch is available for download and
experimentation here:
  http://lists.gnu.org/archive/html/coreutils/2017-04/msg00009.html
However it should not be considered stable.

regards,
 - assaf






Severity set to 'wishlist' from 'normal' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Tue, 30 Oct 2018 01:06:02 GMT) Full text and rfc822 format available.

Changed bug title to 'multibyte: expand: expand(1) lacks MBC support' from 'expand(1) lacks MBC support' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Tue, 30 Oct 2018 01:06:02 GMT) Full text and rfc822 format available.

This bug report was last modified 5 years and 178 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.