On Thu, Feb 28, 2019 at 7:39 PM Marc Nieper-Wißkirchen <xxxxxx@nieper-wisskirchen.de> wrote:
Am Do., 28. Feb. 2019 um 12:04 Uhr schrieb Alex Shinn <xxxxxx@gmail.com>:
>
> That doesn't even make sense - Cyrillic is Cyrillic, whatever the encoding is.
> It's a property of the terminal whether or not Cyrillic characters are displayed single-width or double.

Unicode Annex #11 refers to the character encoding when it comes to
deciding the width of ambiguous characters; see definition ED6 here:
https://unicode.org/reports/tr11/.

I looked into this, it appears to be a historic accident.

ASCII characters got special treatment in JIS, but as additional scripts
were added (Cyrillic and Greek), they were just treated as additional
CJK chars with full-width glyphs.  In Japan both of these scripts are
primarily used for emoji, however among the few users who actually
want to read and write Cyrillic on Japanese terminals and devices
(myself included) it seems universally considered a misfeature.

The TR may have been trying to reflect the common use of kterm
(which I verified still has that abominable behavior) but I doubt this
is a majority anymore.  Considering the encoding of the terminal
seems a total hack.  It might be a reasonable proxy given the
predominance of Unicode aware terminals and the fact that anyone
explicitly using an euc-jp environment is more likely to be using
kterm or similar, but I'd rather have a generic state variable.

--
Alex