* setup2.sgml (setup-locale-ov): Document CJK ambiguous width change

for non-Unicode charsets.
	* new-features.sgml (ov-new1.7.8): Mention CJK ambiguous width change.
	Drop redundant NT4 pre-SP4 support entry.
This commit is contained in:
Corinna Vinschen 2010-11-18 11:05:46 +00:00
parent 617dc68bfe
commit 1570248800
3 changed files with 21 additions and 12 deletions

View File

@ -1,3 +1,10 @@
2010-10-18 Andy Koppe <andy.koppe@gmail.com>
* setup2.sgml (setup-locale-ov): Document CJK ambiguous width change
for non-Unicode charsets.
* new-features.sgml (ov-new1.7.8): Mention CJK ambiguous width change.
Drop redundant NT4 pre-SP4 support entry.
2010-10-09 Corinna Vinschen <corinna@vinschen.de>
* new-features.sgml (ov-new1.7.8): Document reinstantiation of the

View File

@ -35,7 +35,10 @@ Drop support for Windows NT4 prior to Service Pack 4.
</para></listitem>
<listitem><para>
Drop support for Windows NT4 prior to Service Pack 4.
Fix the width of "CJK Ambiguous Width" characters to 1 for singlebyte charsets
and 2 for East Asian multibyte charsets. (For UTF-8, it remains dependent on
the specified language, and the "@cjknarrow" locale modifier can still be used
to force width 1.)
</para></listitem>
</itemizedlist>

View File

@ -353,17 +353,16 @@ With the "@cyrillic" modifier it's UTF-8.
<listitem><para>
There's a class of characters in the Unicode character set, called the
"CJK Ambiguous Width Character set". For these characters the width
returned by the wcwidth/wcswidth function is usually 1. This is often a
problem in East-Asian languages, which historically use character sets
in which these characters have a width of 2. By default, the
wcwidth/wcswidth functions return 1 as the width of these characters,
except if the language is specifed as "ja" (Japanese), "ko" (Korean), or
"zh" (Chinese). In these languages wcwidth and wcswidth return 2 for
these characters. This is not correct in all circumstances, so the user
of one of these languages can specify the modifier "@cjknarrow", which
modifies the behaviour of wcwidth/wcswidth to return 1 for the ambiguous
width characters.
"CJK Ambiguous Width" characters. For these characters, the width
returned by the wcwidth/wcswidth functions is usually 1. This can be a
problem with East-Asian languages, which historically use character sets
where these characters have a width of 2. Therefore, wcwidth/wcswidth
return 2 as the width of these characters when an East-Asian charset such
as GBK or SJIS is selected, or when UTF-8 is selected and the language is
specified as "zh" (Chinese), "ja" (Japanese), or "ko" (Korean). This is
not correct in all circumstances, hence the locale modifier "@cjknarrow"
can be used to force wcwidth/wcswidth to return 1 for the ambiguous width
characters.
</para></listitem>
</itemizedlist>