Part I Features Overview
This part introduces the features of the Traditional Chinese Solaris Operating System
(Solaris OS).
Chapter 1 Overview of Features
The Traditional Chinese Solaris Operating System (Solaris
OS) is the internationalization and localization of the Solaris Operating System and the
Common Desktop Environment (CDE) window system.
This chapter describes the new features and the language support that
are available in the release of the current Traditional Chinese release.
Note –
The information in this book applies to the Traditional Chinese used
in Taiwan and Hong Kong .
New Localized Features
Several new features
are available in the current Solaris release of the Traditional Chinese Operating System.
-
UDC (user defined character) support for all of the Traditional Chinese
locales, which include zh_TW.EUC, zh_TW), zh_TW.BIG5, and zh_TW.UTF-8 locales.
You can define new characters with the sdtudctool
utility. You can then enter the characters with the native codepoint input
method. Finally, you can use the mp command with the -u option to print the characters.
-
HKSCS-2001 support in the Solaris Operating System.
HKSCS-1999 is replaced by the HKSCS-2001 standard in the zh_HK.BIG5HK locale and in all the Chinese UTF-8 locales (zh_CN.UTF-8, zh_HK.UTF-8, and zh_TW.UTF-8).
Changes in the HKSCS-2001 standard include:
-
116 new characters.
-
ISO 10646 mappings for most of the HKSCS-1999 characters.
The mappings that are defined in the Private User Area (PUA) of ISO/IEC 10646-1:1993
have changed to the corresponding code points of Unicode Supplementary Ideograph
Plane in Unicode 3.2.
-
Extended CNS11643 support.
In the Solaris 9 release, the zh_TW.EUC/zh_TW locale supported only the first 3 planes of CNS 11643-1992.
In the current Solaris Operating System, all of the CNS 11643-1992 planes are supported.
Note –
No fonts are available in this release for the characters in the
new supported planes.
-
Unicode 3.2 support. The zh__TW.UTF—8
and the zh_HK.UTF-8 locales have been updated to support
the new 3.2 version of the Unicode Standard. The new version introduces an
additional 1,016 new characters and contains various normative and informative
changes.
Unicode 3.2 also defines the following UTF-8 byte sequences as invalid.
-
0xED as the first byte
-
0xA0 to 0xBF as the second byte
These sequences exclude the surrogate code points between U+D800 and
U+DFFF. To comply with the new definition, the UTF-8 iconv
modules have been enhanced to detect the newly defined UTF-8 invalid byte
sequences.
-
In the current Solaris release,
the mp printing utility replaces the xetops
and the xutops utilities.
Note –
The xetops and xutops printing
utilities are no longer supported in the Solaris Operating System. The utilities were formerly
used to convert Traditional Chinese text files to PostScript. The conversion
enabled the printing of Traditional Chinese characters to PostScript printers with
no resident Asian fonts. The xetops utility was used in
the zh_TW.EUC/zh_TW and zh_TW.BIG5 locales. The xutops utility was used in the zh_TW.UTF-8 and zh_TW.HK locales.
The mp printing utility was first released with the
Solaris 9 Operating System.
Language Support
The Solaris system builds inherent internationalization features into
every localized product. Localization facilities support the ANSI C recommendations
for internationalization and localization that define the locale and related
categories.
Locales Attributes
A locale contains the culturally specific information
and conventions of the language for a particular global region. Each process
in the Solaris Operating System has the following set of locale attributes:
-
Locale settings, which provide the locale
and setlocale commands you use to list and set attributes
before you start a process from the command line.
For example, the Traditional Chinese locales and the English/ASCII locale
both have a category that defines the display of time and date according to
the cultural format, as well as the actual Traditional Chinese or English/ASCII
characters for the time and date.
-
Code Sets, which support coding conventions for the CNS11643
and BIG5 character sets. These sets enable you to input, display, and print Traditional Chinese
text in file names, system messages, and terminal (TTY), email, and data file
content.
-
htt input method server, which handles Traditional Chinese
input for the Solaris Operating System. The htt server receives your
keyboard input and converts it to Traditional Chinese characters that are used
in Traditional Chinese applications.
Traditional Chinese Locales
The Traditional Chinese Solaris Operating System provides simultaneous support
for the locales in the following table . The locales look the same to the
end user, but the internal character encoding is different.
Table 1–1 Traditional Chinese Locales
|
Locale
|
Description
|
|
zh_TW.EUC (zh_TW)
|
Traditional Chinese EUC locale (CNS11643-1992)
|
|
zh_TW.BIG5
|
Traditional Chinese BIG5
|
|
zh_TW.UTF-8
|
Traditional Chinese UTF-8 (Unicode 3.2)
|
|
zh_HK.BIG5HK
|
Traditional Chinese (Hong Kong) BIG5-HKSCS
|
|
zh_HK.UTF-8
|
Traditional Chinese (Hong Kong) UTF-8 (Unicode 3.2)
|
Traditional Chinese Code Sets
The following table lists supported code sets for each Traditional Chinese
locale.
Table 1–2 Traditional Chinese Code Sets
|
Locale
|
Code Set
|
|
zh_TW.EUC (zh_TW)
|
cns11643
|
|
zh_TW.BIG5
|
BIG5
|
|
zh_TW.UTF-8
|
UTF-8
|
|
zh_HK.BIG5HK
|
BIG5-HKSCS
|
|
zh_HK.UTF-8
|
Unicode 3.2
|
Traditional Chinese Input Methods and Fonts
The Traditional Chinese Solaris Operating System provides input methods and
fonts that enable you to enter, display, and print any character in any language.
The following input methods are supported for the Traditional Chinese locales.
-
New ChuYin
-
ChuYin
-
TsangChieh
-
Array
-
Boshiamy
-
DaYi
-
JianYi
-
Cantonese
-
NeiMa (EUC, BIG5, BIG5-HKSCS)
-
English-Chinese
-
Optional codetable Input Methods, such as PinYin
For a complete list of fonts supported for the Traditional Chinese locales,
see Chapter 9, Fonts.
Input Method Auxiliary Window
The input
method auxiliary window supports the following functions:
Locale Categories
You can use the following general and specific categories as defined
by ANSI C for the Traditional Chinese and English locales.
-
General LC_ALL setting that invokes all
of the categories for locale-related aspects of the system.
-
Specific settings for particular aspects of the system, which
include:
-
LC_CTYPE
-
LC_TIME
-
LC_NUMERIC
-
LC_MONETARY
-
LC_COLLATE
-
LC_MESSAGES
For example, the Traditional Chinese and the English/ASCII locales have the LC_TIME category that defines the display of the time and date according
to the cultural format, as well as the actual Traditional Chinese or English/ASCII
characters used in the display.