Solaris Internationalization Guide For Developers
  Cerca solo questo libro
Scarica il manuale in formato PDF
CHAPTER 3

Contents of the Localized Solaris 2.6 Products



The European Localized Solaris 2.6 Product

European Solaris is available in three localized versions: French, German, and European. All three versions of Solaris share the same software media, which includes a fully localized CDE environment, error messages, and online documentation in six languages--French, German, Spanish, Swedish, Italian, and English. The difference is in the printed documentation. The French and German Solaris include localized printed documentation, while the printed documentation for the European version is in English only.
TABLE 3-1 shows a list of locales in the European product. This includes both full and partial locales.
TABLE 3-1
Locale NameLanguage/Territory
CPOSIX English (7 bit)
czCzech Republic
daDenmark
deGermany
de_ATAustria
de_CHSwitzerland
TABLE 3-1 (Continued)
Locale NameLanguage/Territory
elGreece
en_AUAustralia
en_CACanada
en_IEIreland
en_NZNew Zealand
en_UKUnited Kingdom
en_USU.S.A.
esSpanish
es_ARArgentina
es_BOBolivia
es_CLChile
es_COColombia
es_CRCosta Rica
es_ECEcuador
es_GTGuatemala
es_MXMexico
es_NINicaragua
es_PAPanama
es_PEPeru
es_PYParaguay
es_SVEl Salvador
es_UYUruguay
es_VEVenezuela
etEstonia
frFrance
fr_BEBelgium (French)
fr_CACanada (French)
fr_CHSwitzerland (French)
huHungary
itItaly
TABLE 3-1 (Continued)
Locale NameLanguage/Territory
ltLithuania
lvLatvia
nlNetherlands
nl_BENetherlands/Belgium
noNorway
plPoland
ptPortugal
pt_BRPortuguese Brazil
ruRussian
suFinland
svSweden
All of these locales are also present in the base Solaris 2.6 release. However, only the European product contains the localized messages.
As mentioned, the locales include partial locales. These are based on core locales for the main language. For example, the fr_CA (French Canadian) is based on the fr (French) locale. These partial locales utilize the messages that are delivered into its parent locale (French for fr_CA). If a locale hasn't been fully localized, then it may contain only English messages.
A number of Eastern European locales have also been added into Solaris 2.6. Previously Sun locales were based on ISO-8859-1. The Eastern European locales are based on other ISO standards, as shown in TABLE 3-2.
Locales that are not listed are still based on ISO-8859-1.
TABLE 3-2
Locale NameLanguage/TerritoryISO
de_ATGerman (Austrian)8859-1
etEstonian8859-1
czCzech8859-2
huHungarian8859-2
plPolish8859-2
lvLatvian8859-4
TABLE 3-2 (Continued)
Locale NameLanguage/TerritoryISO
ltLithuanian8859-4
ruRussian8859-5
elGreek8859-7
trTurkish8859-9
All of the locales support character input and output. There is also iconv support for many of the major codesets. (For more on iconv, see the man pages.) The iconv modules are available on the end-user cluster of the Euro product and on the entire cluster of other products, including the base product. See TABLE 3-3 for details.
TABLE 3-3 iconv
CodeSymbolTarget CodeSymbolComment
ISO 8859-2iso2MS 1250win2Windows Latin 2
ISO 8859-2iso2MS 852dos2MS-DOS Latin 2
ISO 8859-2iso2MazoviamazMazovia
ISO 8859-2iso2DHNdhnDom Handlowy Nauki
MS 1250win2ISO 8859-2iso2ISO Latin 2
MS 1250win2MS 852dos2MS-DOS Latin 2
MS 1250win2MazoviamazMazovia
MS 1250win2DHNdhnDom Handlowy Naduki
MS 852dos2ISO 8859-2iso2ISO Latin 2
MS 852dos2MS 1250win2Windows Latin 2
MS 852dos2MazoviamazMazovia
MS 852dos2DHNdhnDom Handlowy Nauki
MazoviamazISO 8859-2iso2ISO Latin 2
MazoviamazMS 1250win2Windows Latin 2
MazoviamazMS 852dos2MS-DOS Latin 2
MazoviamazDHNdhnDom Handlowy Nauki
DHNdhnISO 8859-2iso2ISO Latin 2
DHNdhnMS 1250win2Windows Latin 2
DHNdhnMS 852dos2MS-DOS latin 2
DHNdhnMazoviamazMazovia
TABLE 3-3 iconv(Continued)
CodeSymbolTarget CodeSymbolComment
ISO 8859-5iso5KOI8-Rkoi8KOI8-R
ISO 8859-5iso5PC CyrillicaltAlternative PC Cyrillic
ISO 8859-5iso5MS 1251win5Window Cyrillic
ISO 8859-5iso5Mac CyrillicmacMacintosh Cyrillic
KOI8-Rkoi8ISO 8859-5iso5ISO 8859-5 Cyrillic
KOI8-Rkoi8PC CyrillicaltAlternative PC Cyrillic
KOI8-Rkoi8MS 1251win5Windows Cyrillic
KOI8-Rkoi8Mac CyrillicmacMacintosh Cyrillic
PC CyrillicaltISO 8859-5iso5ISO 8859-5 Cyrillic
PC CyrillicaltKOI8-Rkoi8KOI8-R
PC CyrillicaltMS 1251win5Windows Cyrillic
PC CyrillicaltMac CyrillicmacMacintosh Cyrillic
MS 1251win5ISO 8859-5iso5ISO 8859-5 Cyrillic
MS 1251win5KOI8-Rkoi8KOI8-R
MS 1251win5PC CyrillicaltAlternative PC Cyrillic
MS 1251win5Mac CyrillicmacMacintosh Cyrillic
Mac CyrillicmacISO 8859-5iso5ISO 8859-5 Cyrillic
Mac CyrillicmacKOI8-Rkoi8KOI8-R
Mac CyrillicmacPC CyrillicaltAlternative PC Cyrillic
Mac CyrillicmacMS 1251win5Windows Cyrillic

Font Formats

There are many different font formats. The extension lets you determine the font type.
  • PostScript Type 1 Fonts PostScript Type 1 fonts, which are also known as Adobe Type Manager (ATM) fonts, Type 1, and outline fonts, contains information in outline form that allows a PostScript printer or ATM to generate fonts of any size. Most of these fonts also contain hinting information which allows fonts to be rendered more readable at a low resolution or a small type size.
  • Bitmap Fonts

    Bitmap fonts contain a picture of the font at a specific size that has been optimized to look good at that specific size. If the font is scaled larger or smaller, the quality may degrade. On the other hand, bitmap fonts display quickly.

Location of Fonts on the System

Fonts are located at:
/usr/openwin/lib/locale/iso_8859_x/X11/fonts/X11/Type1/afm

or
/usr/openwin/lib/locale/iso_8859_x/X11/fonts/X11/75dpi

Adding and Removing Font Packages

To manually add font packages to the system:
  1. Always add the required font packages before the optional font packages.

  2. When you are removing font packages from the system, remove the optional font packages first.

You must follow this procedure in adding or removing fonts. The class action scripts in the font packages depend on this for proper function. The new optional font packages contain scripts that concatenate information onto the required font packages that are already resident on the system. If the required font packages are not there, problems may occur.

The Asian Localized Solaris 2.6 Products

In contrast to the European locales, which are all packaged on one CD, there are four separate Asian products on separate CDs: Japanese, Korean, Traditional Chinese, and Simplified Chinese.
The following table shows the Asian locales supported by these Asian products.
TABLE 3-4
CD SetLocale NameDescriptionSupported Character Set
Koreanko
ko.UTF-8
Korean
Korean (UTF-8 locale)
KS C 5601-1992
KS C 5700-1995
Simplified ChinesezhSimplified Chinese (Mainland China)GB 2312-1980
Traditional
Chinese
zh_TW
zh_TW.BIG5
Traditional Chinese (Taiwan)
Traditional Chinese (BIG5 locale)
CNS 11643
BIG5
Japaneseja
ja_JP.PCK
JapaneseJIS x 0201-1976
JIS x 0208-1990
JIS x 0212-1990

Korean

In December 1995, the Korean government announced a new standard Korean codeset, KSC-5700, which is based on ISO-10646-1/Unicode 2.0. The new standard codeset replaces KSC 5601, which was based on ISO-2022.
The ISO-10646 character set uses 2 (UCS-2; Universal Character Set two-byte form) or 4 (UCS-4) bytes to represent each character.
The ISO-10646 character set cannot be used directly on IBM-PC-based operating systems. For example, the kernel and many other modules of the Solaris operating environment interpret certain byte values as control instructions, such as a null character (0x00) in any string. The ISO-10646 character set can be encoded with any bit combinations in the first or subsequent bytes. The ISO-10646 characters cannot be freely transmitted through the Solaris system with the above limitations. In order to establish a migration path, the ISO-10646 character set defines the UCS Transformation Format (UTF), which recodes the ISO-10646 characters without using C0 controls (0x00..0x1F), C1 controls (0x80..0x9F), space (0x20), and DEL (0x7F).
The ko.UTF-8 is a new Solaris locale to support KSC-5700, the new Korean standard codeset. It supports all characters in the previous KSC 5601 and all 11,172 Korean characters. Korean UTF-8 supports only the Korean language-related ISO-10646 characters and fonts. Because ISO-10646 covers all characters in the world, it is necessary to supply all of the various input methods and fonts so that you may
input and output any character in any language. Before Universal UTF/UCS becomes available, Korean UTF-8 supports only the ISO-10646 code subset that is related to Korean characters and all other characters in the previous Korean standard codeset as well as Extended ASCII.
TABLE 3-5 lists the Korean codesets.
TABLE 3-5 koko.UTF-8
CodeSymbolTargetCodeSymbol
UTF-8ko_KR-UTF-8Wansungko_KR-euc
UTF-8ko_KR-UTF-8Johapko_KR-johap92
UTF-8ko_KR-UTF-8Packedko_KR-johap
UTF-8ko_KR-UTF-8ISO-2022-KRko_KR-iso2022-7
Wansungko_KR-eucUTF-8ko_KR-UTF-8
Johapko_KR-johap92UTF-8ko_KR-UTF-8
Packedko_KR-johapUTF-8ko_KR-UTF-8
ISO-2022-KRko_KR-iso2022-7UTF-8ko_KR-UTF-8
Wansungko_KR-eucJohapko_KR-johap92
Wansungko_KR-eucPackedko_KR-johap
Wansungko_KR-eucN-Byteko_KR-nbyte
Wansungko_KR-eucISO-2022-KRko_KR-iso2022-7
Johapko_KR-johap92Wansungko_KR-euc
Packedko_KR-johapWansungko_KR-euc
N-Byteko_KR-nbyteWansungko_KR-euc
ISO-2022-KRko_KR-iso2022-7Wansungko_KR-euc

Chinese: Simplified and Traditional

Chinese is written in two standards: Simplified and Traditional.
The People's Republic of China (P.R.C.) uses Simplified Chinese. In five-year steps, Chinese characters, which are often composed of an elaborate number of marks, are being simplified to make it quicker to write and easier to develop technology. The number of characters is also being reduced.
Simplified Chinese Solaris uses the EUC scheme to support the PRC Chinese national standard character set GB2312-80.
The Republic of China (R.O.C.) in Taiwan continues to use Traditional Chinese. Taiwan has a significant computer industry. They are the world leaders in production of laptops. The Taiwanese computer industry uses two mechanisms to produce Traditional Chinese: CNS-11643 and Big-5 encoding. The CNS-11643 codeset is used by the R.O.C. government. The Big-5 codeset is used by industry and private users, especially PC users.
Traditional Chinese Solaris currently uses the EUC scheme to support the government's CNS-11643 codeset. Since many PC applications handle only Big-5 code, there is a significant market demand to support Big-5 code in Traditional Chinese Solaris. Therefore, Solaris 2.6 supports the Big-5 locale. The Big-5 locale allows users to exchange Big-5 encoded files between PC and Solaris 2.6 without conversion procedures. The official name of the Big-5 locale is zh_TW.BIG5.
TABLE 3-6 shows the supported codeset conversions for Simplified Chinese.
TABLE 3-6
CodeSymbolTargetCodeSymbol
GB2312-80zh_CN.eucISO 2022-7zh_CN.iso2022-7
ISO 2022-7zh_CN.iso2022-7GB2312-80zh_CN.euc
GB2312-80zh_CN.eucISO 2022-CNzh_CN.iso2022-CN
ISO-2022-CNzh_CN.iso2022-CNGB2312-80zh_CN.euc
UTF-8UTF-8GB2312-80zh_CN.euc
GB2312-80zh_CN.eucUTF-8UTF-8
TABLE 3-7 shows the supported codeset conversions for Traditional Chinese.
TABLE 3-7
CodeSymbolTargetCodeSymbol
CNS 11643zh_TW-eucBig-5zh_TW-big5
CNS 11643zh_TW-eucISO 2022-7zh_TW-iso2022-7
Big-5zh_TW-big5CNS 11643zh_TW-euc
Big-5zh_TW-big5ISO 2022-7zh_TW-iso2022-7
ISO 2022-7zh_TW-iso2022-7CNS 11643zh_TW-euc
ISO 2022-7zh_TW-iso2022-7Big-5zh_TW-big5
CNS 11643zh_TW-euISO 2022-CN-EXTzh_TW-iso2022-CN-EXT
ISO 2022-CN-EXTzh_TW-iso2022-CN-EXTCNS 11643zh_TW-euc
TABLE 3-7 (Continued)
CodeSymbolTargetCodeSymbol
Big-5zh_TW-big5ISO 2022-CNzh_TW-iso2022-CN
ISO 2022-CNzh_TW-iso2022-CNBigzh_TW-big5
UTF-8UTF-8CNS 11643zh_TW-euc
CNS 11643zh_TW-eucUTF-8UTF-8
UTF-8UTF-8Big-5zh_TW-big5
Big-5zh_TW-big5UTF-8UTF-8
UTF-8UTF-8ISO 2022-7zh_TW-iso2022-7
ISO 2022-7 zh_TW-iso2022-7UTF-8UTF-8
ISO 2022-CN-EXTzh_TW-iso2022-CN-EXBig-5zh_TW-big5
Big-5zh_TW-big5ISO 2022-CN-EXTzh_TW-iso2022-CN-EXT

Japanese

Three Japanese input systems are available for Japanese Solaris 2.6. They can be used in the ja and ja_JP.PCK locale. However, some maintenance utilities do not support the PCK codeset.
TABLE 3-8
TypeDescription
Wnn6Wnn6 consists of the Kana-Kanji conversion server (jserver), interface module for htt (X Input Method Server) called xjsi.so, utilities, dictionaries, and configuration files. Wnn6 is co-packaged with Japanese Solaris 2.6. Wnn6 for Solaris 2.3/2.4 is distributed by OMRON SOFTWARE. Wnn6 for Solaris 2.5/2.5.1 is included in Update CD of Solaris and distributed by Nihon SMCC.
ATOK8ATOK8 consists of atok8 X Input Method Server, utilities, and dictionaries. SunSoft had been releasing ATOK7 from Japanese Solaris 2.1 until 2.5.1. ATOK8 replaces ATOK7. ATOK is a popular Japanese input facility for the Japanese PC market. It is distributed by JUSTSYSTEM. ATOK8 has been co-packaged since Japanese Solaris 2.5.
cs00cs00 consists of Kana-Kanji conversion server (cs00), interface module for htt (X Input Method Server) called xci.so, utilities, and dictionaries. cs00 is the only set of these three systems that has been bundled. It has been included since Japanese Solaris 2.1.
TABLE 3-9
Full Family NameSubfamilyFormatVendorEncoding
hg gothic bRTrueTypeRICOHJISX0208.1983,
JISX0201.1976
hg mincho lRTrueTypeRICOHJISX0208.1983,
JISX0201.1976
heiseiminRTrueTypeRICOHJISX0212.1990
TABLE 3-10
Full Family NameSubfamilyFormatVendorEncoding
ryumin light klRF3MORISAWAJISX0208.1983,
JISX0201.1976
gothic medium
bbb
RF3MORISAWAJISX0208.1989,
JISX0201.1976
TABLE 3-11
Full Family NameSubfamilyFormatVendorEncoding
gothicR, BPCF(12,14,16,20,24
JISX0208.1983,
JISX0201.1976
minchouRPCF(12,14,16,20,24)
JISX0208.1983,
JISX0201.1976
hg gothic bRPCF(12,14,16,18,20,24)RICOHJISX0208.1983,
JISX0201.1976
hg mincho lRPCF(12,14,16,18,20,2)RICOHJISX0208.1983,
JISX0201.1976
ryumin light
kl
RPCF(10,12,14,16,18,20,22)MORIS
AWA
JISX0208.1983,
JISX0201.1976
gothic
medium bbb
RPCF(10,12,14,16,18,20,22)MORIS
AWA
JISX0208.1983,
JISX0201.1976
heiseiminRPCF(12,14,16,18,20,24)RICOHJISX0212.1990

Note - The F3 fonts and F3 bitmap fonts (Morisawa fonts) will be supported only up to and including the next Solaris release.

Japanese Locales

Japanese Solaris 2.6 supports two locales. The ja locale is based on Japanese EUC. The ja_JP.PCK locale is based on PC-Kanji code. See the eucJP(5) or PCK(5) man page for more details.

Japanese Messages and man Pages

Some messages and manual pages have been translated into Japanese in Japanese Solaris 2.6.

Japanese Character Code Converter for iconv

The following table shows supported conversion with iconv(1) and iconv(3). See the iconv_ja(5) man page for details.
TABLE 3-12 iconv
Source CodeTarget Code
eucJPPCK
eucJPJIS7
eucJPSJIS
eucJPUTF-8
eucJPjis
eucJPibmj
SJISeucJP
SJISISO-2022-JP
SJISUTF-8
SJISjis
SJISibmj
PCKeucJP
PCKUTF-8
PCKISO-2022-JP
PCKjis
PCKibmj
ISO-2022-JPeucJP
TABLE 3-12 iconv(Continued)
Source CodeTarget Code
ISO-2022-JPPCK
ISO-2022-JPSJIS
UTF-8eucJP
UTF-8SJIS
UTF-8PCK
JIS7eucJP
jiseucJP
jisPCK
jisSJIS
ibmjeucJP
ibmjPCK
ibmjSJIS

Japanese-specific Commands

The following commands are for handling Japanese data in Japanese Solaris 2.6. See the jistoeuc(1), euctoibmj(1), jtty(1), jtops(1), or kanji(1) man pages for more details.
TABLE 3-13
CommandComments
jistoeucConverts JIS to Japanese EUC
jistosjConverts JIS to PC Kanji
euctojisConverts Japanese EUC to JIS
euctosjConverts Japanese EUC to PC Kanji
sjtojisConverts PC Kanji to JIS
sjtoeucConverts PC Kanji to Japanese EUC
euctoibmjConverts Japanese EUC to IBM-Japanese
ibmjtoeucConverts IBM-Japanese to Japanese EUC
TABLE 3-13 (Continued)
CommandComments
jttySets Japanese terminal characteristics
jtopsPostScript filter for printing Japanese character codeset
kanjiShows the list of Kanji codes with numbers of EUC, ja_JP.PCK, JIS and JIS kuten code
We recommend using iconv code conversion module instead of jistoeuc(1) or euctoibmj(1) converters. These last two converters are provided to allow backward compatibility.

Japanese Character Code Converter for TTY STREAMS

These are TTY STREAMS modules that are used to input and output Japanese characters on terminals. Usually setterm(1) organizes these modules/command properly for the user environment. tty(1) controls the behavior of those STREAMS modules.

Japanese-specific Printer Support

Japanese Solaris 2.6 supports the following Japanese-specific printers:
  • Epson VP-5085 (based on ESC/P)
  • NEC PC-PR201 (based on 201PL)
  • Canon LASERSHOT (based on LIPS)
  • Japanese PostScript Printer

JLE Binary Compatibility Package

Japanese Solaris 2.6 also provides Japanese Solaris1.1.x binary-compatibility packages the same as the base products.

User-Defined Character (UDC) Support

Several of the font tools available in the Solaris package are:
  • The User-Defined Character (UDC) font editor handles both outline (Type1) and bitmap (PCF) fonts: /usr/dt/bin/sdtudctool
  • OpenWindows font editor for bitmap font: /usr/openwin/bin/fontedit
  • OpenWindows Type3 font editor: /usr/openwin/bin/type3creator

Note - fontedit, type3creator, and fontmanager will be supported only up to and including the next Solaris release.