Solaris Internationalization Guide For Developers
只搜尋這本書
查看這本書:
以 PDF 格式下載這本書 (483 KB)

Chapter 3 Contents of the Localized Solaris 7 Products

The European Localized Solaris 7 Product

European Solaris is available in three localized versions: French, German, and European. All three versions of Solaris share the same software media, which includes a fully localized CDE environment, error messages, and on-line documentation in six languages--French, German, Spanish, Swedish, Italian, and English. The difference is in the printed documentation. The French and German Solaris products include localized printed documentation, while the printed documentation for the European version is in English only.

Table 3-1 shows a list of locales in the European product. This includes both full and partial locales.

Table 3-1 European 7 Locales

Locale Name

Language/Territory

C

POSIX English (7-bit) ASCII C

cz

Czech Republic

da

Denmark

de

Germany

de_AT

Austria

de_CH

Switzerland

de.ISO8859-15

Germany

el

Greece

en_AU

Australia

en_CA

Canada

en_IE

Ireland

en_NZ

New Zealand

en_UK

Great Britain

en_US

U.S.

es

Spain

es_AR

Argentina

es_BO

Bolivia

es_CL

Chile

es_CO

Colombia

es_CR

Costa Rica

es_EC

Ecuador

es_GT

Guatemala

es_MX

Mexico

es_NI

Nicaragua

es_PA

Panama

es_PE

Peru

es_PY

Paraguay

es_SV

El Salvador

es_UY

Uruguay

es_VE

Venezuela

et

Estonia

fr

France

fr_BE

Belgium (French)

fr_CA

Canada (French)

fr_CH

Switzerland (French)

fr.ISO8859-15

France

fr.UTF-8

France

hu

Hungary

it.ISO8859-15

Italy

it.UTF-8

Italy

it.ISO8859-15

Italy

lt.ISO8859-13

Lithuania

lv.ISO8859-13

Latvia

nl

Netherlands

nl_BE

Netherlands/Belgium

no

Norway

pl

Poland

pt_BR

Portuguese Brazil

ru

Russia

it.ISO8859-15

Italy

es.ISO8859-15

Spain

sv.ISO8859-15

Sweden

en_EU.ISO8859-15

Europe

en_GB.ISO8895-15

Britain

fr_BE.ISO8895-15

Belgium

nl.ISO8895-15

Netherlands

nl_BE.ISO8895-15

Belgium

pt.ISO8895-15

Portugal

de.-AT.ISO8895-15

Austria

en_IE.ISO8859-15

Ireland

da.ISO8859-15

Denmark

fi.ISO8859-15

Finland

el_EURO

Greece

sun_eu_greek

Greece

de.UTF-8

Germany

de.ISO8859-15

Germany

fr.UTF-8

France

it.UTF-8

Italy

es.UTF-8

Spain

es.ISO8859-15

Spain

sv.UTF-8

Sweden

sv.ISO8859-15

Sweden

en_UTF.8

Europe

en_ISO8859-15

Europe

All of these locales are also present in the base Solaris 7 release.

As mentioned, the locales include partial locales. These are based on core locales for the main language. For example, the fr_CA (French Canadian) is based on the fr (French) locale. These partial locales utilize the messages that are delivered into its parent locale (French for fr_CA). If a locale hasn't been fully localized, then it may contain only English messages.

A number of Eastern European locales have also been added into the Solaris 7 product, which may be based on other ISO standards. Previously Sun locales were based on ISO-8859-1. The Eastern European locales are based on other ISO standards, as shown in Table 3-2.

Locales that are not listed are still based on ISO-8859-1.

Table 3-2 Eastern European Locales in the Solaris 7 Product

Locale Name

Language/Territory

ISO

de_AT

German (Austrian)

8859-1

et

Estonian

8859-15

cz

Czech

8859-2

hu

Hungarian

8859-2

pl

Polish

8859-2

lv

Latvian

8859-13

lt

Lithuanian

8859-13

ru

Russian

8859-5

el

Greek

8859-7

tr

Turkish

8859-9

sq_AL

Albanian

8859-2

sk_SK

Slovakian

8859-2

sl_SL

Slovenian

8859-2

hr_HR

Croatian

8859-2

nr

Bosnian

8859-2

ro_RO

Romanian

8859-2

sr_SP

Serbian

8859-5

bg_BG

Bulgarian

8859-5

mk_MK

Macedonian

8859-5

ru.KOI8-R

Russian

KOI8-R

ar

Arabic

8859-6

he

Hebrew

8859-8

th_TH

Thai

8859-11 (TIS 620.2533)

All of the locales support character input and output. There is also iconv support for many of the major codesets. (For more on iconv, see iconv(1)The iconv modules are available on the end-user cluster of the Euro product. See Table 3-3 for details.

Table 3-3 iconv Support

Code

Symbol

Target Code

Symbol

Comment

ISO 8859-2

iso2

MS 1250

win2

Windows Latin 2

ISO 8859-2

iso2

MS 852

dos2

MS-DOS Latin 2

ISO 8859-2

iso2

Mazovia

maz

Mazovia

ISO 8859-2

iso2

DHN

dhn

Dom Handlowy Nauki

MS 1250

win2

ISO 8859-2

iso2

ISO Latin 2

MS 1250

win2

MS 852

dos2

MS-DOS Latin 2

MS 1250

win2

Mazovia

maz

Mazovia

MS 1250

win2

DHN

dhn

Dom Handlowy Naduki

MS 852

dos2

ISO 8859-2

iso2

ISO Latin 2

MS 852

dos2

MS 1250

win2

Windows Latin 2

MS 852

dos2

Mazovia

maz

Mazovia

MS 852

dos2

DHN

dhn

Dom Handlowy Nauki

Mazovia

maz

ISO 8859-2

iso2

ISO Latin 2

Mazovia

maz

MS 1250

win2

Windows Latin 2

Mazovia

maz

MS 852

dos2

MS-DOS Latin 2

Mazovia

maz

DHN

dhn

Dom Handlowy Nauki

DHN

dhn

ISO 8859-2

iso2

ISO Latin 2

DHN

dhn

MS 1250

win2

Windows Latin 2

DHN

dhn

MS 852

dos2

MS-DOS latin 2

DHN

dhn

Mazovia

maz

Mazovia

ISO 8859-5

iso5

KOI8-R

koi8

KOI8-R

ISO 8859-5

iso5

PC Cyrillic

alt

Alternative PC Cyrillic

ISO 8859-5

iso5

MS 1251

win5

Windows Cyrillic

ISO 8859-5

iso5

Mac Cyrillic

mac

Macintosh Cyrillic

OKI8-R

koi8

ISO 8859-5

iso5

ISO 8859-5 Cyrillic

KOI8-R

koi8

PC Cyrillic

alt

Alternative PC Cyrillic

KOI8-R

koi8

MS 1251

win5

Windows Cyrillic

KOI8-R

koi8

Mac Cyrillic

mac

Macintosh Cyrillic

PC Cyrillic

alt

ISO 8859-5

iso5

ISO 8859-5 Cyrillic

PC Cyrillic

alt

KOI8-R

koi8

KOI8-R

PC Cyrillic

alt

MS 1251

win5

Windows Cyrillic

PC Cyrillic

alt

Mac Cyrillic

mac

Macintosh Cyrillic

MS 1251

win5

ISO 8859-5

iso5

ISO 8859-5 Cyrillic

MS 1251

win5

KOI8-R

koi8

KOI8-R

MS 1251

win5

PC Cyrillic

alt

Alternative PC Cyrillic

MS 1251

win5

Mac Cyrillic

mac

Macintosh Cyrillic

Mac Cyrillic

mac

ISO 8859-5

iso5

ISO 8859-5 Cyrillic

Mac Cyrillic

mac

KOI8-R

koi8

KOI8-R

Mac Cyrillic

mac

PC Cyrillic

alt

Alternative PC Cyrillic

Mac Cyrillic

mac

MS 1251

win5

Windows Cyrillic

Table 3-4 contains a list of the Solaris 7 environment locales and their corresponding codeset names.

Table 3-4 New Locales and Corresponding Codeset Names

Locale

nl_langinfo (CODESET)

ICONV name

Product

arISO8859-6ISO8859-6Base/Euro
bg_BG ISO8859-5ISO8859-5Base/Euro
C646646Base/Euro
czISO8859-2ISO8859-2Base/Euro
daISO8859-1ISO8859-1Base/Euro
da.ISO8859-15 ISO8859-15ISO8859-15Base/Euro
de ISO8859-1ISO8859-1Base/Euro
de.ISO8859-15ISO8859-15ISO8859-15Base/Euro
de_UTF-8UTF-8UTF-8Base/Euro
de_ATISO8859-1ISO8859-1Base/Euro
de_AT.ISO8859-15 ISO8859-15ISO8859-15Base/Euro
de_CHISO8859-1ISO8859-1Base/Euro
elISO8859-7ISO8859-7Base/Euro
el.sun_eu_greek ISO8859-15ISO8859-15Base/Euro
en_AU ISO8859-1ISO8859-1Base/Euro
en_CA ISO8859-1ISO8859-1Base/Euro
en_EU.ISO8859-15ISO8859-15ISO8859-1Base/Euro
en_EU.UTF-8UTF-8UTF-8Base/Euro
en_GBISO8859-1ISO8859-1Base/Euro
en_GB.ISO8859-15ISO8859-15ISO8859-1Base/Euro
en_IEISO8859-1ISO8859-1Base/Euro
en_IE.ISO8859-15ISO8859-15ISO8859-1Base/Euro
en_NZISO8859-1ISO8859-1Base/Euro
en_USISO8859-1ISO8859-1Base/Euro
en_US.UTF-8UTF-8UTF-8Base/Euro
esISO8859-1ISO8859-1Base/Euro
es.ISO8859-15ISO8859-15ISO8859-15Base/Euro
es_ARISO8859-1ISO8859-1Base/Euro
es_BOISO8859-1ISO8859-1Base/Euro
es_CLISO8859-1ISO8859-1Base/Euro
es_COISO8859-1ISO8859-1Base/Euro
es_CRISO8859-1ISO8859-1Base/Euro
es_ECISO8859-1ISO8859-1Base/Euro
es_GTISO8859-1ISO8859-1Base/Euro
es_MXISO8859-1ISO8859-1Base/Euro
es-NIISO8859-1ISO8859-1Base/Euro
es_PAISO8859-1ISO8859-1Base/Euro
es_PEISO8859-1ISO8859-1Base/Euro
es_PYISO8859-1ISO8859-1Base/Euro
es_SVISO8859-1ISO8859-1Base/Euro
es.UTF-8UTF-8UTF-8Base/Euro
es_UYISO8859-1ISO8859-1Base/Euro
et_VEISO8859-1ISO8859-1Base/Euro
etISO8859-1ISO8859-1Base/Euro
fiISO8859-1ISO8859-1Base/Euro
fi.IOO8859-15ISO8859-15ISO8859-15Base/Euro
frISO8859-1ISO8859-1Base/Euro
fr.ISO8859-15ISO8859-15ISO8859-15Base/Euro
fr.UTF-8UTF-8UTF-8Base/Euro
fr_BEISO8859-1ISO8859-1Base/Euro
fr_BE.ISO8859-15ISO8859-15ISO8859-15Base/Euro
fr_CAISO8859-1ISO8859-1Base/Euro
fr_CHISO8859-1ISO8859-1Base/Euro
heISO8859-8ISO8859-8Base/Euro
he_ILISO8859-8ISO8859-8Base/Euro
hr_HRISO8859-2ISO8859-2Base/Euro
huISO8859-2ISO8859-2Base/Euro
itISO8859-1ISO8859-1Base/Euro
it.ISO8859-15ISO8859-15ISO8859-15Base/Euro
it.UTF-8UTF-8UTF-8Base/Euro
jaeucJPeucJPJapanese
ja_JP.PCKPCKPCKJapanese
ja_JP.UTF-8UTF-8UTF-8Japanese
ko5601ko_KR-eucKorean
ko.UTF-8UTF-8UTF-8Korean
ltISO8859-4ISO8859-4Base/Euro
lvISO8859-4ISO8859-4Base/Euro
mk_MKISO8859-5ISO8859-5Base/Euro
nlISO8859-1ISO8859-1Base/Euro
nl.ISO8859-15ISO8859-15ISO8859-15Base/Euro
nl_BEISO8859-1ISO8859-1Base/Euro
nl_BE.ISO8859-15ISO8859-15ISO8859-15Base/Euro
noISO8859-1ISO8859-1Base/Euro
no_NYISO8859-1ISO8859-1Base/Euro
nrISO8859-2ISO8859-2Base/Euro
plISO8859-2ISO8859-2Base/Euro
POSIX646646Base/Euro
ptISO8859-1ISO8859-1Base/Euro
pt.ISO8859-15ISO8859-15ISO8859-15Base/Euro
pt_BRISO8859-1ISO8859-1Base/Euro
ro_ROISO8859-2ISO8859-2Base/Euro
ruISO8859-5ISO8859-5Base/Euro
ru.KOI8-RKOI8-RKOI8-RBase/Euro
sk_SKISO8859-2ISO8859-2Base/Euro
sl_SIISO8859-2ISO8859-2Base/Euro
sq_ALISO8859-2ISO8859-2Base/Euro
sr_SPISO8859-5ISO8859-5Base/Euro
svISO8859-1ISO8859-1Base/Euro
sv.ISO8859-15ISO8859-15ISO8859-15Base/Euro
sv.UTF-8UTF-8UTF-8Base/Euro
th_THTIS620.2533TIS620.2533Base/Euro
trISO8859-9ISO8859-9Base/Euro
zhgb2312gb2312Simplified Chinese
zh.GBKGBKzh_CN.gbkSimplified Chinese
zh_TWcns11643zh_TW-eucTraditional Chinese
zh_TW.BIG5BIG5zh_TW_Big5Traditional Chinese


Note -

Locale naming conventions are as follows:

language[_territory][.codeset] where language is from ISO639 and territory is from ISO3166.

All locales with Base/Euro in the Product column are also available as Japanese, Korean, Simplified Chinese, and Traditional Chinese products.

All Solaris product locales preserve the Portable Character Set characters with US-ASCII code values.



Note -

5601 signifies the Korean EUC codeset containing KS C 5636 and KS C 5601-1987.

646 signifies ISO/IEC 646, which is US-ASCII.

eucJP signifies the Japanese EUC codeset. It contains JIS X0201-1976, JIS X0208-1983, and JIS X0212-1990.

gb2312 signifies Simplified Chinese EUC codeset, which contains GV 1988-80 and GB 2312-80.

PCK is also known as Shift JIS (SJIS).

UTF-8 is the UTF-8 of ISO/IEC 10646-1 containing various approved amendments and UNICODE 2.1

GBK signifies GB extensions. This includes all GB 2312-80 characters and all Unified Han characters of ISO/IEC 10646-1, as well as Japanese Hiragana and Katagana characters. It also includes many characters of Chinese, Japanese, and Korean character sets and of ISO/IEC 10646-1.


Font Formats

There are many different font formats. The extension lets you determine the font type.

  • PostScript Type 1 Fonts , which are also known as Adobe Type Manager (ATM) fonts, Type 1, and outline fonts, contain information in outline form that allows a PostScript printer or ATM to generate fonts of any size. Most of these fonts also contain hints that allow fonts to be rendered more readable at a low resolution or a small type size.

  • Bitmap Fonts contain a picture of the font at a specific size that has been optimized to look good at that specific size. If the font is scaled larger or smaller, the quality may degrade. On the other hand, bitmap fonts display quickly.

Location of Fonts on the System

Fonts are located at:

/usr/openwin/lib/locale/iso_8859_x/X11/fonts/X11/Type1/afm

or

/usr/openwin/lib/locale/iso_8859_x/X11/fonts/X11/75dpi
Adding and Removing Font Packages

To manually add font packages to the system:

  1. Always add the required font packages before the optional font packages.

  2. When you are removing font packages from the system, remove the optional font packages first.

You must follow this procedure to add or remove fonts. The class action scripts in the font packages depend on this for proper function. The optional font packages contain scripts that concatenate information onto the required font packages that are already resident on the system. If the required font packages are not there, problems may occur.

Summary of Asian Locales

Table 3-6 shows the Asian locales supported by these Asian products.

Table 3-5 Summary of Asian Locales

CD Set

Locale Name

Description

Supported Character Set

Korean

ko UTF-8

Korean (UTF-8 locale)

KS C 5601-1992

KS C 5700-1995

Simplified Chinese

zh GBK

Simplified Chinese ()EUC)

Simplified Chinese (GBK )

GB 2312-1980

GBK

Traditional Chinese

zh_TW zh_TW.BIG5

Traditional Chinese (EUC)

Traditional Chinese (BIG5)

CNS 11643 1992

BIG5

Japanese

ja

ja_JP.PCK

ja_JP.UTF-8

Japanese EUC

Japanese PCK [ja_JP.PCK doesn't support JIS x 0212-1990]

Japanese UTF-8

JIS x 0201-1976

JIS x 0208-1990

JIS x 0212-1990

VDC [VDC: Vendor Defined Character. VDCs occupy unused (reserved) code points of JIS X 0208-1990 or JIS X 0212-1990]

UDC [UDC: User Defined Character. UDCs occupy unused (reserved) code points of JIS X 0208-1990 or JIS X 0212-1990 (also unused for VDCs.)]

Korean in the Solaris 7 Product

In December 1995, the Korean government announced a standard Korean codeset, KSC-5700, which is based on ISO-10646-1/Unicode 2.0. The standard codeset replaces KSC 5601, which was based on ISO-2022.

The ISO-10646 character set uses 2 (UCS-2; Universal Character Set two-byte form) or 4 (UCS-4) bytes to represent each character.

The ISO-10646 character set cannot be used directly on IBM-PC-based operating systems. For example, the kernel and many other modules of the Solaris operating environment interpret certain byte values as control instructions, such as a null character (0x00) in any string. The ISO-10646 character set can be encoded with any bit combinations in the first or subsequent bytes. The ISO-10646 characters cannot be freely transmitted through the Solaris system with these limitations. In order to establish a migration path, the ISO-10646 character set defines the UCS Transformation Format (UTF), which recodes the ISO-10646 characters without using C0 controls (0x00..0x1F), C1 controls (0x80..0x9F), space (0x20), and DEL (0x7F).

The ko.UTF-8 is a Solaris locale to support KSC-5700, the Korean standard codeset. It supports all characters in the previous KSC 5601 and all 11,172 Korean characters. Korean UTF-8 supports the Korean language-related ISO-10646 characters and fonts. Because ISO-10646 covers all characters in the world, all of the various input methods and fonts are supplied so that you may input and output any character in any language. Before Universal UTF/UCS becomes available, Korean UTF-8 supports the ISO-10646 code subset that is related to Korean characters as well as all other characters in the previous Korean standard codeset, and Extended ASCII.

Table 3-6 lists the Korean codesets.

Table 3-6 Codeset Conversions Supported for Korean ko, ko.UTF-8

Code

Symbol

TargetCode

Symbol

UTF-8

ko_KR-UTF-8

Wansung

ko_KR-euc

UTF-8

ko_KR-UTF-8

Johap

ko_KR-johap92

UTF-8

ko_KR-UTF-8

Packed

ko_KR-johap

UTF-8

ko_KR-UTF-8

ISO-2022-KR

ko_KR-iso2022-7

Wansung

ko_KR-euc

UTF-8

ko_KR-UTF-8

Johap

ko_KR-johap92

UTF-8

ko_KR-UTF-8

Packed

ko_KR-johap

UTF-8

ko_KR-UTF-8

ISO-2022-KR

ko_KR-iso2022-7

UTF-8

ko_KR-UTF-8

Wansung

ko_KR-euc

Johap

ko_KR-johap92

Wansung

ko_KR-euc

Packed

ko_KR-johap

Wansung

ko_KR-euc

N-Byte

ko_KR-nbyte

Wansung

ko_KR-euc

ISO-2022-KR

ko_KR-iso2022-7

Johap

ko_KR-johap92

Wansung

ko_KR-euc

Packed

ko_KR-johap

Wansung

ko_KR-euc

N-Byte

ko_KR-nbyte

Wansung

ko_KR-euc

ISO-2022-KR

ko_KR-iso2022-7

Wansung

ko_KR-euc

Chinese: Simplified and Traditional

Simplified Chinese in the Solaris 7 environment provides two locales: zh and zh.GBK. In the zh locale, the EUC scheme is usesd to encode GB2312-80 The zh.GBK locale supports the GBK codeset, which is a superset of GB2312-80.

Simplified Chinese is used mostly in the People's Republic of China (PRC) and in Singapore..

The following input methods are supported for the zh locale

  • New QuanPin

  • New ShuangPin

  • Quanpy

  • Location

  • PinYin

  • Stroke

  • Golden

  • Intelligent Pinyin

  • Simplified Chinese Symbol

The following input methods are supported for the zh.GBK locale

  • New QuanPin

  • New ShuangPin

  • Quanpy

  • GBK Code

  • Japanese

  • Hanja

  • Zhuyin

  • Unicode

Table 3-7 shows the TrueType Fonts for the zh Locale

Table 3-7 Solaris 7 TrueType Fonts for the zh Locale
Full Family NameSubfamilyFormatVendorEncoding
FangsongRTrueTypeHanyiGB2312.1980
HeiRTrueTypeMonotypeGB2312.1980
KaiRTrueTypeMonotypeGB2312.1980
SongRTrueTypeMonotypeGB2312.1980

Table 3-8 shows the Bitmap Fonts for the zh Locale

Table 3-8 Solaris 7 Bitmap Fonts for the zh Locale
Full Family NameSubfamilyFormatEncoding
SongBPCF (14,16)GB2312.1980
SongRPCF (12,14,16,20,24)GB2312.1980

Table 3-9 shows the TrueType Fonts for the zh.GBK Locale

Table 3-9 TrueType Fonts for the zh.GBK Locale
Full Family NameSSubfamilyFormatVendorEncoding
FansongRTrueTypeZhongyiGBK
HeiRTrueTypeZhongyiGBK
KaiRTrueTypeZhongyiGBK
SongRTrueTypeZhongyiGBK

Table 3-10 shows the Bitmap Fonts for the zh.GBK Locale

Table 3-10 Bitmap Fonts for the zh.GBK Locale
Full Family NameSubfamilyFormatEncoding
SongRPCF (12,14,16,20,24)GBK

Table 3-11 shows the supported codeset conversions for Simplified Chinese.

Table 3-11 Codeset Conversions for Simplified Chinese

Code

Symbol

TargetCode

Symbol

GB2312-80

zh_CN.euc

ISO 2022-7

zh_CN.iso2022-7

ISO 2022-7

zh_CN.iso2022-7

GB2312-80

zh_CN.euc

GB2312-80

zh_CN.euc

ISO 2022-CN

zh_CN.iso2022-CN

ISO-2022-CN

zh_CN.iso2022-CN

GB2312-80

zh_CN.euc

UTF-8

UTF-8

GB2312-80

zh_CN.euc

GB2312-80

zh_CN.euc

UTF-8

UTF-8

zh.GBK

zh_CN.gbk

ISO2022-CN

zh_CN.iso2022-CN

ISO2022-CN

zh_CN.iso2022-CNzh.GBKzh_CN.gbk

zh.GBK

zh_CN.gbk

Big-5

zh_TW-Big5

Big-5

zh_TW-Big5

zh.GBK

zh_CN.gbk

GB2312-80

zh_CN.euc

Big-5

zh_TW-Big5

Big-5

zh_TW-Big5

GB2312-80

zh_CN.euc

UTF-8

UTF-8

zh.GBK

zh_CN.gbk

zh.GBK

zh_CN.gbk

UTF-8

UTF-8

UTF-8

UTF-8

ISO2022-CN

zh_CN.iso2022-CN

ISO2022-CN

zh_CN.iso2022-CN

UTF-8

UTF-8

Traditional Chinese in the Solaris 7 product provides two locales: zh_TW and zh_TW.BIG5. In the zh_TW locale, the EUC scheme is used to encode CNS 11643.1992 codeset. The zh_TW.BIG5 locale supports the Big-5 codeset.

Traditional Chinese is used mostly in Taiwan and Hong Kong.

Traditional Chinese supports the following input methods:

  • Chuyin

  • I-Tien

  • Telecode

  • TsangChieh

  • CheinI

  • NeiMa

  • ChuangHsing

  • Array

  • BoShiaMy

  • DaYi

    Table 3-12 shows Traditional Chinese Truetype Fonts for the zh_TW Locales

Table 3-12 Traditional Chinese Truetype Fonts for the zh_TW Locales
Full Family NameSubfamilyFormatVendorEncoding
HeiRTruetypeHanyiCNS11643.1992
KaiRTruetypeHanyiCNS11643.1992
MingRTruetypeHanyiCNS11643.1992

Table 3-13 shows the Traditional Chinese BitMap Fonts for the zh_TW Locales

Table 3-13 Traditional Chinese BitMap Fonts for the zh_TW Locales
Full Family NameSubfamilyFormatEncoding
MingRPCF (12,14,16,20,24)CNS11643.1992

Table 3-14 shows the Traditional Chinese TrueType Fonts for the zh_TW.BIG5 Locales

Table 3-14 Traditional Chinese TrueType Fonts for the zh_TW.BIG5 Locales
Full Family NameSubfamilyFormatVendorEncoding
HeiRTrueTypeHanyiBig5
KaiRTrueTypeHanyiBig5
MingRTrueTypeHanyiBig5

Table 3-15 shows the Traditional Chinese BitMap Fonts for the zh_TW.BIG5 Locales

Table 3-15 Traditional Chinese BitMap Fonts for the zh_TW.BIG5 Locales
Full Family NameSubfamilyFormatEncoding
MingRPCF (12,14,16,20,24)Big5

Table 3-16 shows the supported codeset conversions for Traditional Chinese.

Table 3-16 Codeset Conversions for Traditional Chinese

Code

Symbol

TargetCode

Symbol

CNS 11643

zh_TW-euc

Big-5

zh_TW-Big5

CNS 11643

zh_TW-euc

ISO 2022-7

zh_TW-iso2022-7

Big-5

zh_TW-Big5

CNS 11643

zh_TW-euc

Big-5

zh_TW-Big5

ISO 2022-7

zh_TW-iso2022-7

ISO 2022-7

zh_TW-iso2022-7

CNS 11643

zh_TW-euc

ISO 2022-7

zh_TW-iso2022-7

Big-5

zh_TW-Big5

CNS 11643

zh_TW-eu

ISO 2022-CN-EXT

zh_TW-iso2022-CN-EXT

ISO 2022-CN-EXT

zh_TW-iso2022-CN-EXT

CNS 11643

zh_TW-euc

Big-5

zh_TW-Big5

ISO 2022-CN

zh_TW-iso2022-CN

ISO 2022-CN

zh_TW-iso2022-CN

Big-5

zh_TW-Big5

UTF-8

UTF-8

CNS 11643

zh_TW-euc

CNS 11643

zh_TW-euc

UTF-8

UTF-8

UTF-8

UTF-8

Big-5

zh_TW-Big5

Big-5

zh_TW-Big5

UTF-8

UTF-8

UTF-8

UTF-8

ISO 2022-7

zh_TW-iso2022-7

ISO 2022-7

zh_TW-iso2022-7

UTF-8

UTF-8

ISO 2022-CN-EXT

zh_TW-iso2022-CN-EX

Big-5

zh_TW-Big5

Big-5

zh_TW-Big5

ISO 2022-CN-EXT

zh_TW-iso2022-CN-EXT

Japanese Input Systems

Three Japanese input systems are bundled in Japanese Solaris 7. They can be used in the ja, ja_JP.PCK and ja_JP.UTF-8 locales. However, some maintenance utilities do not support the PCK codeset.

The Japanese Input System is shown below in Table 3-17.

Table 3-17 Japanese Input Systems

Name

Description

Wnn6

Wnn6 consists of the Kana-Kanji conversion server (jserver), interface module for htt (X Input Method Server) called xjsi.so, utilities, and dictionaries. Wnn6 is the default Japanese input system.

Wnn6 supports JIS X 0201-1976, JIS X 0208-1990 and JIS X0212-1990 character sets.

ATOK8

ATOK8 consists of atok8 X Input Method Server, utilities, and dictionaries. ATOK8 is a popular Japanese input system facility in the Japanese PC market. ATOK7 was released with Solaris 2.1 until 2.5.1 has been replaced by ATOK8.

ATOK8 supports JIS X 0201-1976 and JIS X 0208-1990 character sets.

cs00

cs00 consists of the Kana-Kanji conversion server (cs00), interface module for htt (X Input Method Server) called xci.so, utilities, and dictionaries. cs00 has been bundled with Japanese Solaris since Solaris 2.1

cs00 supports JIS X 0201-1976, JIS X 0208-1990 and JIS X 0212-1990 character sets.

Japanese TrueType Fonts are show below in Table 3-18.

Table 3-18 Japanese TrueType Fonts

Full Family Name

Subfamily

Format

Vendor

Encoding

hg gothic b

R

TrueType

RICOH

JISX0208.1983, JISX0201.1976

hg mincho l

R

TrueType

RICOH

JISX0208.1983, JISX0201.1976

heiseimin

R

TrueType

RICOH

JISX0212.1990

Japanese Bitmap Fonts are shown in Table 3-19 below.

Table 3-19 Japanese Bitmap Fonts

Full Family Name

Subfamily

Format

Vendor

Encoding

gothic

R, B

PCF(12,14,16,20,24)

JISX0208.1983,

JISX0201.1976

minchou

R

PCF(12,14,16,20,24)

JISX0208.1983,

JISX0201.1976

hg gothic b

R

PCF(12,14,16,18,20,24)

RICOH

JISX0208.1983, JISX0201.1976

hg mincho l

R

PCF(12,14,16,18,20,2)

RICOH

JISX0208.1983, JISX0201.1976

heiseimin

R

PCF(12,14,16,18,20,24)

RICOH

JISX0212.1990

Japanese Locales

Japanese Solaris 7 supports three locales. The ja locale is based on Japanese EUC. The ja_JP.PCK locale is based on PC-Kanji code (Shift JIS) and the ja_JP.UTF-8 locale is based on UTF-8.

Japanese Messages and man Pages

Some messages and manual pages have been translated into Japanese in Japanese Solaris 7.

Japanese Character Code Converter for iconv

The following table shows supported conversion with iconv(1) and iconv(3). See the iconv_ja(5)man page for details.

Table 3-20 shows iconv Conversion Support.

Table 3-20 iconv Conversion Support

Source Code

Target Code

eucJP

PCK

eucJP

JIS7

eucJP

SJIS

eucJP

UTF-8

eucJP

jis

eucJP

ibmj

SJIS

eucJP

SJIS

ISO-2022-JP

SJIS

UTF-8

SJIS

jis

SJIS

ibmj

PCK

eucJP

PCK

UTF-8

PCK

ISO-2022-JP

PCK

jis

PCK

ibmj

ISO-2022-JP

eucJP

ISO-2022-JP

PCK

ISO-2022-JP

SJIS

UTF-8

eucJP

UTF-8

SJIS

UTF-8

PCK

JIS7

eucJP

jis

eucJP

jis

PCK

jis

SJIS

ibmj

eucJP

ibmj

PCK

UTF-8

ISO-2022-JP

ISO-2022-JP

UTF-8

eucJP

UTF-8-Java

UTF-8-Java

eucJP

PCK

UTF-8-Java

UTF-8-Java

PCK

eucJP

ISO-2022-JP.RFC1468

PCK

ISO-2022-JP.RFC1468

UTF-8

ISO-2022-JP.RFC1468

eucJP

ibmj-EBCDIK

ibmj-EBCDIK

eucJP

PCK

ibmj-EBCDIK

ibmj-EBCDIK

PCK

Japanese Character Code Converter for TTY STREAMS

There are TTY STREAMS modules that perform code conversion between an encoding for a specific terminal and an encoding for a specific locale. With an appropriate STREAMS module, a user can log in from a Japanese terminal into a Japanese locale, even if the encoding between the terminal and the Japanese locale does not match. tty(1) controls the behavior of those STREAMS modules.

Japanese-specific Printer Support

The Japanese Solaris 7 product supports the following Japanese-specific printers:

  • Epson VP-5085 (based on ESC/P)

  • NEC PC-PR201 (based on 201PL)

  • Canon LASERSHOT (based on LIPS)

  • Japanese PostScript Printer

JLE Binary Compatibility Package

The Japanese Solaris 7 package also provides Japanese Solaris 1.1.x binary-compatibility packages that are the same as the base products.

User-Defined Character (UDC) Support

To handle User-Defined Characters, sdtudctool has been available since the Solaris 2.6 release. Sdtudctool handles both outline (Type1) and bitmap (PCF) fonts. Some utilities are also available to migrate the UDC fonts that were created by old utilities, such as fontedit, type3creator andfontmanager in prior releases.

Korean Solaris 7 Product

The Korean Solaris product, used mostly in Korea, supports all the locales available in the English/Euro products. Additionally, it supports two Korean locales: ko and ko.UTF-8. In the ko locale, the EUC scheme is used to encode KSC 5601-1987. The ko.UTF-8 locale supports the KSC 5700-1995/Unicode 2.0 codeset, which is a super set of KSC 5601-1987. These two locales look the same for the end user, but the internal character encoding is different. The Korean Solaris product supports the following Input Methods

for the ko locale:

  • Hangul 2-BeolSik (1 set of consonants and 1 set of vowels)

  • Hangul-Hanja conversion

  • Special character

  • Hexadecimal code

for the ko.UTF-8 locale:

  • Hangul 2-BeolSik (1 set of consonants and 1 set of vowels)

  • Hangul-Hanja conversion

  • Special character

  • Hexadecimal code

The following fonts are available in the Korean version of the Solaris 7 product:

Table 3-21 Solaris 7 Korean CID/Type 1 Fonts for the ko Locale
Full Family NameSubfamilyFormatVendorEncoding
GothicRCID/Type 1HanyangAdobe-Korean
GraphicRCID/Type 1HanyangAdobe-Korean
HaesoRCID/Type 1HanyangAdobe-Korean
KodigRCID/Type 1HanyangAdobe-Korean
MyeongijoRCID/Type 1HanyangAdobe-Korean
PilkiRCID/Type 1HanyangAdobe-Korean
RoundgothicRCID/Type 1HanyangAdobe-Korean

Table 3-22 Solaris 7 Korean Bitmap Fonts for the ko Locale
Full Family NameSubfamilyFormatEncoding
GothicR/BPCF (12,14,16,18,20,24)KSC 5601-1987
GraphicR/BPCF (12,14,16,18,20,24)KSC 5601-1987
HaesoR/BPCF (12,14,16,18,20,24)KSC 5601-1987
KodigR/BPCF (12,14,16,18,20,24)KSC 5601-1987
MyeongijoR/BPCF (12,14,16,18,20,24)KSC 5601-1987
PilkiR/BPCF (12,14,16,18,20,24)KSC 5601-1987
RoundgothicR/BPCF (12,14,16,18,20,24)KSC 5601-1987

Table 3-23 Solaris 7 Korean CID/Type 1 Fonts for the ko.UTF-8 Locale
Full Family NameSubfamilyFormatVendorEncoding
GothicRCID/Type 1HanyangAdobe-Korean
GraphicRCID/Type 1HanyangAdobe-Korean
HaesoRCID/Type 1HanyangAdobe-Korean
KodigRCID/Type 1HanyangAdobe-Korean
MyeongijoRCID/Type 1HanyangAdobe-Korean
PilkiRCID/Type 1HanyangAdobe-Korean

Table 3-24 Solaris 7 Korean Bitmap Fonts for the ko.UTF-8 Locale
Full Family NameSubfamilyFormatEncoding
GothicR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)
GraphicR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)
HaesoR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)
KodigR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)
MyeongijoR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)
PilkiR/BPCF (12,14,16,18,20,24)KSC 5601-1992 (Johap)

Table 3-25 Korean ICONV
CodeSymbolTarget CodeSymbol
KSC 5601-19871506UTF-8UTF-8
ISO 646646KSC 5601-19875601
KSC 5601-1987EUC-KRUTF-8UTF-8
KSC 5601-1987KSC5601UTF-8UTF-8
UTF-8UTF-8KSC 5601-19875601
UTF-8UTF-8KSC 5601-1987EUC-KR
UTF-8UTF-8KSC 5601-1987KSC 5601
UTF-8ko-KR-UTF-8IBM CP 933cp 933
UTF-8ko-KR-UTF-8KSC 5601-1987ko_KR-euc
UTF-8ko-KR-UTF-8ISO2022-KRko_KR-iso2022-7
UTF-8ko-KR-UTF-8KSC 5601-1987 - Johapko_KR-johap
UTF-8ko-KR-UTF-8KSC5601-1992 - Johapko_KR-johap92
IBM CP933cp933UTF-8ko_KR-UTF-8
KSC 5601-1987ko_KR-eucUTF-8ko_KR-UTF-8
KSC 5601-1987ko_KR-eucISO 2022-KRko_KR-iso2022-7
KSC 5601-1987ko_KR-eucKSC 5601-1987 - Johapko_KR-johap
KSC 5601-1987ko_KR-eucKSC 5601-1992 - Johapko_KR-johap92
KSC 5601-1987ko_KR-eucKSC 5601-1992-Annex:4ko_KR-nbyte
ISO 2022-KRiso2022-7UTF-8ko_KR-UTF-8
ISO 2022-KRiso2022-7KSC 5601-1987ko_KR-euc
KSC 5601-1987 - Johapko-KR-johapUTF-8ko_KR-UTF-8
KSC 5601-1987 - Johapko-KR-johapKSC 5601-1987ko_KR-euc
KSC 5601-1992 - Johapko-KR-johap92UTF-8ko_KR-UTF-8
KSC 5601-1992 - Johapko-KR-johap92KSC 5601-1987ko_KR-euc
KSC 5601-1992 - Annex:4ko-KR-nbyteKSC 5601-1987ko_KR-euc

How to Use the iconv Command

The iconv command converts the characters or sequences of characters in a file from one codeset to another, then writes the results to standard output. If there is no conversion for a particular character, it is converted into an underscore `_' in the target codeset. See the iconv(1) man page for more information.

The following options are supported:

  • -f fromcode Symbol of the input codeset.

  • -t tocode Symbol of the output codeset.

To convert a mail file from one encoding into another, use the iconv command:

example% iconv -f from_codeset -t to_codeset mail.codeset > mail.codeset