<?xml version="1.0" encoding="utf-8"?>
<book fpi="-//Sun::SunSoft//DOCUMENT SOLASNSUPPT Version 2.0//en" role="numbered" label="alpha" id="solasnsuppt" lang="en" userlevel="developer"><title lang="en">Asian-Language Support in the Solaris Operating Environment</title><bookinfo><bookbiblio><title lang="en">Asian-Language Support in the Solaris Operating Environment</title><authorgroup lang="en"><author lang="en"><firstname lang="en">John</firstname><surname lang="en">White</surname></author></authorgroup><isbn lang="en"/><pubsnumber lang="en"><gentext type="text">Part No: </gentext>806-5582</pubsnumber><releaseinfo lang="en"/><pubdate lang="en">May 2000</pubdate><publisher lang="en"><publishername lang="en">Sun Microsystems, Inc.</publishername><address lang="en"><street lang="en">901 San Antonio Road</street><city lang="en">Palo Alto<gentext type="text">, </gentext></city><state lang="en">CA<gentext type="text"></gentext></state><postcode lang="en">94303-4900</postcode><country lang="en">U.S.A.</country></address></publisher><copyright lang="en"><year lang="en">2000</year><holder lang="en">     Sun Microsystems</holder></copyright><abstract lang="en"><para lang="en">Title: Asian-Language Support in the Solaris Operating Environment</para><para lang="en">Part number: 806-5582</para><para lang="en">Audience: System administrators, software developers</para><para lang="en">Page count: 39</para><para lang="en">Keywords: Solaris 8 operating environment, Internationalization, I18N,
Asian language</para><para lang="en">This book provides information and software features for internationalizing
software in Asian-language markets</para></abstract></bookbiblio><legalnotice lang="en"><para lang="en">This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers.</para><para lang="en">Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd.</para><para lang="en">Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, 
 and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. </para><para lang="en">The OPEN LOOK and <trademark class="trade" lang="en">Sun</trademark> Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry. Sun holds a non-exclusive license from Xerox to the Xerox Graphical User Interface, which license also covers Sun's licensees who implement OPEN LOOK GUIs and otherwise comply with Sun's written license agreements.</para><para lang="en">Federal Acquisitions: Commercial Software-Government Users Subject to Standard License Terms and Conditions.</para><para lang="en">DOCUMENTATION IS PROVIDED "AS IS" AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID.</para><para lang="fr"/><para lang="en">Ce produit ou document est protégé par un copyright et distribué avec des licences qui en restreignent l'utilisation, la copie, la distribution, et la décompilation. Aucune partie de ce produit ou document ne peut être reproduite sous aucune forme, par quelque moyen que ce soit, sans l'autorisation préalable et écrite de Sun et de ses bailleurs de licence, s'il y en a. Le logiciel détenu par des tiers, et qui comprend la technologie relative aux polices de caractères, est protégé par un copyright et licencié par des fournisseurs de Sun.</para><para lang="en">Des parties de ce produit pourront être dérivées du système Berkeley BSD licenciés par l'Université de Californie. UNIX est une marque déposée aux Etats-Unis et dans d'autres pays et licenciée exclusivement par X/Open Company, Ltd. </para><para lang="en">Sun, Sun Microsystems, le logo Sun, docs.sun.com, AnswerBook, AnswerBook2, 
 et Solaris sont des marques de fabrique ou des marques déposées, ou marques de service, de Sun Microsystems, Inc. aux Etats-Unis et dans d'autres pays. Toutes les marques SPARC sont utilisées sous licence et sont des marques de fabrique ou des marques déposées de SPARC International, Inc. aux Etats-Unis et dans d'autres pays. Les produits portant les marques SPARC sont basés sur une architecture développée par Sun Microsystems, Inc.</para><para lang="en">L'interface d'utilisation graphique OPEN LOOK et <trademark class="trade" lang="en">Sun</trademark> a été développée par Sun Microsystems, Inc. pour ses utilisateurs et licenciés. Sun reconnaît les efforts de pionniers de Xerox pour la recherche et le développement du concept des interfaces d'utilisation visuelle ou graphique pour l'industrie de l'informatique. Sun détient une licence non exclusive de Xerox sur l'interface d'utilisation graphique Xerox, cette licence couvrant également les licenciés de Sun qui mettent en place l'interface d'utilisation graphique OPEN LOOK et qui en outre se conforment aux licences écrites de Sun.</para><para lang="en">CETTE PUBLICATION EST FOURNIE "EN L'ETAT" ET AUCUNE GARANTIE, EXPRESSE OU IMPLICITE, N'EST ACCORDEE, Y COMPRIS DES GARANTIES CONCERNANT LA VALEUR MARCHANDE, L'APTITUDE DE LA PUBLICATION A REPONDRE A UNE UTILISATION PARTICULIERE, OU LE FAIT QU'ELLE NE SOIT PAS CONTREFAISANTE DE PRODUIT DE TIERS. CE DENI DE GARANTIE NE S'APPLIQUERAIT PAS, DANS LA MESURE OU IL SERAIT TENU JURIDIQUEMENT NUL ET NON AVENU.</para></legalnotice><subjectset lang="en"><subject lang="en"><subjectterm lang="en">Programming &amp; Tools</subjectterm></subject><subject lang="en"><subjectterm lang="en">System Administration</subjectterm></subject></subjectset></bookinfo><preface id="preface-1" lang="en" role="preface"><gentext type="text">Preface</gentext><gentext type="toc">Preface</gentext><title lang="en">Preface</title><highlights lang="en"><para lang="en">The <citetitle lang="en">Asian-Language Support in the <trademark class="trade" lang="en">Solaris</trademark> Operating Environment</citetitle> white paper presents information and software features for internationalizing software in Asian-language markets. </para><para lang="en"/></highlights><sect1 id="preface-9" lang="en"><title id="preface-2" lang="en">Who Should Use This Book</title><para lang="en">This white paper is intended for software developers who are interested in developing internationalized software for the Asian-language <trademark class="trade" lang="en">Solaris</trademark> operating environment. This white paper is part of a 4-part series on internationalization for Solaris software developers. The four internationalization white papers are:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en"><citetitle lang="en">Asian-Language Support in the <trademark class="trade" lang="en">Solaris</trademark> Operating Environment</citetitle></para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">Complex Text Layout Language Support in the <trademark class="trade" lang="en">Solaris</trademark> Operating Environment</citetitle></para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">Unicode Support in the <trademark class="trade" lang="en">Solaris</trademark> Operating Environment</citetitle></para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">Euro Currency Support in the <trademark class="trade" lang="en">Solaris</trademark> Operating Environment</citetitle></para></listitem></itemizedlist></sect1><sect1 id="preface-4" lang="en"><title lang="en">How This Book Is Organized</title><para lang="en"><link linkend="chapter1-1" lang="en">Chapter 1</link> provides an overview of internationalization.</para><para lang="en"><link linkend="chapter2-1" lang="en">Chapter 2</link> provides an overview of Solaris internationalization.</para><para lang="en"><link linkend="chapter3-1" lang="en">Chapter 3</link> describes internationalization details of Asian languages, specifically Chinese, Japanese, and Korean.</para><para lang="en"><link linkend="chapter4-1" lang="en">Chapter 4</link> addresses the technical concerns of the Asian-specific architecture.</para><para lang="en"><link linkend="chapter5-1" lang="en">Chapter 5</link> describes common development issues in multibyte applications.</para><para lang="en"><link linkend="appendixa-1" lang="en">Appendix A</link> shows features for the <link linkend="appendixa-2" lang="en">Common Desktop Environment</link>, <link linkend="appendixa-3" lang="en">Simplified Chinese</link>, <link linkend="appendixa-4" lang="en">Traditional Chinese</link>, <link linkend="appendixa-5" lang="en">Japanese</link>, and <link linkend="appendixa-6" lang="en">Korean</link> Solaris operating environment. </para></sect1><sect1 id="preface-5" lang="en"><title lang="en">Related Books</title><para lang="en">The following books are related to software internationalization: </para><para lang="en"><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en"><citetitle lang="en">Creating Worldwide Software: Solaris International Developer's Guide</citetitle> Bill Tuthill and David Smallberg.</para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">Internationalization Guide, Version 2: Open Group Guide</citetitle> The Open Group</para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">International Language Environments Guide</citetitle> Solaris Developer Collection.</para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">Programming for the World: A Guide to Internationalization</citetitle> Sandra Martin O'Donnell.</para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">The Unicode Standard, Version 3.0</citetitle> The Unicode Consortium.</para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">X Windows on the World, Developing Internationalized Software with X, Motif, and CDE</citetitle> Thomas C. McFarland.</para></listitem></itemizedlist><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">The following book is related to Asian-language internationalization: </para></listitem><listitem lang="en"><para lang="en"><citetitle lang="en">CJKV Information Processing: Chinese, Japanese, Korean, and Vietnamese Computing</citetitle> Ken Lunde </para></listitem></itemizedlist></para></sect1><sect1 id="sundocs-1" lang="en"><title lang="en">Ordering Sun Documents</title><para lang="en">Fatbrain.com, an Internet professional bookstore, stocks select product
documentation from Sun Microsystems, Inc.</para><para lang="en">For a list of documents and how to order them, visit the Sun Documentation
Center on Fatbrain.com at <ulink url="http://www1.fatbrain.com/documentation/sun"><literal moreinfo="none" lang="en">http://www1.fatbrain.com/documentation/sun</literal></ulink>.</para></sect1><sect1 id="sundocs-2" lang="en"><title lang="en">Accessing Sun Documentation Online</title><para lang="en">The <trademark class="service" lang="en">docs.sun.com</trademark> Web site enables
you to access Sun technical documentation online. You can browse the docs.sun.com
archive or search for a specific book title or subject. The URL is <ulink url="http://docs.sun.com"><literal moreinfo="none" lang="en">http://docs.sun.com</literal></ulink>.</para></sect1></preface><chapter id="chapter1-1" lang="en"><gentext type="text">Chapter 1</gentext><gentext type="toc">1.  Solaris Operating Environment in International Markets</gentext><title lang="en">Solaris Operating Environment in International Markets</title><sect1 id="chapter1-2" lang="en"><title lang="en"><gentext type="text">1.1 </gentext>The Need for Global Software Development</title><para lang="en">In an integrated global economy, software applications must be compatible in numerous languages and cultures. Users want to run applications in their own language, using their own local conventions. Furthermore, international companies have international needs. For example, a large corporation with headquarters in Tokyo and branches in New York and Paris may require a mixture of English, Japanese, and French software environments supporting multiple languages on one site.  </para><para lang="en">To sell software to multinational companies, developers must always be aware of local customs, conventions, and requirements during development, such as character sets, numeric, time, date, and monetary formats, and messages. Adapting software to localized writing systems is particularly challenging in Asian markets. 8-bit encoding is good enough for European phonetic alphabets, but Chinese, Japanese, and Korean ideographs require multibyte encoding.</para></sect1><sect1 id="chapter1-3" lang="en"><title lang="en"><gentext type="text">1.2 </gentext>Software Internationalization</title><para lang="en">Internationalized software applications include internationalized code and localized locale-specific data.</para><para lang="en">Internationalization <emphasis lang="en">generalizes</emphasis> software by using a single internationalized binary which retrieves locale-specific data and shared objects at run time. The application runs on any localized version of the Solaris operating environment, without requiring source code changes or recompilation.</para><para lang="en">Localization <emphasis lang="en">customizes</emphasis> software data, providing locale-specific modules that meet local requirements. Localization can be either of the following:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Full localization--input, output, print, cultural conventions, and translated message text.</para></listitem><listitem lang="en"><para lang="en">Partial localization--input, output, print, cultural conventions, without translated message text.</para></listitem></itemizedlist><para lang="en">Developers generally create applications for the U.S. market. Internationalization is especially important in newer, smaller markets which don't yet justify full localization. Here, a phased approach is recommended, beginning with the current internationalized Solaris operating environment version, followed by localization as the market grows as illustrated in <link linkend="chapter1-fig-1" lang="en">Figure 1-1</link>. </para><figure float="0" id="chapter1-fig-1" lang="en"><gentext type="text">Figure 1-1 </gentext><title lang="en">Market-entry strategy for localized products</title><graphic filename="figures/Fig1-1.eps.gif" width="759" depth="297" lang="en"/></figure><para lang="en">Sun Microsystems requires that all applications be internationalized.</para></sect1><sect1 id="chapter1-8" lang="en"><title lang="en"><gentext type="text">1.3 </gentext>Benefits of Internationalized Software</title><para lang="en"> Internationalization addresses many of the key software issues:          </para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Improving software quality</para></listitem><listitem lang="en"><para lang="en">Reducing development time and cost</para></listitem><listitem lang="en"><para lang="en">Enabling code reuse</para></listitem><listitem lang="en"><para lang="en">Reducing localization costs</para></listitem><listitem lang="en"><para lang="en">Reducing maintenance costs</para></listitem><listitem lang="en"><para lang="en">Increasing customer satisfaction</para></listitem></itemizedlist><para lang="en">Using separate locale-specific files to localize applications is simpler, faster, and more cost effective on the whole. Software is more easily released world wide at less expense. As well, users are much happier working in their own language with their own conventions.</para><para lang="en">Using a single binary in an internationalized application ensures that the same feature set is available for a particular software version and lessens support, maintenance, and system-administration costs. Interoperability and productivity improves and common training materials can be used. Most importantly, internationalized and localized applications help developers compete and succeed in new foreign-language markets. Everyone benefits with internationalized software.</para></sect1></chapter><chapter id="chapter2-1" lang="en"><gentext type="text">Chapter 2</gentext><gentext type="toc">2.  Internationalized Software for the Solaris Operating Environment</gentext><title lang="en">Internationalized Software for the Solaris Operating Environment</title><sect1 id="chapter2-2" lang="en"><title lang="en"><gentext type="text">2.1 </gentext>Solaris Language-Support Framework</title><para lang="en"/><para lang="en">In an internationalized application, language-specific features and cultural data are separated from application code. The Solaris internationalization framework divides code and language and cultural data into the following three areas:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Locale</para></listitem><listitem lang="en"><para lang="en">Interface localization</para></listitem><listitem lang="en"><para lang="en">Codeset independence</para></listitem></itemizedlist><para lang="en">A <emphasis lang="en">locale</emphasis> is a set of language and cultural variables, particular to a global region. The locale is selected by the user and loaded in memory at run time. The selected locale applies to the operating system and subsequent application launches.</para><para lang="en"><emphasis lang="en">Interface localization</emphasis> is the process of translating the interface language into another language by storing text strings and messages in a separate message file. Messages are more easily composed, translated, and referenced in a separate file than in hard-coded statements throughout the application. Furthermore, recompilation of the source binary is unnecessary.</para><para lang="en"><emphasis lang="en">Codeset independence</emphasis> does not assume a particular codeset to display and manipulate data.</para></sect1><sect1 id="chapter2-3" lang="en"><title lang="en"><gentext type="text">2.2 </gentext>Locale</title><para lang="en">The Solaris operating environment provides a number of locales. Each locale includes:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Associated codeset and codeset conversion modules</para></listitem><listitem lang="en"><para lang="en">Numeric, time, and date formats</para></listitem><listitem lang="en"><para lang="en">Collation (sort order)  </para></listitem><listitem lang="en"><para lang="en">Monetary format</para></listitem><listitem lang="en"><para lang="en">Interface information (messages and icons) </para></listitem><listitem lang="en"><para lang="en">Input method(s)</para></listitem><listitem lang="en"><para lang="en">Fonts</para></listitem></itemizedlist><para lang="en">Developers access locale settings directly through Solaris operating environment APIs. For example, instead of encoding a particular currency symbol, an application calls the appropriate system API, which returns the currency symbol of the set locale.</para></sect1><sect1 id="chapter2-4" lang="en"><title lang="en"><gentext type="text">2.3 </gentext>Interface Localization</title><para lang="en">The Solaris operating environment supports several messaging schemes for localizing the interface, including the Sun proprietary API <computeroutput moreinfo="none" lang="en">gettext()</computeroutput> and the XPG <computeroutput moreinfo="none" lang="en">catgets()</computeroutput>. These APIs directly reference the message file.</para><para lang="en">Note that the size and position of interface elements (icons, graphics, and functions or private data affecting text elements) may be different in different languages. For example, Japanese messages are usually longer than English messages and Japanese ideographs are taller and wider than English characters. Text widget positioning should be relative, not absolute.</para><para lang="en">Icons and graphics should also be culturally neutral or be easily changeable to local tastes. Essentially, what a user sees and what affects text should be changed only in the message catalog, resources, or some other means.</para></sect1><sect1 id="chapter2-5" lang="en"><title lang="en"><gentext type="text">2.4 </gentext>Codeset Independence</title><para lang="en">The Solaris operating environment architecture supports codeset independence (CSI), expanding the number of supported codesets from Extended UNIX\256 Codeset (EUC) to both EUC and non-EUC encodings, including PC-Kanji (also known as ShiftJIS) and GBK.</para><para lang="en">Note that text-handling routines should not define the size of the character codeset. Nor should other locale-specific components, such as the window system, input method, and online help, depend on a particular codeset. <link linkend="chapter2-fig-1" lang="en">Figure 2-1</link> shows the locale-specific components which should be codeset independent.</para><figure float="0" id="chapter2-fig-1" lang="en"><gentext type="text">Figure 2-1 </gentext><title lang="en">Design model for international software</title><graphic filename="figures/Fig2-1.eps.gif" width="759" depth="178" lang="en"/></figure><para lang="en">Support for Unicode, a universal codeset encompassing most written characters, is often confused with codeset independence. Unicode is often referred to as ISO 10646 and is an International Standards Organization (ISO) standard. Note that codeset independence must also apply to Unicode. Although Unicode supports many languages and writing systems, to an application Unicode is just another codeset. The Solaris operating environment supports the Unicode UTF-8 (File System Safe UCS Transformation Format) format, which is compatible with ISO 10646. For more information, see <citetitle lang="en">Unicode Support in the Solaris Operating Environment</citetitle>.</para><note lang="en" role="note"><gentext type="text">Note - </gentext><para lang="en">Codeset independence is often assumed because the idea of a character (in ISO C terms) and char (or byte) is thought of as a one-to-one relationship in programming languages. In written languages, however, the idea of a character can encompass one char/byte or multiple bytes. An alphabetic character from most European languages can be represented in one byte. An Asian-language character often requires more than one byte because there are more characters in the charset than one byte can represent.</para><para lang="en">Furthermore, applications often assume the representation of a given character. For example, a codeset independent application does not assume that `a' = \x61 or char = byte. Instead, during text-manipulation routines, such as truncating a stream of characters, the APIs determine the size of the number of bytes by the character and its definition or type. By not assuming the size of a character or the codeset, the application will be codeset independent.</para><para lang="en">Solaris maintains a codeset independence framework. Applications can use Solaris APIs to determine the size of the number of bytes used by the character and its definition or type. By not making assumptions about the underlying codeset, an application is codeset independent in Solaris.</para></note></sect1></chapter><chapter id="chapter3-1" lang="en"><gentext type="text">Chapter 3</gentext><gentext type="toc">3.  Asian Language Overview</gentext><title lang="en">Asian Language Overview</title><highlights lang="en"><para lang="en">A phonetic writing system, such as English, consists of a collection of phonetic letters to represent a word or idea. Asian languages, such as Chinese, Japanese, and Korean, however, use symbols or <emphasis lang="en">ideographs</emphasis> to represent words and ideas.</para><para lang="en">Chinese, Japanese, and Korean ideographs are all derived from the Chinese ideographic system, numbered in the tens of thousands. Collectively, the ideographs are called <emphasis lang="en">han characters</emphasis> and are referred to as <emphasis lang="en">hanzi</emphasis> in Chinese, <emphasis lang="en">kanji</emphasis> in Japanese, and <emphasis lang="en">hanja</emphasis> in Korean.</para><para lang="en">Note that an ideograph may be pronounced in several ways, depending on the context.  As well, two different ideographs may be identically pronounced. The Solaris operating environment has been designed to include support for contextual ideographs in Asian-language writing systems. </para></highlights><sect1 id="chapter3-2" lang="en"><title lang="en"><gentext type="text">3.1 </gentext>Chinese</title><para lang="en">Two Chinese writing systems are used today--Traditional Chinese and Simplified Chinese. Their ideographs originated in China thousands of years ago.  </para><para lang="en">Used in the Republic of China (Taiwan), Traditional Chinese has approximately 50,000 characters. Many of the older and more complex characters are still used today. <link linkend="chapter3-fig-1" lang="en">Figure 3-1</link> shows Traditional Chinese characters representing the word "China."           </para><figure float="0" id="chapter3-fig-1" lang="en"><gentext type="text">Figure 3-1 </gentext><title lang="en">Traditional Chinese character representing the word "China" </title><graphic filename="figures/Fig3-1.tiff.gif" width="79" depth="40" scale="90" lang="en"/></figure><para lang="en">Used in the People's Republic of China (PRC), Simplified Chinese is a subset of the characters in Traditional Chinese. In 1955, the PRC government started eliminating and simplifying some ideographs by reducing the number of strokes needed to render a character. The Simplified Chinese character set is now simpler and smaller. Figure shows Simplified Chinese characters representing the word "China."           </para><figure float="0" id="chapter3-fig-2" lang="en"><gentext type="text">Figure 3-2 </gentext><title lang="en">Simplified Chinese character representing the word "China"</title><graphic filename="figures/Fig3-2.tiff.gif" width="67" depth="48" scale="160" lang="en"/></figure></sect1><sect1 id="chapter3-3" lang="en"><title lang="en"><gentext type="text">3.2 </gentext>Japanese</title><para lang="en">The Japanese language uses a combination of four different writing systems--<emphasis lang="en">kanji</emphasis> characters, <emphasis lang="en">hiragana</emphasis>, <emphasis lang="en">katakana</emphasis>, and the Roman alphabet phonetic system <emphasis lang="en">romaji</emphasis>.</para><para lang="en">Kanji characters are derived from Traditional Chinese characters and are often found in combination with hiragana, katakana, and romaji.</para><para lang="en">Hiragana is a set of 83 symbols, called a <emphasis lang="en">syllabary</emphasis>, that encompasses all the basic syllables used for Japanese pronunciation. In written Japanese, the hiragana syllabary expresses grammatical parts of speech, verb tenses, and some words for which there are no kanji characters or have become obsolete.  </para><para lang="en">Katakana is another phonetic syllabary consisting of a different set of symbols for the same sounds expressed in hiragana. The syllables represented by hiragana and katakana are generically called <emphasis lang="en">kana. </emphasis><link linkend="chapter3-fig-3" lang="en">Figure 3-3</link> shows the differences between hiragana, katakana, and kanji characters.</para><para lang="en">Romaji is used to write Japanese sounds with Roman letters. Romaji characters are usually displayed in double-width format.           </para><figure float="0" id="chapter3-fig-3" lang="en"><gentext type="text">Figure 3-3 </gentext><title lang="en">The differences between hiragana, katakana, and kanji characters</title><graphic filename="figures/fig3-3.gif" width="235" depth="97" scale="50" lang="en"/></figure></sect1><sect1 id="chapter3-4" lang="en"><title lang="en"><gentext type="text">3.3 </gentext>Korean</title><para lang="en">The Korean language uses a combination of two different writing systems--<emphasis lang="en">hanja</emphasis> characters and <emphasis lang="en">hangul</emphasis> characters.</para><para lang="en">Hanja characters are derived from Traditional Chinese characters and are often used for formal written communication and proper names. An example of hanja is shown in <link linkend="chapter3-fig-4" lang="en">Figure 3-4</link>. </para><figure float="0" id="chapter3-fig-4" lang="en"><gentext type="text">Figure 3-4 </gentext><title lang="en">Hanja characters</title><graphic filename="figures/fig3-4.gif" width="109" depth="57" scale="70" lang="en"/></figure><para lang="en">Hangul characters are formed by combining one or more consonant and vowel signs from a syllabary consisting of 24 basic elements called <emphasis lang="en">jamos.</emphasis> There are approximately 11,000 hangul characters. An example of hangul is shown in <link linkend="chapter3-fig-5" lang="en">Figure 3-5</link>.           </para><figure float="0" id="chapter3-fig-5" lang="en"><gentext type="text">Figure 3-5 </gentext><title lang="en">Hangul characters</title><graphic filename="figures/fig3-5.gif" width="112" depth="59" scale="70" lang="en"/></figure></sect1></chapter><chapter id="chapter4-1" lang="en"><gentext type="text">Chapter 4</gentext><gentext type="toc">4.  Technical Considerations</gentext><title lang="en">Technical Considerations</title><highlights lang="en"><para lang="en">The large number of ideographs needed to support the Traditional Chinese, Simplified Chinese, Japanese, and Korean writing systems cannot be represented in one byte, and are often called double-byte or <emphasis lang="en">multibyte</emphasis> languages, depending on the platform architecture. The Solaris operating environment supports multibyte encoding, representing characters in one, two, or more bytes.  </para><para lang="en">Separate software versions for multibyte locales need not be developed in the Solaris operating environment. However, there are issues unique to multibyte locale development--most importantly, that one character is not one byte in multibyte locales.</para></highlights><sect1 id="chapter4-2" lang="en"><title lang="en"><gentext type="text">4.1 </gentext>Asian-Specific Architecture</title><para lang="en">All localized versions of Solaris software are supersets of the U.S. English version and contain the same utilities and features. The difference between the U.S. English and a localized version is the addition of locale-specific data and tools facilitating input, display, and printing of local-language characters. All Asian versions of Solaris software include a locale database, user interface, and other locale-specific features. For example, <link linkend="chapter4-fig-1" lang="en">Figure 4-1</link> shows how the locale database fits into the Japanese Solaris architecture.</para><figure float="0" id="chapter4-fig-1" lang="en"><gentext type="text">Figure 4-1 </gentext><title lang="en">A Japanese Solaris architecture</title><graphic filename="figures/Fig4-1.eps.gif" width="759" depth="231" lang="en"/></figure><para lang="en">Specific features were also added to the Traditional Chinese, Simplified Chinese, Japanese, and Korean localized versions of the Solaris operating environment to address the following issues:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">The thousands of characters used in everyday communication</para></listitem><listitem lang="en"><para lang="en">Ideographs with multiple meanings depending on context or pronunciation</para></listitem></itemizedlist><sect2 id="chapter4-3" lang="en"><title lang="en"><gentext type="text">4.1.1 </gentext>Input Methods</title><para lang="en">How to enter thousands of characters is always an important issue in a multibyte language. Designing a keyboard with enough keys is simply not feasible. Instead, localized Solaris operating environments use <emphasis lang="en">input methods</emphasis>. Input methods (IMs) are system applications that convert keyboard input into a system-supported character. <link linkend="chapter4-fig-2" lang="en">Figure 4-2</link> shows how an input method works. </para><figure float="0" id="chapter4-fig-2" lang="en"><gentext type="text">Figure 4-2 </gentext><title lang="en">Input method</title><graphic filename="figures/Fig4-2.eps.gif" width="759" depth="180" lang="en"/></figure><para lang="en">Generally, the Motif text widget manages the input method. However, to customize the input method or have direct control, call the <computeroutput moreinfo="none" lang="en">X11 XIM</computeroutput> (<computeroutput moreinfo="none" lang="en">X Input Method</computeroutput>)  APIs.</para><note lang="en" role="note"><gentext type="text">Note - </gentext><para lang="en">An application cannot assume a one-to-one mapping between a key-input stroke and a character. A single character may require more than a one key-input stroke and a one key-input event may trigger the input of more than one character.</para></note></sect2><sect2 id="chapter4-4" lang="en"><title lang="en"><gentext type="text">4.1.2 </gentext>Character Conversion</title><para lang="en">In Chinese, Japanese, and Korean, more than one ideograph can correspond to an input string. To avoid confusion, the Solaris operating environment uses a <emphasis lang="en">Conversion Manager</emphasis> to display the possible dictionary choices in a window as shown in <link linkend="chapter4-fig-3" lang="en">Figure 4-3</link>. The pre-edit, status, and lookup choice areas are highlighted for the sample Simplified Chinese input-method.</para><figure float="0" id="chapter4-fig-3" lang="en"><gentext type="text">Figure 4-3 </gentext><title lang="en">The pre-edit, status, and lookup choice areas</title><graphic filename="figures/Fig4-3.tiff.gif" width="371" depth="181" scale="60" lang="en"/></figure><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Pre-edit area--displays characters as entered</para></listitem><listitem lang="en"><para lang="en">Status area--displays whether conversion is activated and the states or mode of the input method</para></listitem><listitem lang="en"><para lang="en">Lookup choice area--displays ideographic choices for the corresponding phonetic representation</para></listitem></itemizedlist><note lang="en" role="note"><gentext type="text">Note - </gentext><para lang="en">Input methods and associated dictionaries are often referred to as <emphasis lang="en">language engines</emphasis>.</para></note><note lang="en" role="note"><gentext type="text">Note - </gentext><para lang="en">For more information on how to input Asian characters, see <emphasis lang="en">Section 3.1</emphasis> in the <citetitle lang="en">Unicode Support in the Solaris Operating Environment</citetitle>.</para></note></sect2><sect2 id="chapter4-5" lang="en"><title lang="en"><gentext type="text">4.1.3 </gentext>Input-Method Server</title><para lang="en">An input-method server (IM server) acts as the interface between input methods and applications as shown in <link linkend="chapter4-fig-4" lang="en">Figure 4-4</link>.</para><figure float="0" id="chapter4-fig-4" lang="en"><gentext type="text">Figure 4-4 </gentext><title lang="en">Input-method server</title><graphic filename="figures/Fig4-4.eps.gif" width="607" depth="419" scale="80" lang="en"/></figure><para lang="en">The IM server can support multiple language engines and provides user control over language-engine preferences, such as:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en">Method of displaying status string when the portion of the string under consideration for conversion loses input focus.</para></listitem><listitem lang="en"><para lang="en">Number of rows and columns in the input conversion candidate pop-up window.</para></listitem><listitem lang="en"><para lang="en">Whether input conversion candidate selection window is displayed.</para></listitem></itemizedlist><para lang="en">Many X toolkit-based applications automatically use the IM server for Asian text input. If you use any of Sun's toolkits (Motif, XViewTM, or OLIT), the input/output conversion process is transparent to the application.</para></sect2><sect2 id="chapter4-6" lang="en"><title lang="en"><gentext type="text">4.1.4 </gentext>Font Editor</title><para lang="en">The Font Editor is used to edit bitmap fonts. For example, a user may want to create a character not supported by the operating system because the repertoire of Han characters is too large. Using the Font Editor, new characters can be created and existing characters modified. </para><para lang="en">To start the Font Editor, type <computeroutput moreinfo="none" lang="en">fontedit</computeroutput> at the system prompt.</para></sect2><sect2 id="chapter4-7" lang="en"><title lang="en"><gentext type="text">4.1.5 </gentext>User-Defined Character Tool</title><para lang="en">The User-Defined Character Tool is used to create new characters as well as to specify font size for new characters. This utility can support both bitmap and Type 1 fonts. </para><para lang="en">To start the User-Defined Character Tool, type <computeroutput moreinfo="none" lang="en">sdtudctool</computeroutput> at the system prompt.</para></sect2></sect1></chapter><chapter id="chapter5-1" lang="en"><gentext type="text">Chapter 5</gentext><gentext type="toc">5.  Common Development Issues</gentext><title lang="en">Common Development Issues</title><highlights lang="en"><para lang="en">Writing and cultural conventions can vary greatly in different locales, such as character sets and numeric, time, date, and monetary formats. Some issues apply particularly to multibyte development.</para></highlights><sect1 id="chapter5-2" lang="en"><title lang="en"><gentext type="text">5.1 </gentext>Casing</title><para lang="en">Uppercase and lowercase words do not always apply in multibyte languages. For example, ideographs don't have case. Thus, characters that do not change after a casing function should not be treated as an error when calling an API which returns casing rules.  </para><para lang="en">The following APIs process multibyte characters:                                                  </para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">toupper()</computeroutput>: Convert wide characters to uppercase</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">tolower()</computeroutput>: Convert wide characters to lowercase</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">wctype()</computeroutput>: Define character class</para></listitem></itemizedlist></sect1><sect1 id="chapter5-3" lang="en"><title lang="en"><gentext type="text">5.2 </gentext>Sort Order</title><para lang="en">Sorting conventions vary widely across languages and locales. Some languages even have different rules for collating the same character. Sorting ideographs is different than sorting phonetic scripts and is based on either the form or pronunciation of characters.</para><para lang="en">A form-based system sorts first on the character's primary radical and then on the number of strokes to write the character (stroke count). A pronunciation-based system sorts first on ideograph pronunciation and then on stroke count.</para></sect1><sect1 id="chapter5-4" lang="en"><title lang="en"><gentext type="text">5.3 </gentext>Text Manipulation</title><para lang="en">When supporting multibyte languages, it is important to understand the difference between multibyte, wide and Unicode characters, and the impact of these on software development.  </para><para lang="en">In the Solaris operating environment, a multibyte character (or file code) is a sequence of one or more bytes terminated by a null string. Thus, a string may contain characters of different length. On the other hand, a wide character (or process code) is defined as a fixed-size number of bytes. In the Solaris operating environment, a wide character is defined to be four bytes long. The Solaris operating environment supports the Unicode UTF-8 format, a variable-length encoding similar to multibyte encoding </para><para lang="en">In many cases, there is no need to distinguish double-byte (or three-byte) characters from single-byte characters. It is simpler to convert multibyte strings (file code) to wide-character formats (process code) before manipulating or processing text data.  </para><para lang="en">The following APIs convert multibyte characters:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">mbstowcs()</computeroutput>: Convert multibyte string to wide-character string</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">mbstowc()</computeroutput>: Convert multibyte to wide-character code</para></listitem></itemizedlist><para lang="en">The following <computeroutput moreinfo="none" lang="en">wstring(3c)</computeroutput> APIs process multibyte characters:</para><itemizedlist lang="en" mark="bullet"><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">wcscmp()</computeroutput>: Compare wide-character strings</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">wcscpy()</computeroutput>: Copy wide-character strings</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">wcslen()</computeroutput>: Get length of wide-character string</para></listitem><listitem lang="en"><para lang="en"><computeroutput moreinfo="none" lang="en">wcschr()</computeroutput>: Find character in wide-character string</para></listitem></itemizedlist><para lang="en"/><note lang="en" role="note"><gentext type="text">Note - </gentext><para lang="en">File code is in multibyte format. Process code is in wide-character format. Do not assume particular character encodings of the process code.</para></note></sect1><sect1 id="chapter5-5" lang="en"><title lang="en"><gentext type="text">5.4 </gentext>Fonts</title><para lang="en">Mixed codeset strings cannot usually be rendered with a single font. A font set is a collection of fonts suitable for rendering all codesets in a locale's encoding, and includes data about the locale in which it was created. For example, in the Korean locale, both the ASCII and Korean fonts are loaded. This is known as <computeroutput moreinfo="none" lang="en">FontSet</computeroutput> in the X11 Window System. The number of fonts and their character-set registry in a <computeroutput moreinfo="none" lang="en">FontSet</computeroutput> vary from one locale to another. Because the Solaris operating environment manages the <computeroutput moreinfo="none" lang="en">FontSet</computeroutput> at run time, applications do not need to know that multiple fonts are being used. You just need to use <computeroutput moreinfo="none" lang="en">FontSet</computeroutput> interfaces.  </para><para lang="en">Common font family names, such as Times and Courier, are not usually available in multibyte locales. Locale-sensitive font family names should not be hard coded in applications. In the Common Desktop Environment, all locales have a common set of font alias names, such as <computeroutput moreinfo="none" lang="en">dt-application</computeroutput>.</para></sect1></chapter><appendix id="appendixa-1" lang="en"><gentext type="text">Appendix A</gentext><gentext type="toc">A.  Product Overview</gentext><title lang="en">Product Overview</title><highlights lang="en"><para lang="en">The following tables show features for the Common Desktop Environment (CDE), Simplified Chinese, Traditional Chinese, Japanese and Korean Solaris 8 operating environment. </para></highlights><sect1 id="appendixa-2" lang="en"><title lang="en"><gentext type="text">A.1 </gentext>Common Desktop Environment (CDE) Deskset</title><para lang="en">Table A-1 lists the CDE Deskset tools and Table A-2 lists the Asian printing tools.</para><table frame="topbot" id="appendixa-tbl-1" lang="en"><gentext type="text">Table A-1 </gentext><title lang="en">CDE Deskset Tools</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">DeskSet Tool</para></entry><entry lang="en"><para lang="en">Features</para></entry></row></thead><tbody lang="en"><row lang="en"><entry lang="en"><para lang="en">Style Manager</para></entry><entry lang="en"><para lang="en">Provides interactive customization of visual elements and system behavior for the desktop</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Calendar</para></entry><entry lang="en"><para lang="en">Enables group scheduling over the network, displays appointments and to-do items, sends automatic reminders using electronic mail</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Mailer</para></entry><entry lang="en"><para lang="en">Write, send, receive, and organize mail files including audio, image, and document files using simple drag-and-drop method</para></entry></row><row lang="en"><entry lang="en"><para lang="en">File
Manager</para></entry><entry lang="en"><para lang="en">Graphical way to navigate local and remote file systems: view, copy, or move files and documents, launch applications by point and click</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Printer</para></entry><entry lang="en"><para lang="en">An intuitive interface to UNIX printing utilities</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Tape Tool</para></entry><entry lang="en"><para lang="en">Interface to UNIX tape archiving and retrieval utilities</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Performance Meters</para></entry><entry lang="en"><para lang="en">Allows system use to be monitored graphically</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Audio Tool</para></entry><entry lang="en"><para lang="en">An application to record, playback, and edit audio files</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Image Tool</para></entry><entry lang="en"><para lang="en">View files in popular graphic formats
such as PostScript or TIFF</para></entry></row><row lang="en"><entry lang="en"><para lang="en">Text Editor</para></entry><entry lang="en"><para lang="en">An interactive text editor with mouse-based graphical interface</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">Snapshot</para></entry><entry colname="colspec1" lang="en"><para lang="en">Capture a black-and-white, grayscale, or color snapshot of the screen</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">Clock</para></entry><entry colname="colspec1" lang="en"><para lang="en">Displays the current time in a window or icon for any time zone around the world</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">Icon Editor</para></entry><entry colname="colspec1" lang="en"><para lang="en">A pixel editor that allows creation of customized icon images</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">Command/Shell Tool</para></entry><entry colname="colspec1" lang="en"><para lang="en">Standard UNIX shell that accepts SunOSTM system software commands</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">sdtudctool </para></entry><entry colname="colspec1" lang="en"><para lang="en">Tool for registering user-defined characters</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-2" lang="en"><gentext type="text">Table A-2 </gentext><title lang="en">Asian Printing Tools</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Asian Printing Tools</para></entry><entry lang="en"><para lang="en">Features</para></entry></row></thead><tbody lang="en"><row lang="en"><entry lang="en"><para lang="en">xetops</para></entry><entry lang="en"><para lang="en">Used in EUC/CSI locales to print Asian text files: <computeroutput moreinfo="none" lang="en">xetops Asian_Text_File |lp</computeroutput></para></entry></row><row lang="en"><entry lang="en"><para lang="en">mp</para></entry><entry lang="en"><para lang="en">Used in UTF-8 locales to print all UTF-8 characters: <computeroutput moreinfo="none" lang="en">mp UTF-8_File |lp</computeroutput></para></entry></row></tbody></tgroup></table><para lang="en">                                                                                                                                                                                                                                                                                                                                                                                                                                            For more information, see the appropriate <computeroutput moreinfo="none" lang="en">man</computeroutput> pages.</para></sect1><sect1 id="appendixa-3" lang="en"><title lang="en"><gentext type="text">A.2 </gentext>Simplified Chinese Solaris 8 Operating Environment Features</title><para lang="en">Table A-3 to Table A-6  give an overview of the Simplified Chinese Solaris 8 operating environment features.</para><table frame="topbot" id="appendixa-tbl-3" lang="en"><gentext type="text">Table A-3 </gentext><title lang="en">Simplified Chinese Codesets</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Description</para></entry><entry lang="en"><para lang="en">Supported Character Set</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh</para></entry><entry colname="colspec3" lang="en"><para lang="en">Simplified Chinese (EUC)  </para></entry><entry colname="colspec4" lang="en"><para lang="en">GB 2312-1980</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Simplified Chinese (GBK) </para></entry><entry colname="colspec4" lang="en"><para lang="en">GBK  </para></entry></row><row lang="en"><entry lang="en"><para lang="en">zh.UTF-8</para></entry><entry lang="en"><para lang="en">Simplified Chinese (UTF-8) </para></entry><entry lang="en"><para lang="en">Unicode
 3.0</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-4" lang="en"><gentext type="text">Table A-4 </gentext><title lang="en">Simplified Chinese Input Methods</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Input Methods</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">New QuanPin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">New ShuangPin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Quanpy</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh</para></entry><entry colname="colspec1" lang="en"><para lang="en">PinYin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh</para></entry><entry colname="colspec1" lang="en"><para lang="en">Stroke</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh</para></entry><entry colname="colspec1" lang="en"><para lang="en">Golden</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh</para></entry><entry colname="colspec1" lang="en"><para lang="en">Intelligent PinYin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh</para></entry><entry colname="colspec1" lang="en"><para lang="en">Simplified Chinese Symbol</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">GBK Code</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Japanese</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Hanja</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Zhuyin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Unicode Hex and Unicode Octal</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-5" lang="en"><gentext type="text">Table A-5 </gentext><title lang="en">Simplified Chinese Fonts</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Full Family Name</para></entry><entry lang="en"><para lang="en">Format</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh, zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Fangson</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh, zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Hei</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh, zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Kai</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh, zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Song</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh, zh.GBK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Song</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-6" lang="en"><gentext type="text">Table A-6 </gentext><title lang="en">Simplified Chinese Codeset Conversions</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Codeset Conversion Supported</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">GB2312-80</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO-2022-7</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO-2022-CN</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">UTF-8</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">GBK</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">BIG5</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh, zh.GBK, zh.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">HZ-GB-2312</para></entry></row></tbody></tgroup></table></sect1><sect1 id="appendixa-4" lang="en"><title lang="en"><gentext type="text">A.3 </gentext>Traditional Chinese Solaris 8 Operating Environment Features</title><para lang="en">Table A-7 to Table A-10 give an overview of the Traditional Chinese Solaris 8 operating environment features.</para><table frame="topbot" id="appendixa-tbl-7" lang="en"><gentext type="text">Table A-7 </gentext><title lang="en">Traditional Chinese Codesets</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Description</para></entry><entry lang="en"><para lang="en">Supported Character Set</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW</para></entry><entry colname="colspec3" lang="en"><para lang="en">Traditional Chinese (EUC)  </para></entry><entry colname="colspec4" lang="en"><para lang="en">CNS 11643 1992</para></entry></row><row lang="en"><entry lang="en"><para lang="en">zh_TW.BIG5</para></entry><entry lang="en"><para lang="en">Traditional Chinese (BIG5) </para></entry><entry lang="en"><para lang="en">BIG5  </para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Traditional Chinese (UTF-8) </para></entry><entry colname="colspec4" lang="en"><para lang="en">Unicode 3.0</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-8" lang="en"><gentext type="text">Table A-8 </gentext><title lang="en">Traditional Chinese Input Methods</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Input Methods</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Chuyin</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">I-Tien</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Telecode</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">TsangChieh</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">CheinI</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">NeiMA</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ChuangHsing</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Array</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">BoShiaMy</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">DaYi</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Unicode Hex and Unicode Octal</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-9" lang="en"><gentext type="text">Table A-9 </gentext><title lang="en">Traditional Chinese Fonts</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Full Family Name</para></entry><entry lang="en"><para lang="en">Format</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW, zh_TW.BIG5</para></entry><entry colname="colspec3" lang="en"><para lang="en">Hei</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW, zh_TW.BIG5</para></entry><entry colname="colspec3" lang="en"><para lang="en">Kai</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW, zh_TW.BIG5</para></entry><entry colname="colspec3" lang="en"><para lang="en">Ming</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">zh_TW, zh_TW.BIG5</para></entry><entry colname="colspec3" lang="en"><para lang="en">Ming</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-10" lang="en"><gentext type="text">Table A-10 </gentext><title lang="en">Traditional Chinese Codeset Conversions</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Codeset Conversion Supported</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">CNS 11643</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">BIG5</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO-2022-7</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO-2022-CN-EXT</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">UTF-8</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">zh_TW, zh_TW.BIG5, zh_TW.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">BIG5 Plus</para></entry></row></tbody></tgroup></table></sect1><sect1 id="appendixa-5" lang="en"><title lang="en"><gentext type="text">A.4 </gentext>Japanese Solaris 8 Operating Environment Features</title><para lang="en">Table A-11 to Table A-14 give an overview of the Japanese Solaris 8 operating environment features.</para><table frame="topbot" id="appendixa-tbl-11" lang="en"><gentext type="text">Table A-11 </gentext><title lang="en">Japanese Codesets</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Description</para></entry><entry lang="en"><para lang="en">Supported Character Set</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ja</para></entry><entry colname="colspec3" lang="en"><para lang="en">Japanese (EUC)  </para></entry><entry colname="colspec4" lang="en"><para lang="en">JIS x 0201, JIS x 0208, JIS x 0212, UDC, VDC</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ja_JP.PCK</para></entry><entry colname="colspec3" lang="en"><para lang="en">Japanese (PCK) </para></entry><entry colname="colspec4" lang="en"><para lang="en"> JIS x 0201, JIS x 0208, UDC, VDC</para></entry></row><row lang="en"><entry lang="en"><para lang="en">ja_JP.UTF-8</para></entry><entry lang="en"><para lang="en">Japanese (UTF-8) </para></entry><entry lang="en"><para lang="en">  Unicode 3.0</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-12" lang="en"><gentext type="text">Table A-12 </gentext><title lang="en">Japanese Input Methods</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="194*"/><colspec colname="colspec1" colwidth="480*"/><thead lang="en"><row rowsep="1" lang="en"><entry align="left" valign="bottom" lang="en"><para lang="en">Locale Name</para></entry><entry align="left" valign="bottom" lang="en"><para lang="en">Input Methods</para></entry></row></thead><tbody lang="en"><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">ATOK12</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">Wnn6</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">cs00</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ATOK8</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">Unicode Hex and Unicode Octal</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-13" lang="en"><gentext type="text">Table A-13 </gentext><title lang="en">Japanese Fonts</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Full Family Name</para></entry><entry lang="en"><para lang="en">Format</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">hg gothic b</para></entry><entry colname="colspec3" lang="en"><para lang="en">True Type</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">hg mincho 1</para></entry><entry colname="colspec3" lang="en"><para lang="en">True Type</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">heiseimin</para></entry><entry colname="colspec3" lang="en"><para lang="en">True Type</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">gothich</para></entry><entry colname="colspec3" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">minchou</para></entry><entry colname="colspec3" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">hg gothic b</para></entry><entry colname="colspec3" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">hg minchou 1</para></entry><entry colname="colspec3" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">heiseimin</para></entry><entry colname="colspec3" lang="en"><para lang="en">PCF (12,14,16,20,24)</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-14" lang="en"><gentext type="text">Table A-14 </gentext><title lang="en">Japanese Codeset Conversions</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="194*"/><colspec colname="colspec1" colwidth="480*"/><thead lang="en"><row rowsep="1" lang="en"><entry align="left" valign="bottom" lang="en"><para lang="en">Locale Name</para></entry><entry align="left" valign="bottom" lang="en"><para lang="en">Codeset Conversion Supported</para></entry></row></thead><tbody lang="en"><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">eucJP</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">SJIS</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">PCK</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">ISO-2022-JP</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">UTF-8</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">JIS7</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">jis</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibmj</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">UTF-8-Java </para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibmj-EBCDIK</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibm-930</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibm-931</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibm-939</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibm-5026</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ibm-5035</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ms-932</para></entry></row><row lang="en"><entry align="left" valign="top" lang="en"><para lang="en">ja, ja_JP.PCK, ja_JP.UTF-8</para></entry><entry align="left" valign="top" lang="en"><para lang="en">UTF-8-ms932</para></entry></row></tbody></tgroup></table></sect1><sect1 id="appendixa-6" lang="en"><title lang="en"><gentext type="text">A.5 </gentext>Korean Solaris 8 Operating Environment Features</title><para lang="en">Table A-15 to Table A-18 give an overview of the Korean Solaris 8 operating environment features.</para><table frame="topbot" id="appendixa-tbl-15" lang="en"><gentext type="text">Table A-15 </gentext><title lang="en">Korean Codesets</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Description</para></entry><entry lang="en"><para lang="en">Supported Character Set</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko</para></entry><entry colname="colspec3" lang="en"><para lang="en">Korean (EUC)  </para></entry><entry colname="colspec4" lang="en"><para lang="en">KS C 5601-1992</para></entry></row><row lang="en"><entry lang="en"><para lang="en">ko.UTF-8</para></entry><entry lang="en"><para lang="en">Korean (UTF-8) </para></entry><entry lang="en"><para lang="en">Unicode 3.0 </para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-16" lang="en"><gentext type="text">Table A-16 </gentext><title lang="en">Korean Input Methods</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Input Methods</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Hangul 2-BeolSik (1 set of consonants and 1 set of vowels )</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Hangul-Hanja conversion</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Special character</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Unicode Hex and Unicode Octal</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-17" lang="en"><gentext type="text">Table A-17 </gentext><title lang="en">Korean Fonts</title><tgroup cols="3" colsep="0" rowsep="0" lang="en"><colspec colname="colspec2" colwidth="32.82*"/><colspec colname="colspec3" colwidth="32.26*"/><colspec colname="colspec4" colwidth="33.93*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Full Family Name</para></entry><entry lang="en"><para lang="en">Format</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Gothic</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Haeso</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Kodig</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Myeongijo</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Roundgothic</para></entry><entry colname="colspec4" lang="en"><para lang="en">TrueType</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Gothic</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Graphic</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Haeso</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Kodig</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Myeongijo</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Pilki</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row><row lang="en"><entry colname="colspec2" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec3" lang="en"><para lang="en">Roundgothic</para></entry><entry colname="colspec4" lang="en"><para lang="en">PCF (14,16,18,20,24)</para></entry></row></tbody></tgroup></table><table frame="topbot" id="appendixa-tbl-18" lang="en"><gentext type="text">Table A-18 </gentext><title lang="en">Korean Codeset Conversions</title><tgroup cols="2" colsep="0" rowsep="0" lang="en"><colspec colname="colspec0" colwidth="28.89*"/><colspec colname="colspec1" colwidth="71.11*"/><thead lang="en"><row rowsep="1" lang="en"><entry lang="en"><para lang="en">Locale Name</para></entry><entry lang="en"><para lang="en">Codeset Conversion Supported</para></entry></row></thead><tbody lang="en"><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">KSC 5601-1987</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO 646</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">UTF-8</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">IBM CP933</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">ISO 2022-KR</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">KSC 5601-1987-Johap</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">KSC 5601-1992-Johap</para></entry></row><row lang="en"><entry colname="colspec0" lang="en"><para lang="en">ko, ko.UTF-8</para></entry><entry colname="colspec1" lang="en"><para lang="en">Unified Hangul</para></entry></row></tbody></tgroup></table><sect2 id="appendixa-6-1" lang="en"><title lang="en"><gentext type="text">A.5.1 </gentext>Korean Dictionary Tools</title><para lang="en">Hanja Tool expands the capabilities of the standard Korean Solaris operating environment hangul-hanja conversion mode by adding hanja ideograms and managing available lookup choices for hangul-hanja conversion. It works only with words of more than one syllable.</para></sect2></sect1></appendix></book>