Contained WithinFind More DocumentationFeatured Support Resources | Download this book in PDF (3243 KB)
Indic LocalizationPhonetic lookup based input method (Shabdalipi) and continuous phonetic input method are available for all Indic languages which are supported in the UTF-8 locale. The input methods and virtual keyboards allow you to enter Indic text in all of the CDE applications. The following data flow illustrates the workings of the Indic input
process. How to Use the Indic Input Methods
Indic KeyboardsThe following figures show the keyboard layouts that are available for the Indic input method. The following figure shows the layout of the Bengali keyboard. ![]() The following figure shows the layout of the Devanagari keyboard. ![]() The following figure shows the layout of the Gujarati keyboard. ![]() The following figure shows the layout of the Gurmukhi keyboard. ![]() The following figure shows the layout of the Kannada keyboard. ![]() The following figure shows the layout of the Malayalam keyboard. ![]() The following figure shows the layout of the Tamil keyboard. ![]() The following figure shows the layout of the Teluga keyboard.
Understanding the MappingsThe images in Mapping for the Continuous Phonetic Based Input Method show the mappings between English tokens and their equivalent codepoints in each of the target scripts supported. The CONSONANT category means the mapping is between the English tokens and consonants of the script. The VOWEL category means that mapping from English tokens and vowels of the script. The OTHER category includes mapping of characters that do not exhibit the properties of consonants and vowels (whose form does not change depending on the surrounding character). The keywords CONSONANT, VOWEL and OTHER also mean that these characters are part of Unicode standard. The section SPECIAL CONSONANT, SPECIAL VOWEL or SPECIAL OTHER means that though in principle these characters display the properties of consonants, vowels or others they are not officially part of the Unicode standard and are font dependent. They are assigned codepoint values in Unicode Private User Area. They are supported in Solaris UTF-8 locales and the mapping may not work in a different platform. These mapfiles are not the same as the ones in your system, but slightly edited ones for removing unneeded keywords for the context of this discussion. In the VOWELS and SPECIAL VOWELS section, an independent form and a dependent form is displayed for the same English token depending on the context. See How the Continuous Phonetic Input Method Works. The malayalam script contains a special ‘CHILLU’ section, that is actually the SPECIAL OTHER category. Mapping for the Continuous Phonetic Based Input MethodThe following figures show the existing mappings from English to the phonetic equivalent characters in the target Indic scripts. Use these illustrations as a reference until you know all the mappings for the script that you use. Mappings given here are intuitive, so you should be able to input most of the characters without looking up the illustration. Note – In these mappings, special characters such as ‘.’ and ‘|’ included as part of the mapping are escaped with a ‘\’ character. If not escaped, the ‘|’ character acts as a separator when more than one token represents the same UTF-8 character. Figure 4–1, Figure 4–2, and Figure 4–3 show the English to Bengali mappings for consonants, vowels, and others. Figure 4–1 Map for Bengali Consonants
Figure 4–2 Map for Bengali Vowels
Figure 4–3 Map for Bengali Others
Figure 4–4, Figure 4–5, and Figure 4–6 show the English to Gujarati mappings for consonants, vowels, and others. Figure 4–4 Map for Gujarati Consonants
Figure 4–5 Map for Gujarati Vowels
Figure 4–6 Map for Gujarati Others
Figure 4–7, Figure 4–8, and Figure 4–9 show the English to Gurmukhi mappings for consonants, vowels, and others. Figure 4–7 Map for Gurmukhi Consonants
Figure 4–8 Map for Gurmukhi Vowels
Figure 4–9 Map for Gurmukhi Others
Figure 4–10, Figure 4–11, and Figure 4–12 show the English to Hindi mappings for consonants, vowels, and others. Figure 4–10 Map for Hindi Consonants
Figure 4–11 Map for Hindi Vowels
Figure 4–12 Map for Hindi Others
Figure 4–13, Figure 4–14, and Figure 4–15 show the English to Kannada mappings for consonants, vowels, and others. Figure 4–13 Map for Kannada Consonants
Figure 4–14 Map for Kannada Vowels
Figure 4–15 Map for Kannada Others
Figure 4–16, Figure 4–17, and Figure 4–18 show the English to Malayalam mappings for consonants, vowels, and others. Figure 4–16 Map for Malayalam Consonants
Figure 4–17 Map for Malayalam Vowels
Figure 4–18 Map for Malayalam Others
Figure 4–19 and Figure 4–20 show the English to Tamil mappings for consonants and vowels. Figure 4–19 Map for Tamil Consonants
Figure 4–20 Map for Tamil Vowels
Figure 4–21,Figure 4–22, and Figure 4–23 show the English to Telugu mappings for consonants, vowels, and others. Figure 4–21 Map for Telugu Consonants
Figure 4–22 Map for Telugu Vowels
Figure 4–23 Map for Telugu Others
How the Continuous Phonetic Input Method WorksFor each Indic script, a ‘virama’ or equivalent sign combined with a consonant gives the half form (or ready to combine form) of the consonant. Whenever a multiple key combination corresponding to a consonant is typed, the consonant + virama form is output, symbolizing that the characters are ready to combine. Consonants, at initial input, will assume their half form and will be a full syllable or their variation when followed by a vowel. Two consecutive consonants remain as the ready to combine half forms. Half forms can be converted by the layout engine as a single combined character or can remain as those independent forms that are also syntactically valid for every language. Any vowel that forms the beginning of a word or is followed by another vowel appears in independent form. A vowel that immediately follows a consonant assumes dependent forms. Characters that do not change shapes in any context are called others. These characters are neither consonants nor vowels. Digits and other punctuation marks that do not form a part of a character are mapped one to one. Using these principles, a parser is written that will parse the input into these different categories and output the language-specific Unicode codepoints. The continuous phonetic input method engine does not deal with layout or rendering, which will be done by other modules in the system. |