EasyManua.ls Logo

Motorola g20 - Character Sets; ASCII Character Set Management; GSM Character Set Management; UCS2 Character Set Management

Motorola g20
352 pages
Print Icon
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Loading...
98-08901C68-O 15
Product Features
2.8 CHARACTER SETS
The following lists references to various tables that provide conversions between the different character sets.
For the full content of a specific conversion table, refer to Appendix A, Character Set Tables.
2.8.1 ASCII Character Set Management
The American Standard Code for Information Interchange (ASCII) is a standard seven-bit code that was proposed by ANSI in
1963, and finalized in 1968. ASCII was established to achieve compatibility between various types of data processing
equipment. Later-day standards that document ASCII include ISO-14962-1997 and ANSI-X3.4-1986 (R1997).
2.8.2 GSM Character Set Management
GSM is the default alphabet, as described in section 8.7 (GSM character table) .
g20 can store messages coded in any alphabet on the SIM, irrespective of support of an individual alphabet.
The default alphabet is based on 7bit characters.
For more information, refer to ETSI GSM 3.38 v561.
2.8.3 UCS2 Character Set Management
UCS is the first officially standardized coded character set, eventually to include the characters of all the written languages in
the world, as well as all mathematical and other symbols.
Unicode can be characterized as the (restricted) 2-octet form of UCS on (the most general) implementation level 3, with the
addition of a more precise specification of the bi-directional behavior of characters, as used in the Arabic and Hebrew scripts.
The 65,536 positions in the 2-octet form of UCS are divided into 256 rows with 256 cells in each. The first octet of a character
representation denotes the row number, the second the cell number. The first row (row 0) contains exactly the same characters
as ISO/IEC 8859-1. The first 128 characters are thus the ASCII characters. The octet representing an ISO/IEC 8859-1 character
is easily transformed to the representation in UCS by placing a 0 octet in front of it. UCS includes the same control characters
as ISO/IEC 8859 (also in row 0).
Table 2. References to Character Set Conversion Tables
From \ To GSM ASCII UTF8 UCS2 ISO-8859-1
ETSI 03.38 GSM Table CS5 Table CS1
ASCII Table CS7 Table CS2
UTF8 Table CS2 Table CS3
ISO/IEC 10646 UCS2 Table CS3 Table CS6
ISO/IEC 8859-1 ISO-8859-1 Tabl e CS4

Table of Contents

Related product manuals