Appendix A: Reference Tables
December 31, 2007 AT Commands Reference Manual A-35
Character Set Table CS2: (ASCII <-> UTF-8)
The following table shows the conversion between the ASCII and UTF-8 character sets.
Character Set Table CS3: (UCS-2 <-> UTF-8)
The following table shows the conversion between the UCS-2 and UTF-8 character sets.
Note: Conversion from the default GSM alphabet to the above character set is straightforward.
Conversions of the characters listed below the table are not supplied.
Character Set Table CS6: (UCS-2 Full table)
Character Set Table CS6: (UCS-2) is provided on CD due to its size.
Character Set Table CS7: (ASCII table)
The following table shows the conversion for the ASCII character set.
ASCII-7bit Byte Encoding UTF-8 Bit Encoding
00 - 7F 0xxxxxxx
UCS2 UTF-8
Byte Encoding Bit Encoding Byte 1 Byte 2 Byte 3
0000 - 007F 000000000xxxxxx
x
0xxxxxxx
0080 - 07FF 00000yyyyyxxxxx
x
110yyyyy 10xxxxxx
0800 - FFFF zzzzyyyyyyxxxxxx 1110zzzz 10yyyyyy 10xxxxxx
Decimal Octal Hex Binary Value Description
000 000 000 00000000 NUL (Null char.)
001 001 001 00000001 SOH (Start of
Header)
002 002 002 00000010 STX (Start of Text)
003 003 003 00000011 ETX (End of Text)
004 004 004 00000100 EOT (End of
Transmission)