All text is internally processed using the Unicode character set UTF-16. The maximum size of a text is 2 GB in bytes, leading to a maximum text length of 1 billion characters (1.073.741.824).
Functions such as [[BLOB_TO_TEXT]] and [[HTTPGET_TEXT]] supports exchange with other popular character sets such as:
- `ASCII`
- `BigEndianUnicode`
- `UTF7`
- `Unicode`
- `UTF32`
- `UTF8`
Other character sets supported depend on the platform, but typically include:
- `ASMO-708` - Arabic (ASMO 708)
- `DOS-720` - Arabic (DOS)
- `DOS-862` - Hebrew (DOS)
- `EUC-JP` - Japanese (JIS 0208-1990 and 0212-1990)
- `IBM-Thai` - IBM EBCDIC (Thai)
- `IBM00858` - OEM Multilingual Latin I
- `IBM00924` - IBM Latin-1
- `IBM01047` - IBM Latin-1
- `IBM01140` - IBM EBCDIC (US-Canada-Euro)
- `IBM01141` - IBM EBCDIC (Germany-Euro)
- `IBM01142` - IBM EBCDIC (Denmark-Norway-Euro)
- `IBM01143` - IBM EBCDIC (Finland-Sweden-Euro)
- `IBM01144` - IBM EBCDIC (Italy-Euro)
- `IBM01145` - IBM EBCDIC (Spain-Euro)
- `IBM01146` - IBM EBCDIC (UK-Euro)
- `IBM01147` - IBM EBCDIC (France-Euro)
- `IBM01148` - IBM EBCDIC (International-Euro)
- `IBM01149` - IBM EBCDIC (Icelandic-Euro)
- `IBM037` - IBM EBCDIC (US-Canada)
- `IBM1026` - IBM EBCDIC (Turkish Latin-5)
- `IBM273` - IBM EBCDIC (Germany)
- `IBM277` - IBM EBCDIC (Denmark-Norway)
- `IBM278` - IBM EBCDIC (Finland-Sweden)
- `IBM280` - IBM EBCDIC (Italy)
- `IBM284` - IBM EBCDIC (Spain)
- `IBM285` - IBM EBCDIC (UK)
- `IBM290` - IBM EBCDIC (Japanese katakana)
- `IBM297` - IBM EBCDIC (France)
- `IBM420` - IBM EBCDIC (Arabic)
- `IBM423` - IBM EBCDIC (Greek)
- `IBM424` - IBM EBCDIC (Hebrew)
- `IBM437` - OEM United States
- `IBM500` - IBM EBCDIC (International)
- `IBM855` - OEM Cyrillic
- `IBM860` - Portuguese (DOS)
- `IBM863` - French Canadian (DOS)
- `IBM864` - Arabic (864)
- `IBM865` - Nordic (DOS)
- `IBM870` - IBM EBCDIC (Multilingual Latin-2)
- `IBM871` - IBM EBCDIC (Icelandic)
- `IBM880` - IBM EBCDIC (Cyrillic Russian)
- `IBM905` - IBM EBCDIC (Turkish)
- `Johab` - Korean (Johab)
- `big5` - Chinese Traditional (Big5)
- `cp1025` - IBM EBCDIC (Cyrillic Serbian-Bulgarian)
- `cp866` - Cyrillic (DOS)
- `cp875` - IBM EBCDIC (Greek Modern)
- `gb2312` - Chinese Simplified (GB2312)
- `ibm737` - Greek (DOS)
- `ibm775` - Baltic (DOS)
- `ibm850` - Western European (DOS)
- `ibm852` - Central European (DOS)
- `ibm857` - Turkish (DOS)
- `ibm861` - Icelandic (DOS)
- `ibm869` - Greek, Modern (DOS)
- `iso-8859-1` - Western European (ISO)
- `iso-8859-13` - Estonian (ISO)
- `iso-8859-15` - Latin 9 (ISO)
- `iso-8859-2` - Central European (ISO)
- `iso-8859-3` - Latin 3 (ISO)
- `iso-8859-4` - Baltic (ISO)
- `iso-8859-5` - Cyrillic (ISO)
- `iso-8859-6` - Arabic (ISO)
- `iso-8859-7` - Greek (ISO)
- `iso-8859-8` - Hebrew (ISO-Visual)
- `iso-8859-9` - Turkish (ISO)
- `koi8-r` - Cyrillic (KOI8-R)
- `koi8-u` - Cyrillic (KOI8-U)
- `ks_c_5601-1987` - Korean
- `macintosh` - Western European (Mac)
- `shift_jis` - Japanese (Shift-JIS)
- `us-ascii` - US-ASCII
- `utf-16` - Unicode
- `utf-16BE` - Unicode (Big-Endian)
- `utf-32` - Unicode (UTF-32)
- `utf-32BE` - Unicode (UTF-32 Big-Endian)
- `utf-8` - Unicode (UTF-8)
- `windows-1250` - Central European (Windows)
- `windows-1251` - Cyrillic (Windows)
- `windows-1252` - Western European (Windows)
- `windows-1253` - Greek (Windows)
- `windows-1254` - Turkish (Windows)
- `windows-1255` - Hebrew (Windows)
- `windows-1256` - Arabic (Windows)
- `windows-1257` - Baltic (Windows)
- `windows-1258` - Vietnamese (Windows)
- `windows-874` - Thai (Windows)
- `x-Chinese-CNS` - Chinese Traditional (CNS)
- `x-Chinese-Eten` - Chinese Traditional (Eten)
- `x-Europa` - Europa
- `x-IA5` - Western European (IA5)
- `x-IA5-German` - German (IA5)
- `x-IA5-Norwegian` - Norwegian (IA5)
- `x-IA5-Swedish` - Swedish (IA5)
- `x-cp20001` - TCA Taiwan
- `x-cp20003` - IBM5550 Taiwan
- `x-cp20004` - TeleText Taiwan
- `x-cp20005` - Wang Taiwan
- `x-cp20261` - T.61
- `x-cp20269` - ISO-6937
- `x-cp20936` - Chinese Simplified (GB2312-80)
- `x-cp20949` - Korean Wansung
- `x-ebcdic-koreanextended` - IBM EBCDIC (Korean Extended)
- `x-mac-arabic` - Arabic (Mac)
- `x-mac-ce` - Central European (Mac)
- `x-mac-chinesetrad` - Chinese Traditional (Mac)
- `x-mac-croatian` - Croatian (Mac)
- `x-mac-cyrillic` - Cyrillic (Mac)
- `x-mac-greek` - Greek (Mac)
- `x-mac-hebrew` - Hebrew (Mac)
- `x-mac-icelandic` - Icelandic (Mac)
- `x-mac-japanese` - Japanese (Mac)
- `x-mac-romanian` - Romanian (Mac)
- `x-mac-thai` - Thai (Mac)
- `x-mac-turkish` - Turkish (Mac)
- `x-mac-ukrainian` - Ukrainian (Mac)