All text is processed internally as Unicode, encoded in UTF-16. The maximum size of a text is 2 GB, which gives a maximum text length of about 1 billion characters (1,073,741,824 two-byte code units).

Functions such as [[BLOB_TO_TEXT]] and [[HTTPGET_TEXT]] support exchange with other popular character sets, such as:

- `ASCII`
- `BigEndianUnicode`
- `UTF7`
- `Unicode`
- `UTF32`
- `UTF8`

Which other character sets are supported depends on the platform, but they typically include:

- `ASMO-708` - Arabic (ASMO 708)
- `DOS-720` - Arabic (DOS)
- `DOS-862` - Hebrew (DOS)
- `EUC-JP` - Japanese (JIS 0208-1990 and 0212-1990)
- `IBM-Thai` - IBM EBCDIC (Thai)
- `IBM00858` - OEM Multilingual Latin I
- `IBM00924` - IBM Latin-1
- `IBM01047` - IBM Latin-1
- `IBM01140` - IBM EBCDIC (US-Canada-Euro)
- `IBM01141` - IBM EBCDIC (Germany-Euro)
- `IBM01142` - IBM EBCDIC (Denmark-Norway-Euro)
- `IBM01143` - IBM EBCDIC (Finland-Sweden-Euro)
- `IBM01144` - IBM EBCDIC (Italy-Euro)
- `IBM01145` - IBM EBCDIC (Spain-Euro)
- `IBM01146` - IBM EBCDIC (UK-Euro)
- `IBM01147` - IBM EBCDIC (France-Euro)
- `IBM01148` - IBM EBCDIC (International-Euro)
- `IBM01149` - IBM EBCDIC (Icelandic-Euro)
- `IBM037` - IBM EBCDIC (US-Canada)
- `IBM1026` - IBM EBCDIC (Turkish Latin-5)
- `IBM273` - IBM EBCDIC (Germany)
- `IBM277` - IBM EBCDIC (Denmark-Norway)
- `IBM278` - IBM EBCDIC (Finland-Sweden)
- `IBM280` - IBM EBCDIC (Italy)
- `IBM284` - IBM EBCDIC (Spain)
- `IBM285` - IBM EBCDIC (UK)
- `IBM290` - IBM EBCDIC (Japanese katakana)
- `IBM297` - IBM EBCDIC (France)
- `IBM420` - IBM EBCDIC (Arabic)
- `IBM423` - IBM EBCDIC (Greek)
- `IBM424` - IBM EBCDIC (Hebrew)
- `IBM437` - OEM United States
- `IBM500` - IBM EBCDIC (International)
- `IBM855` - OEM Cyrillic
- `IBM860` - Portuguese (DOS)
- `IBM863` - French Canadian (DOS)
- `IBM864` - Arabic (864)
- `IBM865` - Nordic (DOS)
- `IBM870` - IBM EBCDIC (Multilingual Latin-2)
- `IBM871` - IBM EBCDIC (Icelandic)
- `IBM880` - IBM EBCDIC (Cyrillic Russian)
- `IBM905` - IBM EBCDIC (Turkish)
- `Johab` - Korean (Johab)
- `big5` - Chinese Traditional (Big5)
- `cp1025` - IBM EBCDIC (Cyrillic Serbian-Bulgarian)
- `cp866` - Cyrillic (DOS)
- `cp875` - IBM EBCDIC (Greek Modern)
- `gb2312` - Chinese Simplified (GB2312)
- `ibm737` - Greek (DOS)
- `ibm775` - Baltic (DOS)
- `ibm850` - Western European (DOS)
- `ibm852` - Central European (DOS)
- `ibm857` - Turkish (DOS)
- `ibm861` - Icelandic (DOS)
- `ibm869` - Greek, Modern (DOS)
- `iso-8859-1` - Western European (ISO)
- `iso-8859-13` - Estonian (ISO)
- `iso-8859-15` - Latin 9 (ISO)
- `iso-8859-2` - Central European (ISO)
- `iso-8859-3` - Latin 3 (ISO)
- `iso-8859-4` - Baltic (ISO)
- `iso-8859-5` - Cyrillic (ISO)
- `iso-8859-6` - Arabic (ISO)
- `iso-8859-7` - Greek (ISO)
- `iso-8859-8` - Hebrew (ISO-Visual)
- `iso-8859-9` - Turkish (ISO)
- `koi8-r` - Cyrillic (KOI8-R)
- `koi8-u` - Cyrillic (KOI8-U)
- `ks_c_5601-1987` - Korean
- `macintosh` - Western European (Mac)
- `shift_jis` - Japanese (Shift-JIS)
- `us-ascii` - US-ASCII
- `utf-16` - Unicode
- `utf-16BE` - Unicode (Big-Endian)
- `utf-32` - Unicode (UTF-32)
- `utf-32BE` - Unicode (UTF-32 Big-Endian)
- `utf-8` - Unicode (UTF-8)
- `windows-1250` - Central European (Windows)
- `windows-1251` - Cyrillic (Windows)
- `windows-1252` - Western European (Windows)
- `windows-1253` - Greek (Windows)
- `windows-1254` - Turkish (Windows)
- `windows-1255` - Hebrew (Windows)
- `windows-1256` - Arabic (Windows)
- `windows-1257` - Baltic (Windows)
- `windows-1258` - Vietnamese (Windows)
- `windows-874` - Thai (Windows)
- `x-Chinese-CNS` - Chinese Traditional (CNS)
- `x-Chinese-Eten` - Chinese Traditional (Eten)
- `x-Europa` - Europa
- `x-IA5` - Western European (IA5)
- `x-IA5-German` - German (IA5)
- `x-IA5-Norwegian` - Norwegian (IA5)
- `x-IA5-Swedish` - Swedish (IA5)
- `x-cp20001` - TCA Taiwan
- `x-cp20003` - IBM5550 Taiwan
- `x-cp20004` - TeleText Taiwan
- `x-cp20005` - Wang Taiwan
- `x-cp20261` - T.61
- `x-cp20269` - ISO-6937
- `x-cp20936` - Chinese Simplified (GB2312-80)
- `x-cp20949` - Korean Wansung
- `x-ebcdic-koreanextended` - IBM EBCDIC (Korean Extended)
- `x-mac-arabic` - Arabic (Mac)
- `x-mac-ce` - Central European (Mac)
- `x-mac-chinesetrad` - Chinese Traditional (Mac)
- `x-mac-croatian` - Croatian (Mac)
- `x-mac-cyrillic` - Cyrillic (Mac)
- `x-mac-greek` - Greek (Mac)
- `x-mac-hebrew` - Hebrew (Mac)
- `x-mac-icelandic` - Icelandic (Mac)
- `x-mac-japanese` - Japanese (Mac)
- `x-mac-romanian` - Romanian (Mac)
- `x-mac-thai` - Thai (Mac)
- `x-mac-turkish` - Turkish (Mac)
- `x-mac-ukrainian` - Ukrainian (Mac)
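
The size limit and the character set conversion described above can be illustrated with a short sketch. It is written in Python purely for illustration and does not show how [[BLOB_TO_TEXT]] or [[HTTPGET_TEXT]] are implemented. The helper names `utf16_units` and `transcode` are hypothetical, and Python's codec names only partly overlap with the names listed above. The sketch assumes the 2 GB limit means 2 × 1024³ bytes and that every UTF-16 code unit takes two bytes.

```python
# Illustrative sketch only; helper names are hypothetical.
MAX_TEXT_BYTES = 2 * 1024**3      # assumed meaning of the 2 GB limit
BYTES_PER_UTF16_UNIT = 2          # one UTF-16 code unit is 16 bits

# Maximum text length in UTF-16 code units.
max_units = MAX_TEXT_BYTES // BYTES_PER_UTF16_UNIT


def utf16_units(text: str) -> int:
    """Return how many 16-bit code units the text occupies in UTF-16."""
    return len(text.encode("utf-16-le")) // BYTES_PER_UTF16_UNIT


def transcode(data: bytes, source_charset: str) -> bytes:
    """Decode bytes from source_charset and re-encode them as UTF-16 (little-endian, no BOM)."""
    return data.decode(source_charset).encode("utf-16-le")


# Ten Latin-1 bytes become twenty bytes once held as UTF-16.
latin1_bytes = "Smørrebrød".encode("iso-8859-1")
utf16_bytes = transcode(latin1_bytes, "iso-8859-1")

print(max_units)                            # 1073741824
print(len(latin1_bytes), len(utf16_bytes))  # 10 20
print(utf16_units("a"), utf16_units("😀"))   # 1 2
```

Characters outside the Basic Multilingual Plane, such as most emoji, occupy two UTF-16 code units each, so a text made of such characters would reach the size limit at roughly half the stated character count.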