Subscribe
to our newsletter

 




 

Buy Now Download Tour

This is a list of the 145 Unicode conversions and encoding conversions offered by TextPipe Pro, in addition to 151 code page conversions.

  • Convert Unicode to ANSI
  • Convert ANSI to Unicode
  • Convert Unicode to ASCII
  • Convert ASCII to Unicode

The Unicode conversions are found under Filters Menu\Unicode.

Unicode Normalization filters:

  • NFC - Canonical Decomposition, followed by Canonical Composition
  • NFD - Canonical Decomposition
  • NFKD - Compatibility Decomposition
  • NFKC - Compatibility Decomposition, followed by Canonical Composition
  • Compose

Conversions between Unicode and:

European languages
  • ASCII
  • ISO-8859-1 (Western)
  • ISO-8859-2 (Central European)
  • ISO-8859-3 (South European)
  • ISO-8859-4 (Baltic)
  • ISO-8859-5 (Cyrillic)
  • ISO-8859-7 (Greek)
  • ISO-8859-9 (Turkish)
  • ISO-8859-10 (Nordic)
  • ISO-8859-13 (Baltic)
  • ISO-8859-14 (Celtic)
  • ISO-8859-15 (Western)
  • ISO-8859-16 (Romanian)
  • Windows 1250 (Central Europe)
  • Windows 1251 (Cyrillic)
  • Windows 1252 (Latin 1)
  • Windows 1253 (Greek)
  • Windows 1254 (Turkish)
  • Windows 1255 (Hebrew)
  • Windows 1256 (Arabic)
  • Windows 1257 (Baltic)
  • Windows 1258 (Vietnam)
  • CP437, CP737 DOS Greek, CP775 DOS BaltRim, CP850, CP852, CP853, CP855, CP856 Hebrew PC, CP857, CP858, CP860, CP861, CP863, CP865, CP866, CP869, CP1125
  • MacRoman, MacCentralEurope, MacIceland, MacCroatian, MacRomaniaCyrillic, MacUkraine, MacGreek, Mac Dingbats, Mac Farsi , Mac Romania
Semitic languages
  • ISO-8859-6 (Arabic)
  • ISO-8859-8 (Hebrew Visual)
  • CP255, CP1256
  • CP862, CP864
  • MacHebrew, MacArabic
Japanese
  • EUC-JP
  • SHIFT_JIS
  • P932
  • ISO-2022-JP, ISO-2022-JP-1, ISO-2022-JP-2, ISO-2022-JP-3
  • EUC-JISX0213
  • Shift_JISX0213
Chinese
  • EUC-CN
  • HZ, GBK
  • GB18030 Standard Chinese
  • UC-TW
  • BIG5
  • CP950
  • BIG5-HKSCS,
  • ISO-2022-CN, ISO-2022-CN-EXT
Korean
  • KOI8-R, KOI8-U, KOI8-RU
  • EUC-KR
  • CP949
  • ISO-2022-KR
  • JOHAB
Armenian
  • ARMSCII-8
Georgian
  • Georgian-Academy
  • Georgian-PS
Tajik
  • KOI8-T
Thai
  • TIS-620
  • CP874 Thai
  • MacThai
Laotian
  • MuleLao-1
  • CP1133
Vietnamese
  • VISCII
  • TCVN
  • CP1258
Platform specific/other
  • HP-ROMAN8
  • NEXTSTEP
  • RISCOS-LATIN1
  • C99
  • JAVA
  • IBM424
  • IBM437
  • IBM850
  • IBM852
  • IBM855
  • IBM857
  • IBM860
  • IBM861
  • IBM862
  • IBM863
  • IBM864
  • IBM865
  • IBM866
  • IBM869
  • JIS_X0201
  • TIS-620
Full Unicode
  • UTF-8
  • UCS-2, UCS-2BE, UCS-2LE
  • UCS-4, UCS-4BE, UCS-4LE
  • UTF-16, UTF-16BE, UTF-16LE
  • UTF-32, UTF-32BE, UTF-32LE
  • UTF-7, UTF-7 Optional Direct Characters

Note:

  • UCS-4 is UTF-32 with support for code points beyond U+10FFFF (which are supposed to be unassignable forever).
  • UCS-2 is UTF-16 with surrogate support removed (so code points beyond U+FFFF cannot be represented).
Turkmen
  • TDS565
  • MacTurkish

Buy Now Download Tour