|
| ||||||
|
A Unicode map allows 2 byte (UTF-16) or 4 byte (UTF-32) Unicode characters to be mapped to a sequence of zero or more output characters in the same way that single byte characters are mapped. The primary difference between ANSI maps and Unicode maps is that a Unicode map only specifies characters that are changed - non-specified characters are controlled by a separate setting (otherwise you would have an unwieldy list of 65536 to 1 million entries to edit and work with). The Unicode map expects characters to be found in big endian order, i.e. MSB first, so the Unicode code point x00FA would be found in the file as two consecutive characters, 0x00 0xFA, NOT 0xFA 0x00. If the Unicode map does not work for you, you can check your input file format using the Convert\Hex Dump filter, and if necessary, use the Swap UTF-16 word order or Swap UTF-32 word order filter. A Unicode map filter looks like this:
Type a value in the output string column to define how a Unicode character gets remapped. Click the up and down buttons in the Start Range (Hex) or End Range (Hex) columns or type a new number to change the range of Unicode characters being remapped. New values can be added on the last row, or using the Populate Values panel. Delete Selected RowsDeletes the currently selected rows from the grid. Character ranges not defined in the grid are controlled by the Non-entered characters panel. Open Map FileOpens a Unicode map file, replacing the existing map grid (not the non-entered characters panel values). Maps can be loaded from Excel worksheets (.XLS), Comma Separated Value files (.CSV, the default when the file extension is not recognized) and Tab delimited value files (.TAB). Save Map to FileSaves the current Unicode map grid to a file (not the non-entered characters panel values). Maps can be saved to Excel worksheets (.XLS), .CSV Comma Separated Value file (the default) or a .TAB Tab-delimited value file (only when the file extension is .TAB). Values are saved exactly as they are shown on screen. UTF ModeThis drop down specifies whether each input character is 2 bytes (UTF-16) or 4 bytes (UTF-32). Non-entered charactersThis panel controls what happens to characters not found in any range in the grid.
Populate valuesThe populate values group makes it easy to set large groups of values. First enter in a starting value and an ending value (in decimal), then click the button corresponding to your choice:
Sample sub filter VBScript
See also
|
|
Contact
Us
Support
Community
Tutorials and User Guides (online) |