Page 1 of 1

Character - Encoding

Posted: Wed Mar 18, 2009 12:40 pm
by +sveinajoha+
Hi - I have been using TextPipe for some Years now - An exellent program! :D
But now im stuck! :)

Im parsing an rss feed - and need to save the file inn utf-8 format, after encoding special characters:
Example: The character Å encoded with either the html code( Å ) or the iso latin 1 code ( Å) .

The filter runs ok in the trial run area - replasing all the special characters as intended.

But when running the filter in normal mode, from command line, the output file still have the orignial specialc haracters intact?
Ive tested all the Unicode filters ++, but nothing seem to work.

How can encode special characters and keep the encoding?

Any suggestions? :)


Re: Character - Encoding

Posted: Wed Mar 18, 2009 11:17 pm
by +sveinajoha+
Hmm.. been trying more, but with noe results.
trying to save as ANSI > results in ÅÆØ... characters beeing removed (both in native fomat Å and formatted as html &Aoring; )

The original file is utf-8, no EOL - the prosessed file becomes utf-8 with pc/dos EOL

What im trying to achieve is like when i manually open the file in windows Notepad, I can search and replace all Å characters and save the file with the replaced Å intact.
I know that the file becomes ANSI when opening and saving it from Notepad. But this seem to work with my rss parser so...
Also manually editing and replacing the Å cahr with &Aoring; in the textfile (rss file) directly on my ftp server works...

Any suggestions?


Re: Character - Encoding

Posted: Thu Mar 19, 2009 12:46 am
by +sveinajoha+
Problem resolved!
And it was an easy solution.

Before search/replace > convert utf-8 file to ANSI
After the search/replace > convert ANSI to utf-8



Re: Character - Encoding

Posted: Fri Mar 20, 2009 9:27 am
by DataMystic Support
You can also check the 'UTF-8' option of the perl pattern options - the [...] box in the search/replace filter.