Analyzing Unicode Text with Regular Expressions

Get help with installation and running here.

Moderators: DataMystic Support, Moderators

DFH
Posts: 658
Joined: Sun Dec 09, 2007 2:49 am
Location: UK

Analyzing Unicode Text with Regular Expressions

Postby DFH » Mon Oct 26, 2009 6:59 pm

Here's an article which should be helpful to others:

http://icu-project.org/docs/papers/iuc26_regexp.pdf

Using Regular Expressions with Unicode texts can be a nightmare, largely as (too) much public documentation is geared towards using them just with ANSI characters.

This 18 page article from 2004 rectifies a lot of that.

User avatar
DataMystic Support
Site Admin
Posts: 2164
Joined: Mon Jun 30, 2003 12:32 pm
Location: Melbourne, Australia
Contact:

Re: Analyzing Unicode Text with Regular Expressions

Postby DataMystic Support » Tue Oct 27, 2009 8:54 pm

Thanks David,

TextPipe uses the PCRE (perl compatable regular expression) library - hence all the Unicode regex functions are implemented. Generally you need to check the 'Allow UTF-8' option of the perl or EasyPattern replacement.
Regards,

Simon Carter, http://DataMystic.com/forums/index.php
http://PredictBGL.com - Insulin dose calculator for Type 1 diabetes
http://DownloadPipe.com - 250,000 free software downloads
http://DetachPipe.com - send huge email attachments


Return to “TextPipe Tips and Tricks, Questions and Support”

Who is online

Users browsing this forum: No registered users and 10 guests