Search found 735 matches

by DFH
Fri Nov 24, 2017 9:47 pm
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Suggestion for new Pattern Match 'Unicode Character Categories'
Replies: 0
Views: 4

Suggestion for new Pattern Match 'Unicode Character Categories'

Unicode character categories are described in several places online. e.g. http://www.fileformat.info/info/unicode/category/ It would be useful to have a new pattern match method for Unicode Character Categories . This would augment the existing pattern match method for POSIX CHARACTER CLASSES . Best...
by DFH
Tue Nov 21, 2017 8:32 pm
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Suggestion for new filter 'Add Match Numbers'
Replies: 0
Views: 6

Suggestion for new filter 'Add Match Numbers'

The Add Line Numbers filter is already very useful. In principle, the notion can be generalised to Add Match Numbers for pattern matching. Add Line Numbers is then equivalent to matching the PCRE pattern ^ for the Start of Line and adding the serial numbers just before the match position. I therefor...
by DFH
Sat Oct 21, 2017 12:46 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Word frequency list
Replies: 11
Views: 491

Re: Word frequency list

Hereby attached an example for making a counted words list for French words in a complete Bible. The data was derived from https://github.com/MarjorieBurghart/VulgateGlaire NB. The data for the input file had already been preprocessed by means of other bespoke TextPipe filters. The output is uploade...
by DFH
Sat Oct 21, 2017 12:37 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Epub/Mobi text preparation tips
Replies: 1
Views: 101

Re: Epub/Mobi text preparation tips

TextPipe Pro edition is only really required for mainframe filters.

TextPipe Standard suffices for most users.

David
by DFH
Sat Oct 21, 2017 12:35 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: x64 replacement for ActiveX script library
Replies: 14
Views: 2750

Re: x64 replacement for ActiveX script library

See my email sent an hour ago.

Not tried x64 edition of 10.5 yet.

David
by DFH
Sat Oct 21, 2017 12:33 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: UTF-8 sort filter order
Replies: 7
Views: 295

Re: UTF-8 sort filter order

Create some data with a fixed length word preceding the numbers.

Code: Select all

test8.9
test6.9
test10.5
test2
test-1
Then set the starting column as 5 and see what happens.

Replace the word test with random 4 digits, and see if the results change.
by DFH
Sat Oct 21, 2017 12:28 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Word frequency list
Replies: 11
Views: 491

Re: Word frequency list

An alternative method is to use a remove patterns matching [[:punct:]] after first replacing any special punctuation marks you want to keep as part of valid words.

After the words list has been made, the temporary replacements can be readily reverted.
by DFH
Sat Oct 21, 2017 12:23 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Word frequency list
Replies: 11
Views: 491

Re: Word frequency list

Sticking to ANSI, many plural possessives do not end with 's but with s' . However, the possessive for singular cockatrice is cockatrice' as in "a cockatrice' den" . Not many people know that, unless they are familiar with Isaiah 11:8 in the Authorised Version of the Bible. It's not the only unusual...
by DFH
Sat Oct 21, 2017 12:14 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Word frequency list
Replies: 11
Views: 491

Re: Word frequency list

The correct Unicode character that should be used in proper typography for possessives is not \x27 apostrophe but rather U+2019 right single quotation mark. That's true for both English and French as well as a few other languages based on the Latin script. Users of the enhanced feature may not be aw...
by DFH
Sat Oct 21, 2017 12:09 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Unicode normalize filters?
Replies: 6
Views: 273

Re: Unicode normalize filters?

btw. Somebody told me last month that Microsoft had changed how they implement Unicode rendering in Windows 7 comapred to earlier versions of Windows. They no longer use Uniscribe, even though that's is still being maintained. See https://en.wikipedia.org/wiki/Uniscribe and https://en.wikipedia.org/...
by DFH
Sat Oct 21, 2017 12:01 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Unicode normalize filters?
Replies: 6
Views: 273

Re: Unicode normalize filters?

I tested the 26 x 26 x 4 digraphs again and it all worked as it should when converted to NFC. i.e. No unwarranted spurious characters in the output. There are further tests that I could do, but that can wait a while. btw. Did you notify the supplier of the defective Unicode library that something wa...
by DFH
Fri Sep 29, 2017 10:08 pm
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: UTF-8 sort filter order
Replies: 7
Views: 295

Re: UTF-8 sort filter order

Help for a numeric sort states: Numeric sort allows lines to be sorted according to their numeric value. The numeric value must appear at the start of the line (leading spaces are allowed). The number must be in decimal, and can be in floating point format. Any non-numeric characters after the numbe...
by DFH
Thu Sep 28, 2017 12:30 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Word frequency list
Replies: 11
Views: 491

Re: Word frequency list

I await a response about the UTF-8 awareness. Thanks.

David
by DFH
Thu Sep 28, 2017 12:26 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Unicode normalize filters?
Replies: 6
Views: 273

Re: Unicode normalize filters?

The Help page libiconv (under Advanced Topics ) states: About libiconv libiconv is a GNU library used by TextPipe for some of its Unicode conversions. C-Source code, .obj files and binaries are available for free from http://www.gnu.org/software/libiconv/ It's incredible that the Unicode normalize f...
by DFH
Thu Sep 28, 2017 12:22 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: UTF-8 sort filter order
Replies: 7
Views: 295

Re: UTF-8 sort filter order

Temporary workaround until you fix the issue.

Include a Reverse line order filter after the sort if you want an ascending order.

David