Extract Data From a table

Get help with installation and running here.

Moderators: DataMystic Support, Moderators

buckley
Posts: 3
Joined: Sat Nov 29, 2008 8:14 am

Extract Data From a table

Postby buckley » Sat Nov 29, 2008 8:20 am

Hello,

Im very interested in trying out textpipe after discovering it with offline explorer.

I don't want you to do my homework but I would like to check with you if what I'm aiming for is possible.

On this page http://www.humo.be/cps/rde/xchg/humo/hs ... m_Top.html you can find the rating given to a movie.
It is below the top 10.

Eg. Vinyan has 3.5 stars

Is it possible with TP to mump this data in a sql server everytime a new move shows up in the list?

Bascily my challegne boild down to parsing a table and detectig under wich rating the row hangs (I think)

Kind Regards, Tom

buckley
Posts: 3
Joined: Sat Nov 29, 2008 8:14 am

Re: Extract Data From a table

Postby buckley » Wed Dec 03, 2008 7:10 pm

Bump?

User avatar
DataMystic Support
Site Admin
Posts: 2162
Joined: Mon Jun 30, 2003 12:32 pm
Location: Melbourne, Australia
Contact:

Re: Extract Data From a table

Postby DataMystic Support » Thu Dec 04, 2008 6:54 am

Sure Tom,

As you say, you need to extract just the relevant sections. Look at the whitepaper part of the site for a useful guide to web site mining.

The best approach would be to attempt to insert a new title into a database every time you poll the website, and use the db index to discard duplicate rows.
Regards,

Simon Carter, http://DataMystic.com/forums/index.php
http://PredictBGL.com - Insulin dose calculator for Type 1 diabetes
http://DownloadPipe.com - 250,000 free software downloads
http://DetachPipe.com - send huge email attachments

buckley
Posts: 3
Joined: Sat Nov 29, 2008 8:14 am

Re: Extract Data From a table

Postby buckley » Thu Dec 04, 2008 8:16 am

OK that makes sense (ignore duplicate values in the index)

What technique should I use to caputre the rating ?

rating ***
Movie 1
Movie 2
Movie 3
raing *
Move 4

=> What if Movie 3 was new and I need to store 3 starts with it?

Should I write procedure logic to capture this or is there another technique you can advice?

Regards, Tom

User avatar
DataMystic Support
Site Admin
Posts: 2162
Joined: Mon Jun 30, 2003 12:32 pm
Location: Melbourne, Australia
Contact:

Re: Extract Data From a table

Postby DataMystic Support » Thu Dec 04, 2008 10:15 am

Hi Tom,

First generate an extract that contains just titles and ratings. Then use a Restrict to each line filter, and inside this 2 steps - capture the rating to a variable and then add it to each line with an Add Left Margin filter.
Regards,

Simon Carter, http://DataMystic.com/forums/index.php
http://PredictBGL.com - Insulin dose calculator for Type 1 diabetes
http://DownloadPipe.com - 250,000 free software downloads
http://DetachPipe.com - send huge email attachments


Return to “TextPipe Tips and Tricks, Questions and Support”

Who is online

Users browsing this forum: Yahoo [Bot] and 1 guest