Jump to content
Sign in to follow this  
ReallySimple

How to dump text from a webpage?

Recommended Posts

ReallySimple

Hello,

I'm new to autoit and I'm trying to do a project just for fun. I'm trying to convert a member list(or at least a portion of it) to a txt file database.

http://www.pbnation.com/memberlist.php

This is the memberlist page. I tried using GetWinText and _FileWriteLog to dump the page, but it did not work really(only logged a little text).

Could anyone show me an example of how to grab the member list text? I want to re-create the list to a txt file just like it's displayed, and then go on to the next page and do the same thing and log it to the same .txt file just below the last page list.

I looked into IE.Au3 but couldn't figure it out. Please and thank you.

Share this post


Link to post
Share on other sites
Mecrazycoder

Hello,

I'm new to autoit and I'm trying to do a project just for fun. I'm trying to convert a member list(or at least a portion of it) to a txt file database.

http://www.pbnation.com/memberlist.php

This is the memberlist page. I tried using GetWinText and _FileWriteLog to dump the page, but it did not work really(only logged a little text).

Could anyone show me an example of how to grab the member list text? I want to re-create the list to a txt file just like it's displayed, and then go on to the next page and do the same thing and log it to the same .txt file just below the last page list.

I looked into IE.Au3 but couldn't figure it out. Please and thank you.

Did you try with _IEBodyReadText()? Refer help file for its description

[size="4"][font="Arial Narrow"][font="Garamond"]Attitude is a little thing that makes a big difference[/font][/font][/size][indent][/indent]

Share this post


Link to post
Share on other sites
ReallySimple

Did you try with _IEBodyReadText()? Refer help file for its description

Works, but the list is quiet messy. Is there some way to clean it up?

It list them like this:

http://pastebin.com/m68a54b4

Maybe theres some way I can dump the HTML page and convert it to one giant HTML page with maybe a large portion of the members listed on one HTML page?

Share this post


Link to post
Share on other sites
Mecrazycoder

Works, but the list is quiet messy. Is there some way to clean it up?

It list them like this:

http://pastebin.com/m68a54b4

Maybe theres some way I can dump the HTML page and convert it to one giant HTML page with maybe a large portion of the members listed on one HTML page?

Have you tried with _IETableGetCollection().

[size="4"][font="Arial Narrow"][font="Garamond"]Attitude is a little thing that makes a big difference[/font][/font][/size][indent][/indent]

Share this post


Link to post
Share on other sites
ReallySimple

I got it to clean up a bit.

What func could I use to search for any line with the numbers "1195612311511015011001" and delete that line if found?

Share this post


Link to post
Share on other sites
Mecrazycoder

I got it to clean up a bit.

What func could I use to search for any line with the numbers "1195612311511015011001" and delete that line if found?

This Link will help you


[size="4"][font="Arial Narrow"][font="Garamond"]Attitude is a little thing that makes a big difference[/font][/font][/size][indent][/indent]

Share this post


Link to post
Share on other sites
ReallySimple

Thanks for the spoon feedage, my searching skills suck :)

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
Sign in to follow this  

×

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.