Sign in to follow this  
Followers 0
Vadersapien

Parse HTML or XML?

7 posts in this topic

Hello,

How would I go about parsing a HTML or XML file without embedding IE in my application?


Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites



Regular expressions! What exactly are you trying to find?


David Nuttall
Nuttall Computer Consulting

An Aquarius born during the Age of Aquarius

AutoIt allows me to re-invent the wheel so much faster.

I'm off to write a wizard, a wonderful wizard of odd...

Share this post


Link to post
Share on other sites

??Confused??

Anyway what I'm trying to do is build my own basic web rendering engine...


Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites

Also, a search for XML in the Examples forum yields these promising threads:

XML DOM wrapper

XML DOM download link

XML Read

I haven't tested them, but they might also be useful.


David Nuttall
Nuttall Computer Consulting

An Aquarius born during the Age of Aquarius

AutoIt allows me to re-invent the wheel so much faster.

I'm off to write a wizard, a wonderful wizard of odd...

Share this post


Link to post
Share on other sites

Anyway what I'm trying to do is build my own basic web rendering engine...

Have you thought about things apart from the HTML/XML parsing?

What library do you intend to use for rendering?

Share this post


Link to post
Share on other sites

_StringBetween() looks alright...and the XML parsers look even better...

It'd be quite easy to parse an XHTML page, but HTML is a different story, since tags aern't always closed.

Is there a way I can search for the tags that need closing, and close them with my script so they they are ready to be parsed?

Have you thought about things apart from the HTML/XML parsing?

Yeah, I thought about Javascript & CSS parsing...and I'm guessing that would be much harder.

Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!


Register a new account

Sign in

Already have an account? Sign in here.


Sign In Now
Sign in to follow this  
Followers 0