Jump to content
Sign in to follow this  
Vadersapien

Parse HTML or XML?

Recommended Posts

Vadersapien

Hello,

How would I go about parsing a HTML or XML file without embedding IE in my application?


Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites
Nutster

Regular expressions! What exactly are you trying to find?


David Nuttall
Nuttall Computer Consulting

An Aquarius born during the Age of Aquarius

AutoIt allows me to re-invent the wheel so much faster.

I'm off to write a wizard, a wonderful wizard of odd...

Share this post


Link to post
Share on other sites
Vadersapien

??Confused??

Anyway what I'm trying to do is build my own basic web rendering engine...


Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites
Nutster

Also, a search for XML in the Examples forum yields these promising threads:

XML DOM wrapper

XML DOM download link

XML Read

I haven't tested them, but they might also be useful.


David Nuttall
Nuttall Computer Consulting

An Aquarius born during the Age of Aquarius

AutoIt allows me to re-invent the wheel so much faster.

I'm off to write a wizard, a wonderful wizard of odd...

Share this post


Link to post
Share on other sites
jvanegmond

Anyway what I'm trying to do is build my own basic web rendering engine...

Have you thought about things apart from the HTML/XML parsing?

What library do you intend to use for rendering?

Share this post


Link to post
Share on other sites
Vadersapien

_StringBetween() looks alright...and the XML parsers look even better...

It'd be quite easy to parse an XHTML page, but HTML is a different story, since tags aern't always closed.

Is there a way I can search for the tags that need closing, and close them with my script so they they are ready to be parsed?

Have you thought about things apart from the HTML/XML parsing?

Yeah, I thought about Javascript & CSS parsing...and I'm guessing that would be much harder.

Try Pacfox, my Firefox theme.Try Power Eject, my windows gadget that allows you to eject most drives.Using AutoIt 3.3.4.0, Windows 7 Premium, Intel Core 2 Quad CPU @ 2.66ghz, 4gb RAM, Nvidia GeForce 9500GT Graphics Card & Samsung 22" Monitor.

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
Sign in to follow this  

×

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.