wimhek

Help with Regexp

4 posts in this topic

I am having trouble with regexp. I want to split HTML lines into an array. After some googling I found an answer; here it is.

#include <Array.au3>

$sString = "<td>aap</td><td>noot</td><td>mies</td>"

$aReturn = StringRegExp($sString, '(?s)(?i)<td>(.*?)</td>', 3)

_ArrayDisplay($aReturn)

This returns:

aap

noot

mies

My question: what if the string is now

$sString = "<td>aap</td><td class=e>noot</td><td>mies</td>"

What regexp pattern is needed to get the same result?




wimhek,

Tell the RegEx that there may be other characters before the closing ">": ;)

(?s)(?i)<td.*?>(.*?)</td>

That works for me on both your examples. :)
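
For anyone who wants to check this quickly, here is a minimal sketch that runs the pattern above against both sample strings from the question. The array name $aInputs and the use of ConsoleWrite with _ArrayToString (instead of _ArrayDisplay) are just choices made for this illustration.

```autoit
#include <Array.au3>

; Both sample strings from the question ($aInputs is a name made up for this sketch)
Local $aInputs[2] = ["<td>aap</td><td>noot</td><td>mies</td>", _
        "<td>aap</td><td class=e>noot</td><td>mies</td>"]

For $i = 0 To UBound($aInputs) - 1
    ; Flag 3 returns an array holding the captures of all global matches
    Local $aReturn = StringRegExp($aInputs[$i], '(?s)(?i)<td.*?>(.*?)</td>', 3)
    If @error Then
        ConsoleWrite("No match for input " & $i & @CRLF)
    Else
        ; Print the captured cell contents on one line
        ConsoleWrite(_ArrayToString($aReturn, ", ") & @CRLF)
    EndIf
Next
```

Both inputs should give the same three entries: aap, noot, mies.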

M23



 


Thank you. I put the .*? between () like this .


Thank you. I put the .*? between () like this .

Well, that'll create an extra entry in the array returned by StringRegExp for every match (whatever the extra group captured). If you need that then cool, otherwise just drop the group.
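
To make the extra entries visible, here is a small sketch. The quoted post does not show the exact pattern, so the two-group pattern below is only an assumption about what was meant; the variable names are made up for the illustration.

```autoit
#include <Array.au3>

Local $sString = "<td>aap</td><td class=e>noot</td><td>mies</td>"

; Assumed pattern: an extra capture group around the attribute part of the tag
Local $aTwoGroups = StringRegExp($sString, '(?s)(?i)<td(.*?)>(.*?)</td>', 3)
If Not @error Then ConsoleWrite(UBound($aTwoGroups) & " entries: " & _ArrayToString($aTwoGroups, "|") & @CRLF)

; Same pattern with the first group dropped: only the cell contents are returned
Local $aOneGroup = StringRegExp($sString, '(?s)(?i)<td.*?>(.*?)</td>', 3)
If Not @error Then ConsoleWrite(UBound($aOneGroup) & " entries: " & _ArrayToString($aOneGroup, "|") & @CRLF)
```

With two groups, each match contributes two entries (the attribute text, possibly empty, followed by the cell content), so the first array should be twice as long as the second.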

Also, although Melba23's example is good and working, here's another variation:

(?s)(?i)<td[^>]*>(.*?)</td>

The negated character class [^>]* will eat anything up until the first > it encounters. Melba23's regexp works because the ? makes the * quantifier lazy instead of greedy, thus stopping it at the first >. Without the ?, the .* would eat as much of the remaining string as it can while still allowing a match, skipping right over the earlier cells.
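
The difference between the three quantifier styles can be seen side by side with a short sketch like the one below. The greedy pattern is added purely for comparison, and the array name $aPatterns is made up for the illustration.

```autoit
#include <Array.au3>

Local $sString = "<td>aap</td><td class=e>noot</td><td>mies</td>"

; Greedy, lazy and negated character class variants of the opening-tag part
Local $aPatterns[3] = ['(?s)(?i)<td.*>(.*?)</td>', _
        '(?s)(?i)<td.*?>(.*?)</td>', _
        '(?s)(?i)<td[^>]*>(.*?)</td>']

For $i = 0 To UBound($aPatterns) - 1
    ; Flag 3 returns an array holding the captures of all global matches
    Local $aReturn = StringRegExp($sString, $aPatterns[$i], 3)
    If @error Then
        ConsoleWrite($aPatterns[$i] & " -> no match" & @CRLF)
    Else
        ConsoleWrite($aPatterns[$i] & " -> " & _ArrayToString($aReturn, ", ") & @CRLF)
    EndIf
Next
```

The lazy and character class versions should both return aap, noot, mies. The greedy version should return only mies, because .* runs on to the last > that still leaves a </td> to match.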


