Jump to content
Sign in to follow this  
gcue

reconstructing data from html table

Recommended Posts

gcue

i have a website im reading from and am trying to grab the data to put it elsewhere and in a different format i think it can probably be done with stringregexp - but way to complicated for me grasp at this point

each row varies from 1 to 3 columns (which makes it more complicated)

id like to grab the value of each column and build an array keeping each available column with each row of data

for instance:

"GSD Application Support" (1st column)

"NAB Group" (2nd column)

"GSD support for ETS and OPAC (Spark III apps)" (3rd column)

hope this makes sense

thank you very much in advance!

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001356">All ITG A-J</a></font></td><td nowrap></td><td nowrap><font size="2"><a href="/names.nsf/85255af?OpenDocument">Sub Group of #ITG ALL ITG the ITG Associates under JHP, DNR, TWM, RCI with initials starting with A through J</a></font></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001Document">Asia Broker DB - Editor</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><font size="2"><a href="/names.nsf/85255e010017?OpenDocument">To include all ETS developer, PSS Prod - the broker contact. detail</a></font></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001356a8852penDocument">BESAdminLAOINV</a></font></td><td nowrap></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001354312?OpenDocument">BES Admin LAO Office Investment Group</a></font></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/855e02556d4penDocument">CG Admin SNOLNM01</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001356aocument">CG Associates-02</a></font></td><td nowrap></td><td nowrap><font size="2"><a href="/names.nsf/85294?OpenDocument">This distribution list is automatically updated from information in ITAD. Any changes to list names need to be managed in ITAD.</a></font></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/852cc2b?OpenDocument">CRGI_ETP_Editor</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001356a885255d41?OpenDocument">CRS Avistar</a></font></td><td nowrap></td><td nowrap><font size="2"><a href="/names.nsf/85255ffd41?OpenDocument">Project team members for the CRS Avistar CWI Pilot project needed for Avistar/Sametime connectivity</a></font></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e010e54?OpenDocument">CWI - LAO Associates - 888752</a></font></td><td nowrap><font size="2"><a href="/names.nsf/852e54?OpenDocument">Investment</a></font></td><td nowrap><font size="2"><a href="/names.nsf/85255e01e54?OpenDocument">E-mail change requests to DistList Updates</a></font></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e0100135aa?OpenDocument">CWI_ETP_Editor</a></font></td><td nowrap></td><td nowrap></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255a?OpenDocument">Desktop Technology - Disk Encryption Project - Readers</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e01001356a8852c83f?OpenDocument">Disk Encryption Production Support - Editors</a></font></td><td nowrap></td><td nowrap></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255c4?OpenDocument">Disk Encryption Production Support - Readers</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn122.gif" border="0" height="11" width="13" alt="Unopened yellow envelope Icon"></td><td nowrap><font size="2"><a href="/names.nsf/85255e019802?OpenDocument">DR Desktop</a></font></td><td nowrap></td><td nowrap></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn061.gif" border="0" height="11" width="13" alt="Key Icon"></td><td nowrap><font size="2"><a href="/names.nsf/8572?OpenDocument">Equity Trading CRS Status - Editors</a></font></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td><td nowrap><img src="/icons/ecblank.gif" border="0" height="16" width="1" alt=""></td></tr>

<tr valign="top"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/8525b9f?OpenDocument">GSD Application Support</a></font></td><td nowrap><font size="2"><a href="/names.nsf/85255e55b9f?OpenDocument">NAB Group</a></font></td><td nowrap><font size="2"><a href="/names.nsf/85255e0155b9f?OpenDocument">GSD support for ETS and OPAC (Spark III apps)</a></font></td></tr>

<tr valign="top" bgcolor="#EFEFEF"><td align="center"><img src="/icons/vwicnsr7.gif" border="0" height="12" width="12" alt="0%"></td><td nowrap><img src="/icons/vwicn004.gif" border="0" height="11" width="13" alt="Group Icon"></td><td nowrap><font size="2"><a href="/names.nsf/8525?OpenDocument">GSD Tech Support LAO333</a></font></td><td nowrap><font size="2"><a href="/names.nsf/85255e01000404a22?OpenDocument">NAB Group</a></font></td><td nowrap><font size="2"><a href="/names.nsf/85255e01a22?OpenDocument">GSD Technology Support associates to GSD Assistants. </a></font></td></tr>

Share this post


Link to post
Share on other sites
weaponx

Have you tried _IETableWriteToArray ?

Share this post


Link to post
Share on other sites
gcue

ooooo sounds awesome!!! wow

ill def give it a try.. many thanks weapon =)

Share this post


Link to post
Share on other sites
gcue

wow works beautifully!!!

thanks again weapon

and props to the creator

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
Sign in to follow this  

×