Tamas Posted May 9, 2010 Share Posted May 9, 2010 Hi All! I have a simple question. How can i get URL address from html source code? I would like to choose by name the class in the code. For example: html code: <H1 class=vezeto><A href="/x.php?id=inxcl&url=http%3A%2F%2Fsportgeza.hu%2Fforma1%2F2010%2F05%2F09%2Fwebber_megallithatatlan_volt_barcelonaban%2F">Webber megállíthatatlan volt Barcelonában</A></H1> what i need: the link from the 'vezeto' class, this: /x.php?id=inxcl&url=http%3A%2F%2Fsportgeza.hu%2Fforma1%2F2010%2F05%2F09%2Fwebber_megallithatatlan_volt_barcelonaban%2F in the code from " to " This link is always changing. I need a script that always reads the link. Thanks for the help! Link to comment Share on other sites More sharing options...
hawky358 Posted May 9, 2010 Share Posted May 9, 2010 (edited) I am assuming it's a php generated page, so the layout of the stuff around the link will always remain the same. You can work this section into your code If there are multiple vezeto, then you'll have to do some filtering $file = "1.html" $source = FileRead($file) $start = StringInStr($source,'<H1 class=vezeto><A href="') $end = StringInStr($source,'"',-1,2,$start) $link = StringMid($source,$start,$end-$start) $link = StringReplace($link,'<H1 class=vezeto><A href="',"") Edited May 9, 2010 by hawky358 Link to comment Share on other sites More sharing options...
Tamas Posted May 9, 2010 Author Share Posted May 9, 2010 this is a good solution, but if it would be possible without the file management, that would be the best! maybe with the _IEBodyReadHTML, or something.. (INet.au3) im trying to find the solution too Link to comment Share on other sites More sharing options...
hawky358 Posted May 9, 2010 Share Posted May 9, 2010 (edited) maybe with the _IEBodyReadHTMLThe thing I don't like about the _IE... functions is that it opens an IE window. Do you want no files whatsoever? This way you download the file, process it then delete it. I used Inetget() without background, but you can incorporate it to do background downloading if you want to. InetGet("http://www.google.com", "temp.html",1) ;do some work here; FileDelete("temp.html") If you REALLY don't want ANY files to be written I guess you can use _IE.... Edited May 9, 2010 by hawky358 Link to comment Share on other sites More sharing options...
Tamas Posted May 9, 2010 Author Share Posted May 9, 2010 Thank you!!! it's resolved without file managament with YOUR HELP (string concat) so thx! Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now