Rorka Posted November 8, 2009 Share Posted November 8, 2009 Okay I want to rip text from a website, is this possible?I want to get the version number which is shown like this:Current Version: 0.2So I only require the 0.2 part.the website link is: ClickyThanks in advance Link to comment Share on other sites More sharing options...
jvanegmond Posted November 8, 2009 Share Posted November 8, 2009 Get text from website: InetGetSource Get text from text: StringInStr, StringMid, StringLeft, StringRegExp github.com/jvanegmond Link to comment Share on other sites More sharing options...
Mat Posted November 8, 2009 Share Posted November 8, 2009 (edited) IE functions will do it. Read the helpfile: User Defined Function ==> IE Management Mat Edit: But InetGet is better Edited November 8, 2009 by Mat AutoIt Project Listing Link to comment Share on other sites More sharing options...
danielkza Posted November 8, 2009 Share Posted November 8, 2009 Try that (worked for me): expandcollapse popup#AutoIt3Wrapper_Change2CUI=y #include <INet.au3> Global Const $URL = "http://www.mmowned.com/forums/bots-programs/266946-release-beta-theexplorer-updated-04-11-09-a.html" ConsoleWrite("Loading URL: " & $URL & @CRLF) Global $Source = _InetGetSource($URL) If Not StringLen($Source) Then ;MsgBox(16, @ScriptName, StringFormat("Error loading the following page.\n\nURL: %s", $URL)) ConsoleWrite("Error loading URL." & @CRLF) ; '" & $URL & "'" & @CRLF) Exit EndIf ; Find posts div Global $CurrentVersion = "" Global $posPos = 0 Global $divPos = StringRegExp($Source,'<div[^>]+id\s*=\s*"posts"[^>]*>', 0) If $divPos Then $divPos = @extended ; Find first post Global $PostID = StringRegExp($Source, '<div[^>]+id\s*=\s*"post_message_(\d+)"[^>]*>', 1, $divPos) $postPos = @extended If Not @error And StringIsInt($PostID[0]) Then ; Find Current version $CurrentVersion = StringRegExp($Source, "(?i)Current\s*Version[^:]*:\s*(\d+\.\d+)", 1, $postPos) If Not @error Then $CurrentVersion = $CurrentVersion[0] EndIf EndIf EndIf If Not StringLen($CurrentVersion) Then ;MsgBox(64, @ScriptName, "Current Version: " & $CurrentVersion) ConsoleWrite("Version not found" & @CRLF) Else ;MsgBox(16, @ScriptName, "Current Version not found.") ConsoleWrite("Current Version: " & $CurrentVersion & @CRLF) EndIf ; Sleep a little so it's actually possible to check the version If @Compiled Then Sleep(5000) EndIf Link to comment Share on other sites More sharing options...
Rorka Posted November 8, 2009 Author Share Posted November 8, 2009 Try that (worked for me): expandcollapse popup#AutoIt3Wrapper_Change2CUI=y #include <INet.au3> Global Const $URL = "http://www.mmowned.com/forums/bots-programs/266946-release-beta-theexplorer-updated-04-11-09-a.html" ConsoleWrite("Loading URL: " & $URL & @CRLF) Global $Source = _InetGetSource($URL) If Not StringLen($Source) Then ;MsgBox(16, @ScriptName, StringFormat("Error loading the following page.\n\nURL: %s", $URL)) ConsoleWrite("Error loading URL." & @CRLF) ; '" & $URL & "'" & @CRLF) Exit EndIf ; Find posts div Global $CurrentVersion = "" Global $posPos = 0 Global $divPos = StringRegExp($Source,'<div[^>]+id\s*=\s*"posts"[^>]*>', 0) If $divPos Then $divPos = @extended ; Find first post Global $PostID = StringRegExp($Source, '<div[^>]+id\s*=\s*"post_message_(\d+)"[^>]*>', 1, $divPos) $postPos = @extended If Not @error And StringIsInt($PostID[0]) Then ; Find Current version $CurrentVersion = StringRegExp($Source, "(?i)Current\s*Version[^:]*:\s*(\d+\.\d+)", 1, $postPos) If Not @error Then $CurrentVersion = $CurrentVersion[0] EndIf EndIf EndIf If Not StringLen($CurrentVersion) Then ;MsgBox(64, @ScriptName, "Current Version: " & $CurrentVersion) ConsoleWrite("Version not found" & @CRLF) Else ;MsgBox(16, @ScriptName, "Current Version not found.") ConsoleWrite("Current Version: " & $CurrentVersion & @CRLF) EndIf ; Sleep a little so it's actually possible to check the version If @Compiled Then Sleep(5000) EndIf Thanks. Link to comment Share on other sites More sharing options...
DaleHohm Posted November 8, 2009 Share Posted November 8, 2009 Edit: But InetGet is betterNot better, just different. If the content you want is created with dynamic HTML or is behind authentication, INetGet will not do it for you, but the IE functions will.Dale Free Internet Tools: DebugBar, AutoIt IE Builder, HTTP UDF, MODIV2, IE Developer Toolbar, IEDocMon, Fiddler, HTML Validator, WGet, curl MSDN docs: InternetExplorer Object, Document Object, Overviews and Tutorials, DHTML Objects, DHTML Events, WinHttpRequest, XmlHttpRequest, Cross-Frame Scripting, Office object model Automate input type=file (Related) Alternative to _IECreateEmbedded? better: _IECreatePseudoEmbedded Better Better? IE.au3 issues with Vista - Workarounds SciTe Debug mode - it's magic: #AutoIt3Wrapper_run_debug_mode=Y Doesn't work needs to be ripped out of the troubleshooting lexicon. It means that what you tried did not produce the results you expected. It begs the questions 1) what did you try?, 2) what did you expect? and 3) what happened instead? Reproducer: a small (the smallest?) piece of stand-alone code that demonstrates your trouble Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now