Removal of URLs

3 posts in this topic

Hello everyone,

I copied few information from website source codes but it has some kind of URLS but don't want those type of URLS in my final saved text file so what filter should I use to prevent copying URLS information in my final text file. I need just text not URLS in my final text file. 

Here is my coding

#include <IE.au3>
#include <String.au3>
#include <Array.au3>

;getting the page source code and storing it into text file for easy reading it and debugging

$file = fileopen(@scriptdir & "\source10.txt", 10)

$IE = _IECreate("http://www.softpedia.com/get/Tweak/System-Tweak/Edge-Blocker.shtml", 0, 0)

$source = _IEDocReadHTML($IE)

FileWrite($file, $source)

;extracting source code between <div class="grid_28 mgtop_30"> and <div class="_tabpage tabpage hidden legible specifications" id="specifications">

$target_source = _StringBetween($source, '<div class="data">', '</div>')

FileWrite ( "1.txt", $target_source[0])




Share this post

Link to post
Share on other sites

maybe try _IEBodyReadText instead of _IEDocReadHTML

if @error Then
	MsgBox(262192,"",@ComputerName&" slaps "&@UserName&" around a bit with a large trout!")

>Exit code: 0

REAL search results

"Yeah yeah yeah, patience. How long will that take?"  -Ed Gruberman

Share this post

Link to post
Share on other sites


It seems you cannot accept that we have limits on acceptable topics here.  Your first and second attempts were locked - and guess what? So is your third!

And moreover our patience is exhausted - you will be unable to post for a month. When and if you return, you had best respect our rules or you will be removed from the community. Furthermore, sending me a PM claiming that the use of the torrent website was "just an example" does not wash - any idea how many times we have heard that excuse?


Any of my own code posted anywhere on the forum is available for use by others without any restriction of any kind._______My UDFs:


ArrayMultiColSort ---- Sort arrays on multiple columns
ChooseFileFolder ---- Single and multiple selections from specified path treeview listing
Date_Time_Convert -- Easily convert date/time formats, including the language used
ExtMsgBox --------- A highly customisable replacement for MsgBox
GUIExtender -------- Extend and retract multiple sections within a GUI
GUIFrame ---------- Subdivide GUIs into many adjustable frames
GUIListViewEx ------- Insert, delete, move, drag, sort, edit and colour ListView items
GUITreeViewEx ------ Check/clear parent and child checkboxes in a TreeView
Marquee ----------- Scrolling tickertape GUIs
NoFocusLines ------- Remove the dotted focus lines from buttons, sliders, radios and checkboxes
Notify ------------- Small notifications on the edge of the display
Scrollbars ----------Automatically sized scrollbars with a single command
StringSize ---------- Automatically size controls to fit text
Toast -------------- Small GUIs which pop out of the notification area


Share this post

Link to post
Share on other sites
This topic is now closed to further replies.