
RegExp to trim a part of a string?


Gianni

Hi all,

Is there a regular expression pattern that can remove part of a string by position, that is, by specifying the start and end character positions of the block to remove, so that the result is a new string without the indicated "piece"?

For example:

Global $sString = "Today I do not feel good"
; chars:           000000000111111111122222
;                  123456789012345678901234

; remove a chunk from char 9 to char 15
MsgBox(0, '', _StringTrimMid($sString, 9, 15))


; Returns the string trimmed by characters from $iStartCut to $iEndCut.
Func _StringTrimMid($sInput, $iStartCut, $iEndCut)

    Return StringLeft($sInput, $iStartCut - 1) & StringMid($sInput, $iEndCut + 1)

    ; Can the same be done with a RegExp?

EndFunc   ;==>_StringTrimMid

 

 

Chimp

small minds discuss people average minds discuss events great minds discuss ideas.... and use AutoIt....


@Nine I was just about to post a very similar solution, which I translated from here. 😄

Here's what I came up with --

Func _StringTrimMid($sInput, $iStartCut, $iEndCut)

    Local $lFind = "^(.{" & $iStartCut - 1 & "}).{" & ($iEndCut - $iStartCut + 1) & "}"
    Local $lResult = StringRegExpReplace($sInput, $lFind, "\1")

    Return $lResult
EndFunc   ;==>_StringTrimMid
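
To make the construction easier to follow, here is a minimal sketch that prints the pattern this function builds for the example in the first post (start 9, end 15) and applies it. The variable names are illustrative only; the output comments assume the 24-character sample string from the question.

```autoit
; Sketch: expand the pattern for the thread's example and apply it.
Local $sString = "Today I do not feel good"
Local $iStartCut = 9, $iEndCut = 15

; Group 1 keeps the first ($iStartCut - 1) characters;
; the second quantifier consumes the chunk to drop.
Local $sPattern = "^(.{" & ($iStartCut - 1) & "}).{" & ($iEndCut - $iStartCut + 1) & "}"

ConsoleWrite($sPattern & @CRLF)                                      ; ^(.{8}).{7}
ConsoleWrite(StringRegExpReplace($sString, $sPattern, "\1") & @CRLF) ; Today I feel good
```

The replacement "\1" writes back only the captured prefix, and everything after the match is left untouched, so the result is the prefix followed by the remainder of the string.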

 


I'd be tempted to prepend (?s) aka PCRE_DOTALL so that the dot matches newline sequences as well, if ever the case occurs. It all depends on the use case context.

This wonderful site allows debugging and testing regular expressions (many flavors available). An absolute must have in your bookmarks.
Another excellent RegExp tutorial. Don't forget downloading your copy of up-to-date pcretest.exe and pcregrep.exe here
RegExp tutorial: enough to get started
PCRE v8.33 regexp documentation latest available release and currently implemented in AutoIt beta.

SQLitespeed is another feature-rich premier SQLite manager (includes import/export). Well worth a try.
SQLite Expert (freeware Personal Edition or payware Pro version) is a very useful SQLite database manager.
An excellent eBook covering almost every aspect of SQLite3: a must-read for anyone doing serious work.
SQL tutorial (covers "generic" SQL, but most of it applies to SQLite as well)
A work-in-progress SQLite3 tutorial. Don't miss other LxyzTHW pages!
SQLite official website with full documentation (may be newer than the SQLite library that comes standard with AutoIt)


Thanks @Nine and @Danp2 :)

31 minutes ago, jchd said:

I'd be tempted to prepend (?s) aka PCRE_DOTALL so that the dot matches newline sequences as well, if ever the case occurs. It all depends on the use case context.

Thanks @jchd

... the use case should be as general as possible ...
What would your pattern look like if it also has to allow @CR / @CRLF (and @TAB)?

Thanks

 


By default . (dot) matches everything except newlines. By prepending (?s) to the pattern, subsequent . will match newlines as well. TABs are matched by . in all cases. To be completely generic you may even want to add (*UCP) at the very start of the pattern: this way, extra Unicode newline sequences are matched as well when using (?s).
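
A small sketch of the difference, assuming a sample string with one @CRLF after "Today " (so the line break occupies characters 7 and 8). The patterns are hard-coded here for start 11 and end 17; the variable names are illustrative only.

```autoit
Local $sString = "Today " & @CRLF & "I do not feel good"

; Without (?s): the dot stops at the line break, so ^(.{10}) cannot match
; and the input comes back unchanged.
Local $sNoDotAll = StringRegExpReplace($sString, "^(.{10}).{7}", "\1")
ConsoleWrite(($sNoDotAll == $sString) & @CRLF) ; True

; With (?s): the dot also consumes @CR and @LF, so characters 11 to 17 are removed.
Local $sDotAll = StringRegExpReplace($sString, "(?s)^(.{10}).{7}", "\1")
ConsoleWrite(StringReplace($sDotAll, @CRLF, "<CRLF>") & @CRLF) ; Today <CRLF>I feel good
```

Note that @CRLF counts as two characters when computing the cut positions.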


21 minutes ago, jchd said:

By default . (dot) matches everything except newlines. By prepending (?s) to the pattern, subsequent . will match newlines as well. TABs are matched by . in all cases. To be completely generic you may even want to add (*UCP) at the very start of the pattern: this way, extra Unicode newline sequences are matched as well when using (?s).

.... it would seem that you are right ..... :think:
... I'm kidding of course, ... I had no doubt about that :thumbsup:

Thank You @jchd

Global $sString = "Today " & @CRLF & "I do not feel good"
; chars:           000000     00      011111111112222222
;                  123456     78      901234567890123456

; remove a chunk from char 11 to char 17
MsgBox(0, '', _StringTrimMid($sString, 11, 17))


; Returns the string trimmed by characters from $iStartCut to $iEndCut.
Func _StringTrimMid($sInput, $iStartCut, $iEndCut)

    Local $lFind = "(*UCP)(?s)" & "^(.{" & $iStartCut - 1 & "}).{" & ($iEndCut - $iStartCut + 1) & "}"
    Local $lResult = StringRegExpReplace($sInput, $lFind, "\1")

    Return $lResult

EndFunc   ;==>_StringTrimMid
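
One behavioural difference from the StringLeft/StringMid version may be worth knowing: the exact quantifiers only match when the string is at least $iEndCut characters long, so a cut that runs past the end leaves the input unchanged instead of truncating it. The sketch below is one possible way around that, using bounded quantifiers; _StringTrimMidLoose is a hypothetical name, not something from this thread, and (*UCP) is left out for brevity.

```autoit
; Hypothetical variant: tolerate cut positions beyond the end of the string.
Func _StringTrimMidLoose($sInput, $iStartCut, $iEndCut)
    ; {0,n} matches up to n characters, so a short input still matches.
    Local $sPattern = "(?s)^(.{0," & ($iStartCut - 1) & "}).{0," & ($iEndCut - $iStartCut + 1) & "}"
    Return StringRegExpReplace($sInput, $sPattern, "\1")
EndFunc   ;==>_StringTrimMidLoose

Local $sString = "Today I do not feel good"

; Exact quantifiers: no match, so the whole string is returned.
ConsoleWrite("[" & StringRegExpReplace($sString, "(?s)^(.{8}).{92}", "\1") & "]" & @CRLF) ; [Today I do not feel good]

; Bounded quantifiers: everything from character 9 onward is removed.
ConsoleWrite("[" & _StringTrimMidLoose($sString, 9, 100) & "]" & @CRLF) ; [Today I ]
```

For in-range positions both variants should give the same result, since the greedy quantifiers take the full counts whenever enough characters are available.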

 

Edited by Chimp

 


_StringWordTrimMid ^_^

#include <Array.au3>

Global $sString = "Today " & @CRLF & @CRLF & "I do not feel good i won't quit!"

MsgBox(0, '', _StringWordTrimMid($sString, 2, 6))

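; Returns the string with the words from index $iStartCut to $iEndCut removed.
; Word indexes are 0-based because they are passed to _ArrayDelete as a range.
; [\H]+ splits on horizontal whitespace only, so line breaks stay attached to the adjacent word.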
Func _StringWordTrimMid($sInput, $iStartCut, $iEndCut)
    Local $a = StringRegExp($sInput, '[\H]+', 3)
    _ArrayDelete($a, $iStartCut & "-" & $iEndCut)
    Return _ArrayToString($a, " ")
EndFunc

 
