Jump to content

RegExp - question - new line between two different start-up lines


Go to solution Solved by Malkey,

Recommended Posts

Posted

I have this kind of source data:

TEST1   2013-02-08  some text
TEST1   2013-2-08   some text
TEST1   2013-01-15  some text
TEST1   2013-01-15  some text
TEST1   2013-01-03  some text
TEST2   2013-1-17   some text
TEST2   2013-1-18   some text
TEST2   2013-1-21   some text
 
I wondered how using SRE add a new line, but only at the point where the difference in the first word, ie in the present case, separate lines starting with TEST1 from the line starting with TEST2 
 
Can someone give me a hint?

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

  • Moderators
Posted

mLipok,

Why use an SRE - I see no advantage here. Read the file into an array, run through it until you find a changeover point, insert the new line, and then save the new array. ;)

M23

Public_Domain.png.2d871819fcb9957cf44f4514551a2935.png Any of my own code posted anywhere on the forum is available for use by others without any restriction of any kind

Open spoiler to see my UDFs:

  Reveal hidden contents

 

Posted

You light up my mind as a light bulb  :idea:

Thanks.

I just use your idea.

But the question is still interesting for me. 
As someone who has a solution for RegExp I would be grateful.

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

  • Moderators
Posted

mLipok,

When I first started learning about SREs, GEOSoft gave me the best piece of advice I ever got concerning them - "learn when not to use them". I now pass the same advice on to you (and anyone else reading). :)

M23

Public_Domain.png.2d871819fcb9957cf44f4514551a2935.png Any of my own code posted anywhere on the forum is available for use by others without any restriction of any kind

Open spoiler to see my UDFs:

  Reveal hidden contents

 

Posted
Not so long ago I read the same "golden idea" / anecdote from another member.
 
Just tell me if by this time the idea in your case, turned out to be right.

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

Posted (edited)

Yes ... absolutely

Why do it simple when it can be complicated ?

$txt = _
"TEST1   2013-02-08  some text" & @crlf & _
"TEST1   2013-2-08   some text" & @crlf & _
"TEST1   2013-01-15  some text" & @crlf & _
"TEST2   2013-01-15  some text" & @crlf & _
"TEST2   2013-01-03  some text" & @crlf & _
"TEST3   2013-1-17   some text" & @crlf & _
"TEST4   2013-1-18   some text" & @crlf & _
"TEST4   2013-1-19   some text" & @crlf & _
"TEST5   2013-1-21   some text"

$txt = StringRegExpReplace($txt, '((?<=\v)(\S+).+\R(?!\2))', '$1' & @crlf )

Msgbox(0,"", $txt)

Melba, read and agreed :)

Edited by mikell
Posted
Super 
Thanks a lot. 
I am grateful.
 
mLipok

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

Posted

Sweet regular expression.

UDF List:

  Reveal hidden contents

Updated: 22/04/2018

Posted

A fine adjustment needed.

Because of this "(?<=v)(S+)", the first line will not match because it does not have a preceeding vertical white space. Therefore, when using this post's example's test data, "StringRegExpReplace($txt, '((?<=v)(S+).+R(?!2))', '$1' & @crlf )" would fail to put an extra newline after the first line.
Otherwise it works great.

Local $txt = _
        "TEST1   2013-02-08  some text" & @CRLF & _
        "TEST2   2013-2-08   some text" & @CRLF & _
        "TEST2   2013-01-15  some text" & @CRLF & _
        "TEST2   2013-01-15  some text" & @CRLF & _
        "TEST2   2013-01-03  some text" & @LF & _
        "TEST3   2013-1-17   some text" & @LF & _
        "TEST4   2013-1-18   some text" & @LF & _
        "TEST4   2013-1-19   some text" & @LF & _
        "TEST5   2013-1-21   some text"

;$txt = StringRegExpReplace($txt, '(((?<=\v)\S+)\N+\R(?!\2))',  '$1' & @crlf )
$txt = StringRegExpReplace($txt, '(?m)((^\S+)\V+\R(?!\2))', '$1' & @CRLF)

ConsoleWrite($txt & @LF)
Posted
I worked a bit on this and I wanted do a little more complicated task. 
But unfortunately I lost. 
 
Here is What I trying:
Source string is a content copied from XLS.
Column are separated by @TAB. 
First column ie. TEST1 or TEST2 can contain whitechar.
 
 
Local $txt = ''
$txt &= "TEST1   2013-02-08  some text" & @CRLF 
; HERE I want to add new line
$txt &= "TEST 2   2013-2-08   some text" & @CRLF
$txt &= "TEST 2   2013-01-15  some text" & @CRLF
$txt &= "TEST 2   2013-01-15  some text" & @CRLF
$txt &= "TEST 2   2013-01-03  some text" & @LF
; HERE I want to add new line
$txt &= "TEST3   2013-1-17   some text" & @LF
; HERE I want to add new line
$txt &= "TEST4   2013-1-17   some text" & @LF
$txt &= "TEST4   2013-1-18   some text" & @LF
$txt &= "TEST4   2013-1-18   some text" & @LF
; HERE I want to add new line
$txt &= "TEST5   2013-1-21   some text" & @LF
$txt &= "TEST5   2013-1-21   some text" & @CRLF
; HERE I want to add new line
$txt &= "TEST 5   2013-1-21   some text" & @LF

;$txt = StringRegExpReplace($txt, '(((?<=\v)\S+)\N+\R(?!\2))',  '$1' & @crlf )
$txt = StringRegExpReplace($txt, '((?<=\v|^)(\S+).+\R(?!\2))',  '$1' & @crlf )

ConsoleWrite($txt & @LF)

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

Posted (edited)

The correction to do is really a minor one, simple adaptation to new circumstances

Here you only need to match non-tab characters in the first group

Local $txt = ''
$txt &= "TEST1" &@TAB& "2013-02-08" &@TAB& "some text" & @CRLF 
; HERE I want to add new line
$txt &= "TEST 2" &@TAB& "2013-2-08" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-15" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-15" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-03" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST3" &@TAB& "2013-1-17" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST4" &@TAB& "2013-1-17" &@TAB& "some text" & @LF
$txt &= "TEST4" &@TAB& "2013-1-18" &@TAB& "some text" & @LF
$txt &= "TEST4" &@TAB& "2013-1-18" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST5" &@TAB& "2013-1-21" &@TAB& "some text" & @LF
$txt &= "TEST5" &@TAB& "2013-1-21" &@TAB& "some text" & @CRLF
; HERE I want to add new line
$txt &= "TEST 5" &@TAB& "2013-1-21" &@TAB& "some text" 

$txt = StringRegExpReplace($txt, '((?<=\v|^)([^\t]+).+\R(?!\2))', '$1' & @crlf )
Msgbox(0,"", $txt)

Or this using Malkey's code

$txt = StringRegExpReplace($txt, '(?m)((^[^\t]+)\V+\R(?!\2))', '$1' & @CRLF)
Edited by mikell
  • Moderators
Posted

mikell,

Well done. :thumbsup:

It seems I was once again too pessimistic about the capabilities of RegExes. But if you add in the time it would have taken me to develop a pattern like that (close to the known age of the universe I expect ;)) using an SRE would have taken longer for me than my initial suggestion! :D

M23

Public_Domain.png.2d871819fcb9957cf44f4514551a2935.png Any of my own code posted anywhere on the forum is available for use by others without any restriction of any kind

Open spoiler to see my UDFs:

  Reveal hidden contents

 

Posted

  On 1/30/2014 at 11:31 AM, mikell said:

 

The correction to do is really a minor one, simple adaptation to new circumstances

Here you only need to match non-tab characters in the first group

Local $txt = ''
$txt &= "TEST1" &@TAB& "2013-02-08" &@TAB& "some text" & @CRLF 
; HERE I want to add new line
$txt &= "TEST 2" &@TAB& "2013-2-08" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-15" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-15" &@TAB& "some text" & @CRLF
$txt &= "TEST 2" &@TAB& "2013-01-03" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST3" &@TAB& "2013-1-17" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST4" &@TAB& "2013-1-17" &@TAB& "some text" & @LF
$txt &= "TEST4" &@TAB& "2013-1-18" &@TAB& "some text" & @LF
$txt &= "TEST4" &@TAB& "2013-1-18" &@TAB& "some text" & @LF
; HERE I want to add new line
$txt &= "TEST5" &@TAB& "2013-1-21" &@TAB& "some text" & @LF
$txt &= "TEST5" &@TAB& "2013-1-21" &@TAB& "some text" & @CRLF
; HERE I want to add new line
$txt &= "TEST 5" &@TAB& "2013-1-21" &@TAB& "some text" 

$txt = StringRegExpReplace($txt, '((?<=\v|^)([^\t]+).+\R(?!\2))', '$1' & @crlf )
Msgbox(0,"", $txt)

Or this using Malkey's code

$txt = StringRegExpReplace($txt, '(?m)((^[^\t]+)\V+\R(?!\2))', '$1' & @CRLF)

 

Thanks

I try tonight

  On 1/30/2014 at 11:39 AM, Melba23 said:

mikell,

Well done. :thumbsup:

It seems I was once again too pessimistic about the capabilities of RegExes. But if you add in the time it would have taken me to develop a pattern like that (close to the known age of the universe I expect ;)) using an SRE would have taken longer for me than my initial suggestion! :D

M23

 

I will try to do tonight benchmarking performance.

Signature beginning:
Please remember: "AutoIt"..... *  Wondering who uses AutoIt and what it can be used for ? * Forum Rules *
ADO.au3 UDF * POP3.au3 UDF * XML.au3 UDF * IE on Windows 11 * How to ask ChatGPT for AutoIt Codefor other useful stuff click the following button:

  Reveal hidden contents

Signature last update: 2023-04-24

Posted (edited)

  On 1/30/2014 at 11:39 AM, Melba23 said:

But if you add in the time it would have taken me to develop a pattern like that (close to the known age of the universe I expect ;)) using an SRE would have taken longer for me than my initial suggestion! :D

M23

 

Melba,

Really too pessimistic about yourself !

Although you precisely described the purpose of this forum, didn't you ? :)

Edited by mikell
  • Solution
Posted

Another fine adjustment needed.

When the test data has a trailing linefeed as in mLipok's example of post #11, another additional linefeed is added. So "|z"is added to the RE pattern to stop the additional trailing linefeed being added to the output.

Local $txt = ''
$txt &= "TEST1"  & @TAB & "2013-02-08" & @TAB & "some text" & @CRLF
; HERE I want to add new line
$txt &= "TEST 2" & @TAB & "2013-2-08" & @TAB & "some text" & @CRLF
$txt &= "TEST 2" & @TAB & "2013-01-15" & @TAB & "some text" & @CRLF
$txt &= "TEST 2" & @TAB & "2013-01-15" & @TAB & "some text" & @CRLF
$txt &= "TEST 2" & @TAB & "2013-01-03" & @TAB & "some text" & @LF
; HERE I want to add new line
$txt &= "TEST3"  & @TAB & "2013-1-17" & @TAB & "some text" & @LF
; HERE I want to add new line
$txt &= "TEST4"  & @TAB & "2013-1-17" & @TAB & "some text" & @LF
$txt &= "TEST4"  & @TAB & "2013-1-18" & @TAB & "some text" & @LF
$txt &= "TEST4"  & @TAB & "2013-1-18" & @TAB & "some text" & @LF
; HERE I want to add new line
$txt &= "TEST5"  & @TAB & "2013-1-21" & @TAB & "some text" & @LF
$txt &= "TEST5"  & @TAB & "2013-1-21" & @TAB & "some text" & @CRLF
; HERE I want to add new line
$txt &= "TEST 5" & @TAB & "2013-1-21" & @TAB & "some text" & @LF

;$txt = StringRegExpReplace($txt, '((?<=\v|^)([^\t]+).+\R(?!\2))', '$1' & @crlf )    ; mikell's original
;$txt = StringRegExpReplace($txt, '((?<=\v|^)([^\t]+).+\R(?!\2|\z))', '$1' & @crlf ) ;  mikell's corrected for trailing @LF.
$txt = StringRegExpReplace($txt, '(?m)((^[^\t]+)\V+\R(?!\2|\z))', '$1' & @CRLF)      ; Malkey's corrected for trailing @LF.

ConsoleWrite($txt & @LF)

 

Posted

Exactly the same issue.

  Reveal hidden contents

This wonderful site allows debugging and testing regular expressions (many flavors available). An absolute must have in your bookmarks.
Another excellent RegExp tutorial. Don't forget downloading your copy of up-to-date pcretest.exe and pcregrep.exe here
RegExp tutorial: enough to get started
PCRE v8.33 regexp documentation latest available release and currently implemented in AutoIt beta.

SQLitespeed is another feature-rich premier SQLite manager (includes import/export). Well worth a try.
SQLite Expert (freeware Personal Edition or payware Pro version) is a very useful SQLite database manager.
An excellent eBook covering almost every aspect of SQLite3: a must-read for anyone doing serious work.
SQL tutorial (covers "generic" SQL, but most of it applies to SQLite as well)
A work-in-progress SQLite3 tutorial. Don't miss other LxyzTHW pages!
SQLite official website with full documentation (may be newer than the SQLite library that comes standard with AutoIt)

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...