Bert Posted April 12, 2006 Share Posted April 12, 2006 I have a string that can look like the following example:APPLE POWERMAC G4|APPLE-IMAC|COMPAQ DC5000|COMPAQ DESKPRO 2000|COMPAQ DESKPRO 4000|COMPAQ DESKPRO 5100|COMPAQ DESKPRO 6000|COMPAQ DESKPRO EN|COMPAQ DESKPRO EN SFF|COMPAQ DESKPRO EN-CMT|COMPAQ DESKPRO EP|COMPAQ EVO D500|COMPAQ EVO D510|COMPAQ EVO D530|COMPAQ EVO D530 CMT|COMPAQ EVO W6000|COMPAQ EVO W8000|COMPAQ PROLIANT HD|COMPAQ PROLIANT ML330|COMPAQ PROLIANT ML380|COMPAQ PROSIGNIA|COMPAQ SP 750|COMPAQ XW6000|DELL OPTIPLEX|HP DC7100|IBM PC 300GL|IBM PC 300PL|UNISYS TELLER STATIONEach entry has a "|" separating them. What I'm looking to do is just extract from the list the ones that meet my need. For example, in the above string I want to capture any item that begins with the letter "C". Once captured, I want to put that information in a new string so my output would look like this: COMPAQ DC5000|COMPAQ DESKPRO 2000|COMPAQ DESKPRO 4000|COMPAQ DESKPRO 5100|COMPAQ DESKPRO 6000|COMPAQ DESKPRO EN|COMPAQ DESKPRO EN SFF|COMPAQ DESKPRO EN-CMT|COMPAQ DESKPRO EP|COMPAQ EVO D500|COMPAQ EVO D510|COMPAQ EVO D530|COMPAQ EVO D530 CMT|COMPAQ EVO W6000|COMPAQ EVO W8000|COMPAQ PROLIANT HD|COMPAQ PROLIANT ML330|COMPAQ PROLIANT ML380|COMPAQ PROSIGNIA|COMPAQ SP 750|COMPAQ XW6000I'm having to go through and sort a list that is 15,000 items long. The above example is one of many things I have to sort, so I need to make it so it can be easily changed to the criteria I need. I'm not sure on how to search for this, for as you can see, my question is very word intensive. I'm assuming this is just a couple lines of code. I'm just not sure how to do it. Thoughts? The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
nfwu Posted April 12, 2006 Share Posted April 12, 2006 $ary = StringSplit($data, "|") $stack = $_StackEmpty For $i = 1 to $ary[0] If StringLeft($ary[$i], 1) = "C" Then _StackPush($stack, $ary[$i]) EndIf Next ;;;;Now, $stack is your array of values.Requires my Stack UDFs:#region _Stack UDFs by nfwu Global Const $_StackEmpty = "Empty" Func _StackPop(ByRef $avArray) Local $sLastVal If (Not IsArray($avArray)) Then SetError(1) Return $_StackEmpty EndIf $sLastVal = $avArray[UBound($avArray) - 1] If UBound($avArray) = 1 Then $avArray = $_StackEmpty Else ReDim $avArray[UBound($avArray) - 1] EndIf Return $sLastVal EndFunc Func _StackPush(ByRef $avArray, $sValue) IF IsArray( $avArray ) Then ReDim $avArray[Ubound($avArray)+1] Else Dim $avArray[1] EndIf $avArray[UBound($avArray)-1] = $sValue SetError(0) Return 1 EndFunc #endregionYou could modify this to use regular expressions, if you want.#) TwitterOut of date stuff:Scripts: Sudoku Solver | Webserver | 3D library (Pure AutoIt) | Wood's GadgetsUDFs: _WoodUniqueID() | _DialogEditIni() | _Console*() | _GetIPConfigData() | _URLEncode/Decode() Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 I have a string that can look like the following example: Each entry has a "|" separating them. What I'm looking to do is just extract from the list the ones that meet my need. For example, in the above string I want to capture any item that begins with the letter "C". Once captured, I want to put that information in a new string so my output would look like this: I'm having to go through and sort a list that is 15,000 items long. The above example is one of many things I have to sort, so I need to make it so it can be easily changed to the criteria I need. I'm not sure on how to search for this, for as you can see, my question is very word intensive. I'm assuming this is just a couple lines of code. I'm just not sure how to do it. Thoughts?From the help file: StringRegExp ( "test", "pattern" [, flag ] ) Flag Values: 0 Return true/false (1/0) as to whether the test matched the pattern. 1 Return an array with the text that matched all the group patterns. Check @Extended to determine whether the pattern matched or not. 2 Same as 0. 3 Perform a global search, checking the entire string, returning an array of all results. Check @Extended to determine whether the pattern matched or not. So somthing like this should give you somthing to work with: NOTE: I have not tested so its a sugestion to work with $data= "|Compaq test|HP test|Code test|" $arr = StringRegExp("|C.*|",$data,3) Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
nfwu Posted April 12, 2006 Share Posted April 12, 2006 @Uten: Now that is a smarter method! #) TwitterOut of date stuff:Scripts: Sudoku Solver | Webserver | 3D library (Pure AutoIt) | Wood's GadgetsUDFs: _WoodUniqueID() | _DialogEditIni() | _Console*() | _GetIPConfigData() | _URLEncode/Decode() Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 (edited) Tried it, and it didn't work I get this in Scite for output:>AutoIT3.exe ended.rc:128 Edited April 12, 2006 by vollyman The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
LostUser Posted April 12, 2006 Share Posted April 12, 2006 So somthing like this should give you somthing to work with: NOTE: I have not tested so its a sugestion to work with $data= "|Compaq test|HP test|Code test|" $arr = StringRegExp("|C.*|",$data,3) Very smart and short. vollyman, you just need to make sure that your data file/string contains the | character as the first and last characters like vollyman's otherwise you could miss the first and last possible data matches. Be open minded but not gullible.A hammer sees everything as a nail ... so don't be A tool ... be many tools. Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 I tried that. I tried every which way possible, and it doesn't report anything. I tried the following to get a responce: "|C.*|" "C.*" "C" I get nothing for a return string. The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 (edited) I even tried this, and this didn't work either. WHAT AM I DOING WRONG??? $data= "Compaq test|HP test|Code test|" $arr = StringRegExp("Compaq test|HP test|Code test|",$data,1) MsgBox(0,"",$arr);test to see if desired string is correct I still get a blank for a return string. $data= "Compaq test|HP test|Code test|" $arr = StringRegExp("|C.*|",$data,0) MsgBox(0,"",$arr) this gives a blank for a return string. Edited April 12, 2006 by vollyman The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 NOTE: In my sample post I goofed and switched the pattern and the data entries in the StringRegExp call. Sorry about that But, StringRegExp does not behave as I expected, not even after reading the documentation. So some research is in its place. My test code, reveals that there is no array returned. #include <Array.au3> $data= ";Compaq test;HP test;Code test;" $arr = StringRegExp($data,"C.*t",3) ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) if not IsArray($arr) then ConsoleWrite("ERROR: Did not return array" & @LF) _ArrayDisplay($arr, "Result") I thought a | could have special meaning, and it does (in the pattern) so I have removed it in this test code. Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 Tried this also, and I get a blank for a return expandcollapse popupGlobal Const $_StackEmpty = "Empty" $data= "Compaq test|HP test|Code test|Compaq test2|Compaq test3" ;$arr = StringRegExp("[Compaq]",$data,3) $ary = StringSplit($data, "|") $stack = $_StackEmpty For $i = 1 to $ary[0] If StringLeft($ary[$i], 1) = "C" Then _StackPush($stack, $ary[$i]) EndIf Next ;;;;Now, $stack is your array of values. MsgBox(0,"",$stack);test to see if desired string is correct #region _Stack UDFs by nfwu Func _StackPop(ByRef $avArray) Local $sLastVal If (Not IsArray($avArray)) Then SetError(1) Return $_StackEmpty EndIf $sLastVal = $avArray[UBound($avArray) - 1] If UBound($avArray) = 1 Then $avArray = $_StackEmpty Else ReDim $avArray[UBound($avArray) - 1] EndIf Return $sLastVal EndFunc Func _StackPush(ByRef $avArray, $sValue) IF IsArray( $avArray ) Then ReDim $avArray[Ubound($avArray)+1] Else Dim $avArray[1] EndIf $avArray[UBound($avArray)-1] = $sValue SetError(0) Return 1 EndFunc #endregion The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 Turns out we have to wrap up any pattern in (). Like this: #include <Array.au3> $data= ";Compaq test;HP test;Code test;" $arr = StringRegExp($data,"(;C[a-zA-Z0-9 ]*;)",3) ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) if not IsArray($arr) then ConsoleWrite("ERROR: Did not return array" & @LF) _ArrayDisplay($arr, "Result") So no you have to sort out the |. Think I leav that out as an exerice Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 (edited) Some background. I have a program that reads the string and puts it into a list. The string must be in this format for the program to read it: item1|item2|item3 What I'm attempting is to design a update script that pulls data from a 15,000+ item excel spreadsheet and transforms it to the string format I need. I can do this part and put the data into a string, but I need to pull out sections depending on what the items are, such as all compaq items. when I pull the items, I need to create a string with just those items, and in the same format. If you look in my first post, you will see what I mean. I really need to solve this problem with working code. Edited April 12, 2006 by vollyman The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 I need the "|" for the list, and in some cases, my list can be over 5000 items.Also, I need the output string in the format like this: item1|item2|item3Having it listed in an array wont work for what I need it to do.So you have to find a way to include the | or find another solution, whatever suites you best. Happy hunting Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 Turns out we have to wrap up any pattern in (). Like this: #include <Array.au3> $data= ";Compaq test;HP test;Code test;" $arr = StringRegExp($data,"(;C[a-zA-Z0-9 ]*;)",3) ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) if not IsArray($arr) then ConsoleWrite("ERROR: Did not return array" & @LF) _ArrayDisplay($arr, "Result") So no you have to sort out the |. Think I leav that out as an exerice Some background. I have a program that reads the string and puts it into a list. The string must be in this format for the program to read it: item1|item2|item3 What I'm attempting is to design a update script that pulls data from a 15,000+ item excel spreadsheet and transforms it to the string format I need. I can do this part and put the data into a string, but I need to pull out sections depending on what the items are, such as all compaq items. when I pull the items, I need to create a string with just those items, and in the same format. If you look in my first post, you will see what I mean. I really need to solve this problem with working code. Obviously your out of imagination at the moment @vollyman. You hava a solution solving what you want, you just have to tweek it a bit. But it is not good enough for you since it does not give you the entier solution. Even thought I'm abit pissed by that kind of atitude (you probably have a good exuse, and I probably have it my selfe from time to time, and you did modify your post to the better). So I wil give you the solution: #include <Array.au3> $data= "|Compaq test|HP test|Code test|Some thinge else|" $arr = StringRegExpReplace($data,"(\|[^C][a-zA-Z0-9 ]*\|)","|") ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) ConsoleWrite("$arr:=" & $arr & @LF) Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 I'm really not trying to be annoying. believe me. I've been beating my head against the wall about this for several days now in fustration. sorry to offend. I tried the solution you gave. It doesn't work. Again, I'm not trying to be a pain in the butt here. The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 (edited) I'm really not trying to be annoying. believe me. I've been beating my head against the wall about this for several days now in fustration. sorry to offend.I know how it is, realy. I tried the solution you gave. It doesn't work. Again, I'm not trying to be a pain in the butt here. What did it return. On my system it returns: @error:=0, @Extended:=2 $arr:=|Compaq test|Code test| Witch is what you want, is it not? It is probably a matter of creating the right regexp, wiche can be a real pain in the but EDIT: I did not notice this before I posted but @Extended returns a warning even as the result is as expected. Edited April 12, 2006 by Uten Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 I'm getting a error saying it didn't retrun the array. I tried this, and it works, but it is messy. Also, I noticed it missed one of the items: #include <Array.au3> $_dataget= ";Compaq test;HP test;Code test;Compaq test1;HP test;Code test1;" $replace = StringReplace($_dataget,"|",";") ;MsgBox(0,"",$replace) $arr = StringRegExp($_dataget,"(;C[a-zA-Z0-9 ]*;)",3) ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) if not IsArray($arr) then ConsoleWrite("ERROR: Did not return array" & @LF) ;_ArrayDisplay($arr, "Result") $result1 =_ArrayToString($arr,"+") $replace2 = StringReplace($result1,";","|") $replace3 = StringReplace($replace2,"+","") $replace2a = StringReplace($replace3,"||","|") MsgBox(0, "this is the result",$replace2a) The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 Ok, I must have made a mistake, for your code is working, but in testing I changed the data to look like this:$data= "|Compaq test|HP test|Other data|Even more data|Code test|Some thinge else|"and the output would be |Compaq test|Other data|Code test|I tried this data: "|Compaq test|HP test|Other data|Even more data|Code test|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else" and got this: |Compaq test|Other data|Code test|Some thinge else|Some thinge else|Some thinge else|Some thinge else The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Uten Posted April 12, 2006 Share Posted April 12, 2006 Ok, I must have made a mistake, for your code is working, but in testing I changed the data to look like this:$data= "|Compaq test|HP test|Other data|Even more data|Code test|Some thinge else|" and the output would be |Compaq test|Other data|Code test| I tried this data: "|Compaq test|HP test|Other data|Even more data|Code test|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else" and got this: |Compaq test|Other data|Code test|Some thinge else|Some thinge else|Some thinge else|Some thinge else I think the StringRegExpReplace has some odd behaviour. Probably due to the fact that we replace a part of the string with somthing we want to be a part of the next search. So try this: #include <Array.au3> ;$data = "|Compaq test|HP test|Other data|Even more data|Code test|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|Some thinge else|" $data = "|APPLE POWERMAC G4|APPLE-IMAC|COMPAQ DC5000|COMPAQ DESKPRO 2000|COMPAQ DESKPRO 4000|COMPAQ DESKPRO 5100|COMPAQ DESKPRO 6000|COMPAQ DESKPRO EN|COMPAQ DESKPRO EN SFF|COMPAQ DESKPRO EN-CMT|COMPAQ DESKPRO EP|COMPAQ EVO D500|COMPAQ EVO D510|COMPAQ EVO D530|COMPAQ EVO D530 CMT|COMPAQ EVO W6000|COMPAQ EVO W8000|COMPAQ PROLIANT HD|COMPAQ PROLIANT ML330|COMPAQ PROLIANT ML380|COMPAQ PROSIGNIA|COMPAQ SP 750|COMPAQ XW6000|DELL OPTIPLEX|HP DC7100|IBM PC 300GL|IBM PC 300PL|UNISYS TELLER STATION|" do $data = StringRegExpReplace($data,"([^a-zA-Z0-9 ][^C][a-zA-Z0-9 ]*[^a-zA-Z0-9 ])","|") $res = @extended until $res <= 0 ;$arr = StringRegExpReplace($arr,"([\|][^C][a-zA-Z0-9 ]*[\|])","|") ConsoleWrite("@error:=" & @error & ", @Extended:=" & @extended & @LF) ConsoleWrite("$arr:=" & $data & @LF) On my system it ruturns: @error:=0, @Extended:=0 $arr:=|COMPAQ DC5000|COMPAQ DESKPRO 2000|COMPAQ DESKPRO 4000|COMPAQ DESKPRO 5100|COMPAQ DESKPRO 6000|COMPAQ DESKPRO EN|COMPAQ DESKPRO EN SFF|COMPAQ DESKPRO EN-CMT|COMPAQ DESKPRO EP|COMPAQ EVO D500|COMPAQ EVO D510|COMPAQ EVO D530|COMPAQ EVO D530 CMT|COMPAQ EVO W6000|COMPAQ EVO W8000|COMPAQ PROLIANT HD|COMPAQ PROLIANT ML330|COMPAQ PROLIANT ML380|COMPAQ PROSIGNIA|COMPAQ SP 750|COMPAQ XW6000| Please keep your sig. small! Use the help file. Search the forum. Then ask unresolved questions :) Script plugin demo, Simple Trace udf, TrayMenuEx udf, IOChatter demo, freebasic multithreaded dll sample, PostMessage, Aspell, Code profiling Link to comment Share on other sites More sharing options...
Bert Posted April 12, 2006 Author Share Posted April 12, 2006 I tried the script, and decided to try it with a different string to see what would happen. I had the script connect to the script that would put out the string I need to pull data from. I wanted to capture every application we list in the sheet. In the "C" section, we list about 50 applications. When I run it against the script you gave me, it kicked out only 17. hmmmm.... This search string thing is quite tricky to do. The Vollatran project My blog: http://www.vollysinterestingshit.com/ Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now