KickStarter15 Posted May 16, 2018 (edited)
Hi Experts,
Good day! I would like to ask whether it is possible to convert a table in image format into editable table text (I'm attaching sample tables in image format). I found this site that can do the trick, so maybe it is also possible in AutoIt. Sorry, but I could not get started on it myself. I also found the thread "Image to Text" posted by @NeoFoX a long time ago, but it still does not do what I want.
Any suggestions, or links to where I can start with this idea, if there are any? I'm also attaching the converted tables (in .docx format) from this site for reference.
Thanks in advance, Experts.
KS15
Attachments: Table1a.docx, Table1.docx
Edited May 16, 2018 by KickStarter15
Programming is "To make it so simple that there are obviously no deficiencies" or "To make it so complicated that there are no obvious deficiencies" by C.A.R. Hoare.
KickStarter15 (Author) Posted May 17, 2018
Do you think this is not possible with AutoIt scripting?
Andreik Posted May 17, 2018
It's not so much language related as technology related. Obviously, you need an OCR engine.
When the words fail... music speaks.
jdelaney Posted May 17, 2018
Search for MODI or Tesseract for OCR... this will be a difficult and unreliable undertaking.
aa2zz6 Posted May 17, 2018 (edited)
I use ocr.space to get text from our dimp sheets at work. The accuracy is really good. I made a post about using an API key, so you can probably find it somewhere on the forums. They offer free and paid services, but you probably won't scan more than 500 items per day.
Edited May 19, 2018 by aa2zz6
KickStarter15 (Author) Posted May 19, 2018
Thanks, Experts... Yeah, I know that this is not possible in AutoIt code; I was just checking whether someone knows how. I'll look into an OCR engine and see what ideas I can get from there, but if someone out there has an idea, other suggestions, or any opinions to share, that would be great. For now, I think my project will stay pending after all. I could not even write a single line of code for this. Wheeewhh!
aa2zz6 Posted May 19, 2018
15 hours ago, KickStarter15 said: "Thanks, Experts... Yeah, I know that this is not possible in AutoIt code; I was just checking whether someone knows how. I'll look into an OCR engine and see what ideas I can get from there, but if someone out there has an idea, other suggestions, or any opinions to share, that would be great. For now, I think my project will stay pending after all. I could not even write a single line of code for this. Wheeewhh!"
I do have something, but it's too large to post. If you'd like, I can upload it to Dropbox and invite you to it, unless there is another way. Maybe you can figure out a way to make it more accurate and share it with me.
KickStarter15 (Author) Posted May 21, 2018
@aa2zz6, Thanks. How about zipping the .au3 file and sending it to my inbox, would that be okay? I don't have Dropbox on my PC right now.
KickStarter15 (Author) Posted May 22, 2018
Hi Experts,
Just an additional idea: would it be possible to upload the file to this site using AutoIt? For example, I would create a GUI that uploads the file to that site and downloads the result. Can it be done without opening the site in a browser? Hmm, kind of weird, isn't it?
KickStarter15 (Author) Posted May 24, 2018
Hi Experts,
Any suggestions on how I can use AutoIt to upload my image file to that site? Or is there any other way to do this in AutoIt? I'm kind of lost right now and not sure how to do this, so I need your help on this one, guys.
KS15
FrancescoDiMuro Posted May 24, 2018
Hey @KickStarter15,
I suggest you use some software like Tesseract, which offers a command-line tool, and then automate it.
Feel free to ask if something is not clear.
Have a good day. Best Regards.
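Automating the Tesseract command-line tool from AutoIt, as suggested above, could look roughly like the sketch below. It assumes tesseract.exe sits at the path shown and that the basic `tesseract <image> <outputbase>` invocation is enough. The folder layout, the image name Table1.png, and the output name are placeholders for illustration, not anyone's actual setup.

```autoit
; Minimal sketch: run the Tesseract CLI on one image and read back the text.
; All paths below are hypothetical placeholders; adjust them to your setup.
Local Const $sTesseractExe = @ScriptDir & "\tesseract\tesseract.exe"
Local Const $sImagePath = @ScriptDir & "\Table1.png"
Local Const $sOutputBase = @ScriptDir & "\Table1_ocr" ; Tesseract appends ".txt"

; Basic CLI form: tesseract <image> <outputbase>
Local $sCmd = '"' & $sTesseractExe & '" "' & $sImagePath & '" "' & $sOutputBase & '"'

; Run hidden and wait until the process exits.
Local $iExitCode = RunWait($sCmd, @ScriptDir, @SW_HIDE)
If @error Then
    ConsoleWriteError("Could not start: " & $sTesseractExe & @CRLF)
    Exit 1
EndIf
If $iExitCode <> 0 Then
    ConsoleWriteError("Tesseract exited with code " & $iExitCode & @CRLF)
    Exit 2
EndIf

; Read the plain-text result that Tesseract wrote for the output base name.
Local $sText = FileRead($sOutputBase & ".txt")
If @error Then
    ConsoleWriteError("No OCR output file was produced." & @CRLF)
    Exit 3
EndIf
ConsoleWrite($sText & @CRLF)
```

The result is plain text only, so the rows and columns of a table would still have to be reconstructed in a separate step.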
KickStarter15 (Author) Posted May 24, 2018
@FrancescoDiMuro, Thanks, but I'm not the admin of this computer, so that application cannot be installed on my PC for checking. Is there any other way to try Tesseract? I just need somewhere to start on this one. Thanks!
aa2zz6 Posted May 24, 2018
The tesseract-ocr files that Francesco mentioned can be found on the GitHub site: https://github.com/UB-Mannheim/tesseract
If you're not able to download these files for whatever reason, you can refer to my post about sending a request to an OCR service and returning the results.
KickStarter15 (Author) Posted May 25, 2018
@aa2zz6, Thanks. I already tried your suggested post, but how can I make this site, "https://www.onlineocr.net/", work with the code you suggested there? Also, I'm not sure where to get the tesseract.exe referenced below after downloading Tesseract from this site: "https://github.com/UB-Mannheim/tesseract".
    Const $TesseractExePath = @ScriptDir & "\tesseract-master\tesseract.exe" ; not found on my computer
The file below is not found on my computer either.
    Const $NconvertExePath = @ScriptDir & "\Data\nconvert.exe" ; not found on my computer
Thanks in advance.
IAMK Posted May 25, 2018
I have read up on Tesseract before, but never used it myself. Tesseract will spit out text from a region you ask for. You will need not only Tesseract but also logic to find the cells (regions).
First, find the vertical lines, which will give you your columns. Then, find the horizontal lines, which will give you your column titles. Then, search column one for text followed by whitespace (the hardest part, since some tables could have two rows next to each other). Easy hack: draw lines between them. Then you can carry the Tesseract results for each cell into Excel or whatever you desire.
As for the bottom two sections of Picture 2, I don't know how you want to handle those.
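The line-finding step described above can be sketched in AutoIt as follows. This is only an illustration under stated assumptions: the image has already been loaded and binarized into a 2D array (1 = dark pixel, 0 = background), which is not shown here, and `_FindRulingLines` and its 0.9 fill threshold are hypothetical choices, not part of any library.

```autoit
; Sketch: locate table ruling lines in a binarized image held as a 2D array.
; _FindRulingLines is a hypothetical helper; loading the image is not shown.

; Returns a comma-separated list of row indices ($bScanRows = True) or column
; indices ($bScanRows = False) whose share of dark pixels is >= $fMinFill.
Func _FindRulingLines(Const ByRef $aPix, $bScanRows, $fMinFill = 0.9)
    Local $iRows = UBound($aPix, 1), $iCols = UBound($aPix, 2)
    Local $iOuter = $bScanRows ? $iRows : $iCols
    Local $iInner = $bScanRows ? $iCols : $iRows
    Local $sHits = ""
    For $i = 0 To $iOuter - 1
        Local $iDark = 0
        For $j = 0 To $iInner - 1
            $iDark += $bScanRows ? $aPix[$i][$j] : $aPix[$j][$i]
        Next
        If $iDark / $iInner >= $fMinFill Then $sHits &= $i & ","
    Next
    Return StringTrimRight($sHits, 1) ; drop the trailing comma
EndFunc

; Tiny 5x7 test "image": a 2x2 grid of cells with a border.
Local $aPix[5][7] = [ _
    [1, 1, 1, 1, 1, 1, 1], _
    [1, 0, 0, 1, 0, 0, 1], _
    [1, 1, 1, 1, 1, 1, 1], _
    [1, 0, 0, 1, 0, 0, 1], _
    [1, 1, 1, 1, 1, 1, 1]]

ConsoleWrite("Horizontal lines at rows: " & _FindRulingLines($aPix, True) & @CRLF)     ; 0,2,4
ConsoleWrite("Vertical lines at columns: " & _FindRulingLines($aPix, False) & @CRLF)   ; 0,3,6
```

Each cell then lies between two consecutive horizontal lines and two consecutive vertical lines, and that region is what would be cropped and handed to Tesseract. Real scans have thicker, slightly skewed, or broken lines, so the threshold and the grouping of adjacent hits would need tuning.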
KickStarter15 (Author) Posted May 25, 2018
Yeah @IAMK, it's so funny that these images were presented that way... wheeehhh. I don't know how to handle this issue anymore. I have been trying all the suggestions posted in this thread, but with no results yet. Maybe I need to dig deeper with Google for more information, and I'm still hoping that someone in this forum can help me with this problem. As they say, "Patience is a virtue". On the other hand, I'm still waiting for aa2zz6's response... I think he has the ideas I need for this.
aa2zz6 Posted May 25, 2018 (edited)
14 hours ago, KickStarter15 said: "Const $NconvertExePath = @ScriptDir & "\Data\nconvert.exe" ; not found in my computer. Thanks in advance."
You can download nconvert.exe here: https://www.xnview.com/en/nconvert/
I will also attach a folder with everything except the Tesseract files, because they are too big to attach here. You will need to place the downloaded tesseract folder in the Data folder. Follow how I have it set up in the images below. Make sure the tesseract folder name is the same as in the images, or change the path names in the AutoIt script. I look forward to seeing how you tackle this problem ^^
Attachment: ocr AutoIt projects.zip
Edited May 25, 2018 by aa2zz6
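Since the script depends on the folder names matching, a small pre-flight check can report which helper file is missing before any OCR is attempted. The sketch below reuses the two path constants quoted earlier in the thread as-is. Note that the setup described above puts the tesseract folder inside Data, so the first path is an assumption and may need adjusting.

```autoit
; Pre-flight check: confirm the helper executables exist before running OCR.
; The two paths are the ones quoted earlier in the thread; change them if
; your folder layout differs.
Local Const $TesseractExePath = @ScriptDir & "\tesseract-master\tesseract.exe"
Local Const $NconvertExePath = @ScriptDir & "\Data\nconvert.exe"

Local $aRequired[2] = [$TesseractExePath, $NconvertExePath]
Local $iMissing = 0

; Report every missing file, not just the first one.
For $sPath In $aRequired
    If Not FileExists($sPath) Then
        ConsoleWriteError("Missing: " & $sPath & @CRLF)
        $iMissing += 1
    EndIf
Next

If $iMissing > 0 Then Exit 1
ConsoleWrite("All required files found." & @CRLF)
```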
KickStarter15 (Author) Posted May 26, 2018
@aa2zz6, Thanks, man... Yup, I'm done downloading, except that "tesseract.exe" itself was not included in the download (see the attached screenshot of the extracted files). Can you attach "tesseract.exe" for me in this thread? All the folders and other files are already present, but not tesseract.exe.
aa2zz6 Posted May 26, 2018
5 hours ago, KickStarter15 said: "@aa2zz6, Thanks, man... Yup, I'm done downloading, except that "tesseract.exe" itself was not included in the download (see the attached screenshot of the extracted files). Can you attach "tesseract.exe" for me in this thread? All the folders and other files are already present, but not tesseract.exe."
Here you go.
Attachment: tesseract.exe
KickStarter15 (Author) Posted May 28, 2018 (edited)
@aa2zz6, I'm done testing, but when I ran it, a compatibility warning about 64-bit versus 32-bit popped up. I have a 32-bit OS on my computer. Does the tesseract.exe you attached support 32-bit? I also got an error about a missing .dll file when I continued running the code. Can you please advise on this?
P.S. I think my Tesseract download did not include any .dll files.
Edited May 28, 2018 by KickStarter15