Sign in to follow this  
Followers 0
Seaborgium

Automatic OCR

12 posts in this topic

Is there any script for OCR that allow it to be automatic by keep scanning a folder for input images and auto OCR the image and save it in another folder?

Share this post


Link to post
Share on other sites



There are probably scripts for scanning a folder, and certainly various scripts that do stuff with OCR packages (try searching on OCR or tesseract). You will still have much work to do e.g. download and install the OCR package, get the AutoIt OCR integration working, get a scanning loop going ...

My experience with tesseract (a free OCR) is that it is not very good, but then I'm mostly working with technical displays with names like TESTOA004Z, and no OCR is going to do a good job on that.

Share this post


Link to post
Share on other sites

MODI is part of msoffice, but it's not included in 2010, but it can still be downloaded somewhere (google it)...results are not that great, as well....

Local $miDoc = ObjCreate("MODI.Document")


IEbyXPATH-Grab IE DOM objects by XPATH IEscriptRecord-Makings of an IE script recorder ExcelFromXML-Create Excel docs without excel installed GetAllWindowControls-Output all control data on a given window.

Share this post


Link to post
Share on other sites

Can i use other OCR like FreeOCR and use cygwin to make it automatic?

Share this post


Link to post
Share on other sites

MODI is free, with SharePoint Designer 2007, which i think is also free: http://support.microsoft.com/kb/982760

Then you can call it in your script...just search MODI in the form, and you will get examples.


IEbyXPATH-Grab IE DOM objects by XPATH IEscriptRecord-Makings of an IE script recorder ExcelFromXML-Create Excel docs without excel installed GetAllWindowControls-Output all control data on a given window.

Share this post


Link to post
Share on other sites

MODI could only open Tiff file or mdi file but i want to OCR Jpg files

Share this post


Link to post
Share on other sites

save as a tiff :)...but i was able to get png and jpg to work, so not sure what error you are ref to.


IEbyXPATH-Grab IE DOM objects by XPATH IEscriptRecord-Makings of an IE script recorder ExcelFromXML-Create Excel docs without excel installed GetAllWindowControls-Output all control data on a given window.

Share this post


Link to post
Share on other sites

I used this person script however i do now have any knowledge on this script. I wan the output of the Ocr to save into Microsoft words. How do i edit the script to do that?

Share this post


Link to post
Share on other sites

Bump. I wan to paste the stored value in the array into microsoft word. how do i do that?

Share this post


Link to post
Share on other sites

The function u've listed retruns words in an array...throw that array into: _FileWriteFromArray


IEbyXPATH-Grab IE DOM objects by XPATH IEscriptRecord-Makings of an IE script recorder ExcelFromXML-Create Excel docs without excel installed GetAllWindowControls-Output all control data on a given window.

Share this post


Link to post
Share on other sites

Thanks for your help jdelaney that code worked.

Share this post


Link to post
Share on other sites

How do i match my ocr output with my postgresql database?

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!


Register a new account

Sign in

Already have an account? Sign in here.


Sign In Now
Sign in to follow this  
Followers 0