Jump to content

Optical character recognition GUI


Recommended Posts

to easily generate definition for OCR, there must be a gui program, this is what it should do.

Select parts of the screen with a click and drag function.

A control to Enlarge part of that screen capture to per pixel level, define areas wanted for definition for that enlargement,

tick off area to not have the definition generated,

what this means is that this OCR search function can also search for graphics, because

when you selection " a " , the white part within the a area is not what we consider to be "a"

the user will have to round up in a square with on the "a" inside and tick off the small squares not wanted to have its definition

genertated.

then it generate the definition !

For this to work, what existing OCR project should be used?

it must be able to generate definition for any graphics, and then definition file has to be valid for later improved OCR search program.

Link to comment
Share on other sites

Well you're making it way too complicated. You don't need to manually select each character or enlarge/manipulate any graphics.

Just lay out a table of the font: a-zA-Z0-9 and some punctuation. If there isn't spacing between the characters, you're going to have a hard time defining each.

"I thoroughly disapprove of duels. If a man should challenge me, I would take him kindly and forgivingly by the hand and lead him to a quiet place and kill him." - Mark TwainPatient: "It hurts when I do $var_"Doctor: "Don't do $var_" - Lar.
Link to comment
Share on other sites

I have already written an OCR that will scan an image for a letter and then write a file of its positions and then you can also verify it using another script I wrote. Some of this requires a bit of manual work... Sorry, not entirely automated. When you run the script the gui should explain it a bit. You NEED TO MAKE A FOLDER CALLED "Letters" IN THE SCRIPTS DIRECTORY. Don't have the picture of the letter zoomed in when you scan it. That will generate a character file with its positions where the first line is the number of the rest of the lines and the rest of the lines are an x value then a y value... so to demonstrate..

4

0 ;; Just a comment this is an x value

0 ;; this is a y value, comments like these cannot be in the character file though.

1

1

2

3

3

4

So that just records the positions in a file... Then you have another script (REQUIRES TO BE IN THE SAME DIRECTORY AS PREVIOUS SCRIPT TO HAVE ACCESS TO THE CHARACTER DATA FILES) that can paint the letter in MS paint if you have it open. You just press the draw function in MS Paint type in the letter you want drawn (it has to be scanned and in the letters folder first).. Once you have it drawn (or you can open up the picture from the original scanning) you can find the letter too. IT DOESN'T MENTION THIS IN THE GUI, YOU MUST PRESS CONTROL + C TO SET THE COLOR OF THE LETTER YOU ARE SEARCHING FOR! You can also set a value to step over some positions.. anything over 6 gives you false recognitions. Then just press Control + F to find the letter. I know.. a bit complicated but it works. The only REAL problem is the time it takes to scan.. if anyone can look at my code and speed up the recognition process I'd be VERY grateful.. Can't make a proper bot that takes so long to scan for words lol.

OCRtool.au3

PaintALetter.au3

Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
 Share

  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...