Max82 Posted November 29, 2008 Share Posted November 29, 2008 Someone can help me please? I am a novice in Autoit programming. I have to do for my teacher a little program that checks all the files inside a folder (about 10000 pdf files), for each one checks sequentially if it's an image file (a simple scanned image without text) or if it's a real pdf file with text inside it (OCR), and moves every image file (without text) in another folder for further processing (with an OCR program). Doing the same routine manually would be an endless task, so the teacher asks me to program this little software to do so. It's possible with Autoit? Maybe creating an array? And how could I check the presence of text inside each file? Any suggestions of anybody will be greatly appreciated. Max from Rome (Italy) Link to comment Share on other sites More sharing options...
Richard Robertson Posted November 29, 2008 Share Posted November 29, 2008 There are a number of ways you could do this. You could open the PDF then run some OCR on it and see if you get any response, or you could try "selecting all" text in the PDF document and copying it. If you get some text you'll know it has some. You could also spend a month or so studying the PDF structure. This would allow you to read it in a binary mode and detect whether there is text or not. Also, wrong forum. Link to comment Share on other sites More sharing options...
Developers Jos Posted November 29, 2008 Developers Share Posted November 29, 2008 please don't double post questions. SciTE4AutoIt3 Full installer Download page - Beta files Read before posting How to post scriptsource Forum etiquette Forum Rules Live for the present, Dream of the future, Learn from the past. Link to comment Share on other sites More sharing options...
Recommended Posts