Installing new Tesseract plugin on 64 bit?


    After installing Tesseract with this installer https://digi.bib.uni-mannheim.de/tes...-ga5ff320e.exe (linked from https://github.com/UB-Mannheim/tesseract/wiki) and installing the 64-bit plugins download for IrfanView 4.67 64-bit, I still get:

    Can't load Plugin: "OCR.DLL"

    In the Help... Installed Plugins OCR.DLL is not listed.

    Can anyone advise on how to get the new Tesseract OCR plugin working?

    (Also, is there any chance that Batch mode might ever work with OCR?)
    Last edited by Bhikkhu Pesala; 28.05.2024, 09:41 PM. Reason: Fixed typos

    #2
    I haven’t used OCR for years. What does your dialog look like in Settings, Plugins, OCR Options?


      #3
      I use batch mode with drag and drop onto a shortcut to do OCR with Tesseract. I use Tesseract from Cygwin, but the Windows version should work. Try:

      rem --psm
      rem Page segmentation modes:
      rem 0 Orientation and script detection (OSD) only.
      rem 1 Automatic page segmentation with OSD.
      rem 2 Automatic page segmentation, but no OSD, or OCR. (not implemented)
      rem 3 Fully automatic page segmentation, but no OSD. (Default)
      rem 4 Assume a single column of text of variable sizes.
      rem 5 Assume a single uniform block of vertically aligned text.
      rem 6 Assume a single uniform block of text.
      rem 7 Treat the image as a single text line.
      rem 8 Treat the image as a single word.
      rem 9 Treat the image as a single word in a circle.
      rem 10 Treat the image as a single character.
      rem 11 Sparse text. Find as much text as possible in no particular order.
      rem 12 Sparse text with OSD.
      rem 13 Raw line. Treat the image as a single text line,
      rem bypassing hacks that are Tesseract-specific.
      rem

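      rem Switch to the drive and folder of the dropped file (quoted in case the path contains spaces)
      cd /d "%~dp1"
      tesseract "%~nx1" OCR_out --psm 6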
      start "" OCR_out.txt

      Put the above in a batch file such as OCR.cmd.

      You could then add the batch file to the "Open with external editor" list. See:



      Last edited by IrfanUserGuyxyx; 29.05.2024, 07:55 PM.



        #4
        Hello there, fellow IrfanView lovers.

        I just downloaded Tesseract 5.5.0 from https://github.com/UB-Mannheim/tesseract/wiki and it worked like a charm in IrfanView. However, I needed to OCR 600 images, so I used the command line and ran:

        (from the Tesseract installation folder; for me: C:\Program Files\Tesseract-OCR)

        for %n in ("c:\full\path\to\images\*.jpg") do tesseract.exe "%n" "%n" -l eng

        A few seconds later, a set of IMAGE.JPG.TXT files was created!

        Note: the quotation marks (") must be plain straight quotes, not curly or angled ones.
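
For anyone who would rather script this than type the loop at the prompt, here is a minimal Python sketch of the same idea. It is only an illustration and not something from the posts in this thread: the helper names `build_ocr_command` and `ocr_folder` and the `dry_run` switch are made up for this example, and it assumes `tesseract` can be found on the PATH (otherwise pass the full path to tesseract.exe as `exe`). It uses only the `-l` and `--psm` options already shown in this thread.

```python
import subprocess
from pathlib import Path


def build_ocr_command(image, lang="eng", psm=None, exe="tesseract"):
    """Return the argument list for one tesseract run on `image` (hypothetical helper)."""
    image = Path(image)
    # Using the image path itself as the output base mirrors the loop above:
    # tesseract appends ".txt", so IMAGE.JPG becomes IMAGE.JPG.txt.
    cmd = [exe, str(image), str(image), "-l", lang]
    if psm is not None:
        # Optional page segmentation mode, e.g. 6 for a single uniform block of text.
        cmd += ["--psm", str(psm)]
    return cmd


def ocr_folder(folder, pattern="*.jpg", dry_run=False, **options):
    """Build (and, unless dry_run is set, run) one command per matching file."""
    images = sorted(Path(folder).glob(pattern))
    commands = [build_ocr_command(p, **options) for p in images]
    if not dry_run:
        for cmd in commands:
            # check=True stops at the first file tesseract fails on.
            subprocess.run(cmd, check=True)
    return commands
```

Calling `ocr_folder(r"c:\full\path\to\images", psm=6)` would then process every .jpg in that folder (the path is the placeholder from the post above). With `dry_run=True` it only returns the commands, which is handy for checking them before starting a run of 600 images.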



          #5
          Wow, works great, thanks!
