Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Optical Character Recognition (OCR) With TESS4J

DZone's Guide to

Optical Character Recognition (OCR) With TESS4J

Sometimes you just need character recognition. Enter Tess4j. Here's how to implement optical character recognition for images and documents.

· Web Dev Zone
Free Resource

Start coding today to experience the powerful engine that drives data application’s development, brought to you in partnership with Qlik.

Tess4j is a JNA-based wrapper for Tesseract OCR DLL, the library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

How To Run The Sample

Step 1 :Download the Maven  project from here

Step 2 : Run the Example

Add VM Argument

64 bit

-Djna.library.path=${workspace_loc:/ocr-tess4j-example}/dlls/x64

32 bit

-Djna.library.path=${workspace_loc:/ocr-tess4j-example}/dlls/x86

ocr6

  Step 3  : Output

ocr4ocr5



Create data driven applications in Qlik’s free and easy to use coding environment, brought to you in partnership with Qlik.

Topics:
.net ,java ,web dev ,tess4j

Published at DZone with permission of Mohammad Nadeem, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

THE DZONE NEWSLETTER

Dev Resources & Solutions Straight to Your Inbox

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

X

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}