Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Optical Character Recognition (OCR) With TESS4J

DZone's Guide to

Optical Character Recognition (OCR) With TESS4J

Sometimes you just need character recognition. Enter Tess4j. Here's how to implement optical character recognition for images and documents.

· Web Dev Zone
Free Resource

Get deep insight into Node.js applications with real-time metrics, CPU profiling, and heap snapshots with N|Solid from NodeSource. Learn more.

Tess4j is a JNA-based wrapper for Tesseract OCR DLL, the library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

How To Run The Sample

Step 1 :Download the Maven  project from here

Step 2 : Run the Example

Add VM Argument

64 bit

-Djna.library.path=${workspace_loc:/ocr-tess4j-example}/dlls/x64

32 bit

-Djna.library.path=${workspace_loc:/ocr-tess4j-example}/dlls/x86

ocr6

  Step 3  : Output

ocr4ocr5



Node.js application metrics sent directly to any statsd-compliant system. Get N|Solid

Topics:
.net ,java ,web dev ,tess4j

Published at DZone with permission of Mohammad Nadeem, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}