ddeboer / tesseract
A wrapper for the Tesseract OCR engine
This package is auto-updated.
Last update: 2024-11-06 09:20:54 UTC
README
Tesseract: a wrapper for the Tesseract OCR engine
A small PHP >=5.3 library that makes working with the open source Tesseract OCR engine easier.
Installation
You need a working Tesseract installation. For more information about installation and adding language support, see Tesseract’s README.
Then install this library, which is available on Packagist, through Composer:
$ composer require ddeboer/tesseract:1.0
Usage
If the tesseract binary is in your PATH, just do:
use Ddeboer\Tesseract\Tesseract;
$tesseract = new Tesseract();
Otherwise, construct Tesseract with the path to the binary:
$tesseract = new Tesseract('/usr/local/bin/tesseract');
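If the binary location differs between machines, one option is to probe a few likely paths before constructing the wrapper. This is only a sketch: firstExecutable() is a hypothetical helper, not part of the library, and the candidate paths are assumptions to adjust for your system.

```php
<?php

/**
 * Hypothetical helper: returns the first candidate path that points to
 * an executable file, or null when none of them does.
 */
function firstExecutable(array $candidates)
{
    foreach ($candidates as $candidate) {
        if (is_executable($candidate)) {
            return $candidate;
        }
    }

    return null;
}

// Assumed install locations; replace with the ones relevant to you.
$path = firstExecutable(array('/usr/local/bin/tesseract', '/usr/bin/tesseract'));

// Only runs when the library has been installed and autoloaded via Composer.
if ($path !== null && class_exists('Ddeboer\Tesseract\Tesseract')) {
    $tesseract = new Ddeboer\Tesseract\Tesseract($path);
}
```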
Get version and supported languages information:
$version = $tesseract->getVersion();
$languages = $tesseract->getSupportedLanguages();
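The language list can be used to confirm that the language data you need is installed before running OCR. A minimal sketch, assuming getSupportedLanguages() returns a flat array of language codes such as eng or nld; missingLanguages() is a hypothetical helper, not part of the library.

```php
<?php

/**
 * Hypothetical helper: returns the required language codes that do not
 * appear in the list of supported ones.
 */
function missingLanguages(array $required, array $supported)
{
    return array_values(array_diff($required, $supported));
}

// Only runs when the library has been installed and autoloaded via Composer.
if (class_exists('Ddeboer\Tesseract\Tesseract')) {
    $tesseract = new Ddeboer\Tesseract\Tesseract();

    // Assumption: getSupportedLanguages() returns an array of codes.
    $missing = missingLanguages(array('nld', 'eng'), $tesseract->getSupportedLanguages());

    if ($missing) {
        echo 'Missing language data: ' . implode(', ', $missing) . PHP_EOL;
    }
}
```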
Perform OCR on an image file:
$text = $tesseract->recognize('myfile.tif');
Optionally, specify the language(s) as second argument:
$text = $tesseract->recognize('myfile.tif', array('nld', 'eng'));
And specify Tesseract’s page segmentation mode as third argument:
$text = $tesseract->recognize('myfile.tif', null, Tesseract::PAGE_SEG_MODE_AUTOMATIC_OSD);
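The calls above can be combined into a small guarded wrapper. This is a sketch rather than the library's recommended usage: recognizeFile() is a hypothetical function, it assumes recognize() returns the recognized text as a string, and it accepts any object with a compatible recognize() method so that it can be exercised without Tesseract installed.

```php
<?php

/**
 * Hypothetical wrapper around recognize(): fails early on unreadable
 * input and trims surrounding whitespace from the result.
 *
 * $ocr is any object exposing recognize($file, $languages, $pageSegMode),
 * such as a Ddeboer\Tesseract\Tesseract instance.
 */
function recognizeFile($ocr, $file, $languages = null, $pageSegMode = null)
{
    if (!is_readable($file)) {
        throw new InvalidArgumentException("Cannot read image file: $file");
    }

    // Pass the third argument only when a mode was given, so the
    // wrapped object's own default applies otherwise.
    if ($pageSegMode === null) {
        $text = $ocr->recognize($file, $languages);
    } else {
        $text = $ocr->recognize($file, $languages, $pageSegMode);
    }

    // Assumption: recognize() returns a string.
    return trim($text);
}
```

With the library installed, recognizeFile(new Tesseract(), 'myfile.tif', array('nld', 'eng'), Tesseract::PAGE_SEG_MODE_AUTOMATIC_OSD) would correspond to the last call shown above, with the readability check added in front.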