Install Tesseract-OCR on CentOS 6

I try to install Tesseract-OCR on my server, however, when I install everything that in my opinion is the correct repo. When I try to install it, the package was not found.

I tried adding rpmforge, but to no avail. Any ideas from someone who has done this before or are familiar with adding and searching through repositories?

+9
install ocr tesseract centos
source share
7 answers

I used these instructions that worked correctly in Centos

Install Tesseract OCR libs from sources in Centos

Download the Leptonic and Teseract sources:

$ wget http://www.leptonica.org/source/leptonica-1.69.tar.gz $ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz 

Configure, compile, install libraries:

  $ tar xzvf leptonica-1.69.tar.gz $ cd leptonica-1.69 $ ./configure $ make $ sudo make install $ tar xzf tesseract-ocr-3.02.02.tar.gz $ cd tesseract-3.01 $ ./autogen.sh $ ./configure $ make $ sudo make install $ sudo ldconfig 

Download the languages ​​(English) and copy to the tessdata folder:

 $ wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz $ tar xzf tesseract-ocr-3.02.eng.tar.gz $ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata 

and enjoy it;)

+8
source share

I recommend trying installing with rpm here: http://pkgs.org/download/tesseract There are also several dependencies: libpng-devel, libjpeg-devel, libtiff-devel, zlib and leptonica. The last 2 can also be found on the RPM website

+5
source share

This worked for me:

 /usr/bin/yum --enablerepo epel-testing install tesseract.x86_64 tesseract-langpack-fra.noarch 

tesseract is not in the epel repository, but in epel-testing repo witch is not activated by default.

+4
source share

I wrote a bash script to install Tesseract 3.05 on Centos 7. This allows you to install and install all the dependencies, and also installs language files for English, Hindi, Bengali and Thai.

Code is available on GitHub.

https://github.com/EisenVault/install-tesseract-redhat-centos

Hope this helps.

+4
source share

Install Tesseract OCR libs from sources (UPDATED as July 14, 2018)

Download the Leptonic and Teseract sources:

$ wget http://www.leptonica.com/source/leptonica-1.76.0.tar.gz

$ wget https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.02.tar.gz

Configure, compile, install Leptonica:

 $ tar xzvf leptonica-1.76.0.tar.gz $ cd leptonica-1.76.0 $ ./configure & make & sudo make install 

Configure, compile, install Tesseract:

 $ tar xzf tesseract-ocr-3.02.02.tar.gz $ cd tesseract-ocr $ ./autogen.sh & ./configure & make & sudo make install & sudo ldconfig 

Download language file:

Here I upload a file in English (eng.traineddata). Here you can view a complete list of language files and download to suit your needs. https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-302

Download the languages ​​(English) and copy to the tessdata folder:

 $ wget https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.eng.tar.gz $ tar xzf tesseract-ocr-3.02.eng.tar.gz $ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata 

Your Tesseract OCR is now installed and ready to use! Example:

 $tesseract /path/to/input/test.jpg /path/to/output/abc.txt -l eng 

Enjoy!!!

+1
source share

enter image description here

yum install --nogpgcheck tesseract

after installation for testing, enter the following command: tesseract --version

0
source share

I switched to Ubuntu. Tesseract is perfectly supported and installed without any problems.

 apt-get install tesseract-ocr 
-4
source share

All Articles