How to build tesseract 4 beta on macOS
After reposting this article I found the official documentation. I recommend following it instead, since it is more thorough: https://github.com/tesseract-ocr/tesseract/wiki/Compiling
brew info tesseract
The recognition results for Chinese - Simplified are a little bit terrifying.
I noticed that a new neural network engine based on LSTMs was added starting with 4.0.0.
But it needs to be built from source on macOS.
Thankfully, the manual in their README.md is quite specific.
Install dependencies
brew install automake autoconf autoconf-archive libtool
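Before compiling, it can help to confirm that the build tools actually ended up on the PATH. The snippet below is only a minimal sketch: `missing_tools` is a helper name made up for this example, and it assumes Homebrew's usual behavior of installing libtool as `glibtoolize` so that it does not clash with Apple's own `libtool`. autoconf-archive only ships macros, so there is no executable to check for it.

```shell
# missing_tools is a hypothetical helper for this example, not part of Homebrew.
# It prints every named command that cannot be found on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Assumption: Homebrew exposes libtool's libtoolize as "glibtoolize".
# Any name printed here still needs to be installed.
missing_tools automake autoconf glibtoolize
```

If nothing is printed, all three tools were found.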
Compile
git clone https://github.com/tesseract-ocr/tesseract/
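Only the clone step is shown above. What usually follows for an autotools project like this one is the sequence sketched below. Treat it as an assumption rather than the exact commands used here: `run` and `build_tesseract` are names invented for this example, no macOS-specific configure flags are included, and `./configure` may still complain about missing libraries such as Leptonica. By default the script only prints the commands. Run it from inside the cloned `tesseract` directory with `RUN=1` to execute them.

```shell
# run is a hypothetical helper: it prints the command unless RUN=1 is set.
run() {
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  else
    printf '%s\n' "$*"
  fi
}

# Assumed standard autotools sequence; stops at the first failing step.
build_tesseract() {
  run ./autogen.sh &&      # generate the configure script
  run ./configure &&       # check dependencies and write the Makefiles
  run make &&              # compile
  run sudo make install    # install under the configured prefix
}

build_tesseract
```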
From their best trained models, download the language file chi_sim.traineddata and put it under tesseract/4.0.0.1/tessdata/
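A quick way to check that the model file landed in the right place is sketched below. `has_lang` and `TESSDATA_DIR` are names made up for this example, and `./tessdata` is only a placeholder default, so point it at your own tessdata directory.

```shell
# has_lang is a hypothetical helper: it succeeds if <lang>.traineddata
# exists in the given tessdata directory.
has_lang() {
  [ -f "$1/$2.traineddata" ]
}

# TESSDATA_DIR is a placeholder; set it to your real tessdata directory.
TESSDATA_DIR="${TESSDATA_DIR:-./tessdata}"

if has_lang "$TESSDATA_DIR" chi_sim; then
  echo "chi_sim model found in $TESSDATA_DIR"
else
  echo "chi_sim.traineddata is missing from $TESSDATA_DIR"
fi
```

Once tesseract itself is installed, `tesseract --list-langs` should also list chi_sim if the file is where the program looks for it.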
Usage
tesseract image.png image -l chi_sim
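The second argument is the output base name, and tesseract appends .txt to it, so the command above writes image.txt. To run the same thing over a folder of PNG files, something like the following sketch could work. `out_base` and `ocr_all` are names invented for this example, and the loop only runs if tesseract is actually on the PATH.

```shell
# out_base is a hypothetical helper: it strips the last extension,
# e.g. image.png -> image, which tesseract uses as the output base name.
out_base() {
  printf '%s\n' "${1%.*}"
}

# ocr_all is a hypothetical helper: it runs tesseract with the chi_sim
# model on every existing file passed to it.
ocr_all() {
  for img in "$@"; do
    [ -f "$img" ] || continue   # skip an unmatched glob or missing file
    tesseract "$img" "$(out_base "$img")" -l chi_sim
  done
}

# Only attempt OCR when tesseract is installed.
if command -v tesseract >/dev/null 2>&1; then
  ocr_all ./*.png
fi
```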
OK, it is still terrible with the Song typeface.
I will need to train a new model myself.
Reposted from: http://artwalk.github.io/2018/05/06/How-to-build-tesseract-4-beta-on-macOS/