Question

我刚刚安装了Tesseract OCR，在运行命令$ tesseract --list-langs后，输出只显示了两种语言，eng和osd。我的问题是，如何加载另一种语言，特别是日语中的语言？

Answer 1

我通过从https://github.com/tesseract-ocr/tessdata抓取训练数据并将其放在与其他训练数据相同的目录中，即eng.traineddata并通过语言标志-l LANG来学习能够阅读您指定的语言，在以下示例中，日语：tesseract -l jpn sample-jpn.png output-jpn。

Answer 2

这对我有用：

sudo apt-get install tesseract-ocr-jpn

希望this will help。

Tesseract OCR加载语言 - 日语

3 个答案: