Question

Tesseract无法读取多维数据集语言模型。 tesseract 1.png output.txt -l hin 执行上述命令后，发生以下错误。

Cube ERROR (CubeRecoContext::Load): unable to read cube language model params from /usr/share/tesseract-ocr/tessdata/hin.cube.lm
Cube ERROR (CubeRecoContext::Create): unable to init CubeRecoContext object
init_cube_objects(false, &tessdata_manager):Error:Assert failed:in file tessedit.cpp, line 207
Segmentation fault

我获取 hin.cube.lm 文件以及如何处理该文件？

Answer 1

我通过安装以下文件的正确版本来修复此错误：

hin.cube.bigrams
hin.cube.fold
hin.cube.lm
hin.cube.nn
hin.cube.params
hin.cube.word-FREQ
hin.tesseract_cube.nn

以及印地语和英语培训数据的正确版本。

以上所有文件均可在以下位置获得： https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-304305

我将这些文件放在：/ usr / local / share / tessdata

下

这是在CentOS 7.2上

无法在tesseract

1 个答案: