Question

我尝试将tesseract用于图片的OCR，并且我想禁用tesseract正在扫描的页面的某些详细输出：

:~$ tesseract stdin stdout -l eng txt
Page 1
<ocr output>

是否可以从输出中删除“页面1”？

:~$ tesseract --version
tesseract 4.0.0-146-gc39a

Answer 1

在命令末尾尝试quiet选项。

Answer 2

如果您只想查看OCR文本，则只需将stderr重定向为null。

foo | tesseract - - 2>/dev/null

或者，当然，如果需要，也可以保存到日志文件。