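I am trying to deploy an OCR application that uses Tesseract on Heroku. Fetching the .traineddata files with wget caused errors, so I decided to compress them into a .tar.gz archive and upload it to the custom buildpack that the app is built from. However, only the default eng.traineddata file is present at runtime; the other .traineddata files are somehow being deleted, even though the build log shows that they were extracted successfully. The build log is pasted below: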
Counting objects: 1, done.
Writing objects: 100% (1/1), 183 bytes | 0 bytes/s, done.
Total 1 (delta 0), reused 0 (delta 0)
remote: Compressing source files... done.
remote: Building source:
remote:
remote: -----> Python app detected
remote: Using supported version of Python 3.6 (python-3.6.6)
remote: -----> Installing requirements with pip
remote:
remote: -----> detect app detected
remote: Unpacking Tesseract-OCR binaries
remote: ./
remote: ./bin/
remote: ./bin/tesseract
remote: ./share/
remote: ./share/tessdata/
remote: ./share/tessdata/eng.traineddata
remote: ./lib/
remote: ./lib/libtesseract.so.4
remote: ./lib/liblept.so.5
remote: ./lib/libpng12.so.0
remote: ./lib/libgif.so.7
remote: Getting Tesseract-OCR training data
remote: eng.traineddata
remote: ben.traineddata
remote: eng training data
remote: ben training data
remote: traineddata files loaded
remote: ben.traineddata
remote: eng.traineddata
remote: Building runtime environment for Tesseract-OCR
remote: -----> Discovering process types
remote: Procfile declares types -> web
remote:
remote: -----> Compressing...
remote: Done: 73.7M
remote: -----> Launching...
remote: Released v24
remote: https://brocr.herokuapp.com/ deployed to Heroku
remote:
remote: Verifying deploy... done.
To https://git.heroku.com/brocr.git
c7866b8..961e948 master -> master
The following is the output of the ls -as command in the $TESSDATA_PREFIX directory:
Here is the compile script (for the custom buildpack). Am I missing something?
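
To cross-check the listing from inside the running app rather than from a shell, a minimal Python sketch along these lines could be used. The helper name list_traineddata is hypothetical, and the script assumes TESSDATA_PREFIX is set in the dyno's environment. Because some Tesseract versions expect that variable to point at the tessdata directory itself and others at its parent, the sketch checks both locations.

```python
import os
from pathlib import Path


def list_traineddata(tessdata_dir):
    """Return the sorted names of .traineddata files found in tessdata_dir.

    Hypothetical helper for debugging; returns an empty list when the
    directory does not exist.
    """
    root = Path(tessdata_dir)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.glob("*.traineddata"))


if __name__ == "__main__":
    prefix = os.environ.get("TESSDATA_PREFIX", "")
    if not prefix:
        print("TESSDATA_PREFIX is not set")
    else:
        # Assumption: the data may live in the prefix itself or in a
        # "tessdata" subdirectory, depending on the Tesseract version.
        for candidate in (prefix, os.path.join(prefix, "tessdata")):
            print(candidate, list_traineddata(candidate))
```

Running this once in a one-off dyno and once from the web process would show whether both see the same set of language files.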