GetComponentImages示例

Pix *image = pixRead("/usr/src/tesseract/testing/phototest.tif");
  tesseract::TessBaseAPI *api = new tesseract::TessBaseAPI();
  api->Init(NULL, "eng");
  api->SetImage(image);
  Boxa* boxes = api->GetComponentImages(tesseract::RIL_TEXTLINE, true, NULL, NULL);
  printf("Found %d textline image components.\n", boxes->n);
  for (int i = 0; i < boxes->n; i++) {
    BOX* box = boxaGetBox(boxes, i, L_CLONE);
    api->SetRectangle(box->x, box->y, box->w, box->h);
    char* ocrResult = api->GetUTF8Text();
    int conf = api->MeanTextConf();
    fprintf(stdout, "Box[%d]: x=%d, y=%d, w=%d, h=%d, confidence: %d, text: %s",
                    i, box->x, box->y, box->w, box->h, conf, ocrResult);
  }

包含fprintf()以打印信箱和信心信息。

希望得到这个帮助。

修改

要从CLI获取置信度（conf）值以及边界框（{{1}}，left，top，width），请设置{{ 1}}输出为height格式。以下是输出文件名为tesseract的示例命令。

tsv

以下是Excel查看的test.tsv输出文件。

您可以参考此tesseract wiki了解详情。

Tesseract命令行界面：获得每个角色的识别置信度

1 个答案:

GetComponentImages示例