Question

我使用Google Vision API OCR（文档文本检测）从扫描的文档（base64字符串）中获取文本。它仅适用于一张图像。但是我如何发送多个图像，例如文档的第二页。

我尝试合并base64字符串，但是它不起作用。

@NgModule({
     providers: [tokenInterceptorProvider]
})

Answer 1

Cloud Vision API具有方法files.asyncBatchAnnotate。这样就可以在同一请求中发送一堆文件。要添加单个文件，请使用async file annotation个请求。在批处理请求中包含两个图像的示例如下：

{
  "requests":[
    {
      "inputConfig": {
        "gcsSource": {
          "uri": "gs://<your bucket name>/image1.jpg"
        },
        "mimeType": "image/jpg"
      },
      "features": [
        {
          "type": "DOCUMENT_TEXT_DETECTION"
        }
      ],
      "outputConfig": {
        "gcsDestination": {
          "uri": "gs://<your bucket name>/output/"
        }
      }
    },
    {
      "inputConfig": {
        "gcsSource": {
          "uri": "gs://<your bucket name>/image2.jpg"
        },
        "mimeType": "image/jpg"
      },
      "features": [
        {
          "type": "DOCUMENT_TEXT_DETECTION"
        }
      ],
      "outputConfig": {
        "gcsDestination": {
          "uri": "gs://<your bucket name>/output/"
        }
      }
    }
  ]
}

如果您专门使用pdf文件，我发现this post解释了如何同时使用asyncBatchAnnotate发送请求。

Google Vision API文档以base64字符串文本多个图像

1 个答案: