Question

我正在关注本教程：https://cloud.ibm.com/docs/services/visual-recognition?topic=visual-recognition-tutorial-recognize-text&locale=en-US#pr-ximos-passos

我的目标是阅读文档并制成目录。内容的类型为KEY-VALUE，例如“ VALUE 10.00” 。我可以提取图像的文本，但不能提取数字。

将问题具体化：

我正在使用这张图片

必须提取的值：

DATA                        13/06/2016
AGENCIA/CASH                0180/2009
VALOR DEPOSITO EM DINHEIRO  50.00

但是当我使用对Visual Recognition服务的Follow curl调用时：

curl -u "apikey:{API_KEY}" --form "images_file=@teste1.png" "https://gateway.watsonplatform.net/visual-recognition/api/v3/recognize_text?version=2018-03-19" -k

结果（一张）：

        "text": "data gigolo hora\nman/em 251\nnumero envelope 689 574\nvalor depusitd eh 4\ncpf no defusnantez 614 220\ndata lananzmnz",
        "words": [
            {
                "word": "data",
                "location": {
                    "height": 18,
                    "width": 40,
                    "left": 13,
                    "top": 10
                },
                "score": 0.6098,
                "line_number": 0
            },
            {
                "word": "gigolo",
                "location": {
                    "height": 43,
                    "width": 57,
                    "left": 146,
                    "top": 0
                },
                "score": 0.4283,
                "line_number": 0
            },
            {
                "word": "hora",
                "location": {
                    "height": 18,
                    "width": 39,
                    "left": 249,
                    "top": 11
                },
                "score": 0.6533,
                "line_number": 0
            },
            {
                "word": "man/em",
                "location": {
                    "height": 17,
                    "width": 72,
                    "left": 127,
                    "top": 35
                },
                "score": 0.8187,
                "line_number": 1
            },
            {
                "word": "251",
                "location": {
                    "height": 21,
                    "width": 30,
                    "left": 294,
                    "top": 33
                },
                "score": 0.9881,
                "line_number": 1
            },
            {
                "word": "numero",
                "location": {
                    "height": 21,
                    "width": 54,
                    "left": 12,
                    "top": 52
                },
                "score": 0.9116,
                "line_number": 2
            },

请注意，某些单词可以很好地提取，但数字不能，我的主要目标是提取货币值和日期。

要创建我的表，我可以使用"height"属性来知道哪个是您各自的数值。

那么，我如何提取数字？

PS .：这是葡萄牙语（BR）文档。

Answer 1

感谢您对这项服务的关注...但是，正如今天所提供的那样，此Beta服务主要是接受英语词典的培训。尽管它可以识别短的数字字符串，但是在读取任意数字（例如价格，序列号或车牌）之类的任务上效果不佳。另外，可能不会找到巴西葡萄牙语单词。

如何使用IBMCloud Visual Recognition Text提取图像编号？

1 个答案: