Question

我已经使用Forms Recognizer几天了，但无法识别表单中的键。

我想用它来提取学生在测试中给出的答案... here是一个例子。

我无法更改学生填写的工作表的结构，因为这是一次国家考试，而且我无法访问组织它的人。

因此，我按照Microsoft文档中的建议训练了一个模型，并使用该模型“读取”了表单，并获得了大多数答案，但这全都是键“ 令牌”的值< / p>

{
    "key": [
      {
        "text": "__Tokens__",
        "boundingBox": [
          0,
          0,
          0,
          0,
          0,
          0,
          0,
          0
        ]
      }
    ],
    "value": [
      {
        "text": "01",
        "boundingBox": [
          110.1,
          826.6,
          125.6,
          826.6,
          125.6,
          816.8,
          110.1,
          816.8
        ],
        "confidence": 1
      },
      {
        "text": "A",
        "boundingBox": [
          148.2,
          834.4,
          160.6,
          834.4,
          160.6,
          816.8,
          148.2,
          816.8
        ],
        "confidence": 1
      },
      {
        "text": "26",
        "boundingBox": [
          229.4,
          828.6,
          246,
          828.6,
          246,
          816.8,
          229.4,
          816.8
        ],
        "confidence": 1
      },
      {
        "text": "B",
        "boundingBox": [
          268.6,
          834.4,
          277.8,
          834.4,
          277.8,
          816.8,
          268.6,
          816.8
        ],
        "confidence": 1
      }

然后我在excel上重新创建了结构，但在数字后添加了：并训练了另一个模型。我还打印了一些副本，并进行了测试，然后Form Recognizer将数字理解为键。

{
      "key": [
        {
          "text": "01:",
          "boundingBox": [
            270.4,
            1625.4,
            313,
            1625.4,
            313,
            1600.5,
            270.4,
            1600.5
          ]
        }
      ],
      "value": [
        {
          "text": "A",
          "boundingBox": [
            350.7,
            1620.9,
            368.8,
            1620.9,
            368.8,
            1587,
            350.7,
            1587
          ],
          "confidence": 1
        }
      ]
    },
    {
      "key": [
        {
          "text": "26:",
          "boundingBox": [
            520.2,
            1624.2,
            552.8,
            1624.2,
            552.8,
            1600.5,
            520.2,
            1600.5
          ]
        }
      ],
      "value": [
        {
          "text": "E",
          "boundingBox": [
            604.6,
            1618.8,
            625.8,
            1618.8,
            625.8,
            1587,
            604.6,
            1587
          ],
          "confidence": 1
        }
      ]
    }

有人知道不带：来将数字字段识别为键的某种方法吗？

Answer 1

Form Recognizer不会将行号视为键，除非专门将其标记为键，因此当前不会将其发现为键。

没有：： Forms Recognizer无法识别字段

1 个答案: