Question

我正在尝试使用Google OCR从图像中提取文本，但是这里的问题是，如果文本是其他语言（在我的情况下为印度语），内容会自动替换为英语中的一些随机文本。

这是我正在尝试的代码。

const vision = require('@google-cloud/vision');
const fs = require('fs');
function exportDetections(fileName, callback) {
    console.log(fileName);
    const client = new vision.ImageAnnotatorClient({
        keyFilename: myKeyJson,

    });

    const request = {
        image: {
            content: fs.readFileSync(fileName),
        },
        "imageContext": {
            "languageHints": [
                "en"
            ]
        }
    };
    client
        .documentTextDetection(request)
        .then(results => {
            const fullTextAnnotation = results[0].fullTextAnnotation;
            console.log(`Full text: ${fullTextAnnotation.text}`);
        })
        .catch(err => {
            console.error('ERROR:', err);
        });
}

exportDetections('./temp/Temp.png');

这是我的ImageFile

最后这是我收到的输出。

./temp/Temp.png
Full text: 311901 31TER shaich / Your Aadhaar No. :
2094 7051 9541
3TET - 30TH HIGH IT HITTAR
X-
HR HR
Government of India
THIET G
Hanumana Ji
ru fata / DOB: 01/01/1959
967/ Male
ORXAN

但是我的预期输出是

./temp/Temp.png
Full text: Your Aadhaar No. :
2094 7051 9541
Government of India
Hanumana Ji
DOB: 01/01/1959
Male

请让我知道我要去哪里哪里以及如何解决此问题。

谢谢

Google Vision OCR替换英语以外的其他内容

0 个答案: