Question

我是OpenCV的新手，我想要修改一个有歪斜文字的图像。首先我用GrayScale读取图像并将其二值化，然后我尝试this：

import cv2
import numpy as np

img = cv2.imread('m20.jpg',0)
ret,byw = cv2.threshold(img,127,255,cv2.THRESH_BINARY_INV)
_, contours, hierarchy = cv2.findContours(byw.copy(), cv2.RETR_TREE,cv2.CHAIN_APPROX_SIMPLE)
cnt = contours[0]
draw = cv2.cvtColor(byw, cv2.COLOR_GRAY2BGR)
rect = cv2.minAreaRect(cnt)
box = cv2.boxPoints(rect)
box = np.int0(box)
cv2.drawContours(draw, [box], 0, (0, 255, 0), 2)

但是因为findContours（）期望接收具有体形的图像，所以不起作用。我尝试的其他方式是翻译这个c ++代码：

// Read image
Mat3b img = imread("path_to_image");

// Binarize image. Text is white, background is black
Mat1b bin;
cvtColor(img, bin, COLOR_BGR2GRAY);

bin = bin < 200;

// Find all white pixels
vector<Point> pts;
findNonZero(bin, pts);

// Get rotated rect of white pixels
RotatedRect box = minAreaRect(pts);
if (box.size.width > box.size.height)
{
    swap(box.size.width, box.size.height);
    box.angle += 90.f;
}

Point2f vertices[4];
box.points(vertices);

for (int i = 0; i < 4; ++i)
{
    line(img, vertices[i], vertices[(i + 1) % 4], Scalar(0, 255, 0));
}

// Rotate the image according to the found angle
Mat1b rotated;
Mat M = getRotationMatrix2D(box.center, box.angle, 1.0);
warpAffine(bin, rotated, M, bin.size());

我有这个：

draw = cv2.cvtColor(byw, cv2.COLOR_GRAY2BGR)
data = np.array(byw)
subzero = np.nonzero(data)
subuno = np.reshape(subzero,(17345,2)) # this is because cv2.minAreaRect() receives a Nx2 numpy
rect = cv2.minAreaRect(subuno)
box = cv2.boxPoints(rect)
box = np.int0(box)
cv2.drawContours(draw,[box],0,(0,255,0),2)

但是再次结果不是预期的，矩形没有很好的定位。

<小时/> 我也可能试图在C ++中使for像，但我不知道如何从中获取顶点 box = cv2.boxPoints(rect)。请帮忙！

Answer 1

也许你可以看一下：http://www.pyimagesearch.com/2014/08/25/4-point-opencv-getperspective-transform-example/

在该链接中，作者对整个文档（以及所包含的文本）进行了修改或转换，但是，它取决于根据图像中找到的轮廓查找文档的边缘。

他在以下教程中进一步了解：http://www.pyimagesearch.com/2014/09/01/build-kick-ass-mobile-document-scanner-just-5-minutes/

他的解决方案有效，因为他可以根据检测到的文档的位置，方向和偏斜来调整整个文档。实际上，调整文档的整体位置可以调整文档中的所有内容，包括文本。

但是，我相信您要问的是，即使没有检测到任何文档边缘和轮廓，您也希望校正文本。如果是这种情况，那么我假设您需要提供另一个基础或标准来基于您的文本纠偏（即检测到图像中有字母，然后检测字母是否有偏差）根据你的标准，然后调整字母），这可能是一个非常重要的练习。

使用OpenCV和Python歪斜文本（RotatedRect，minAreaRect）

1 个答案: