Question

我正在尝试获取一些代码，这些代码将在图像上执行透视变换（在本例中为3d旋转）。

import os.path
import numpy as np
import cv

def rotation(angle, axis):
    return np.eye(3) + np.sin(angle) * skew(axis) \
               + (1 - np.cos(angle)) * skew(axis).dot(skew(axis))

def skew(vec):
    return np.array([[0, -vec[2], vec[1]],
                     [vec[2], 0, -vec[0]],
                     [-vec[1], vec[0], 0]])

def rotate_image(imgname_in, angle, axis, imgname_out=None):
    if imgname_out is None:
        base, ext = os.path.splitext(imgname_in)
        imgname_out = base + '-out' + ext
    img_in = cv.LoadImage(imgname_in)
    img_size = cv.GetSize(img_in)
    img_out = cv.CreateImage(img_size, img_in.depth, img_in.nChannels)
    transform = rotation(angle, axis)
    cv.WarpPerspective(img_in, img_out, cv.fromarray(transform))
    cv.SaveImage(imgname_out, img_out)

当我围绕z轴旋转时，一切都按预期工作，但绕x或y轴旋转似乎完全关闭。在开始获得看似合理的结果之前，我需要以小到pi / 200的角度旋转。知道什么可能是错的吗？

Answer 1

首先，构建形式为

的旋转矩阵

    [cos(theta)  -sin(theta)  0]
R = [sin(theta)   cos(theta)  0]
    [0            0           1]

应用此坐标变换可以围绕原点旋转。

相反，如果要围绕图像中心旋转，则必须先移动图像中心到原点，然后应用旋转，然后将所有东西都移回去。你可以用一个翻译矩阵：

    [1  0  -image_width/2]
T = [0  1  -image_height/2]
    [0  0   1]

然后，用于平移，旋转和反向平移的变换矩阵变为：

H = inv(T) * R * T

我将不得不考虑如何将偏斜矩阵与3D变换联系起来。我希望最简单的方法是设置一个4D变换矩阵，然后将其投影回2D齐次坐标。但就目前而言，偏斜矩阵的一般形式是：

    [x_scale 0       0]
S = [0       y_scale 0]
    [x_skew  y_skew  1]

x_skew和y_skew值通常很小（1e-3或更低）。

以下是代码：

from skimage import data, transform
import numpy as np
import matplotlib.pyplot as plt

img = data.camera()

theta = np.deg2rad(10)
tx = 0
ty = 0

S, C = np.sin(theta), np.cos(theta)

# Rotation matrix, angle theta, translation tx, ty
H = np.array([[C, -S, tx],
              [S,  C, ty],
              [0,  0, 1]])

# Translation matrix to shift the image center to the origin
r, c = img.shape
T = np.array([[1, 0, -c / 2.],
              [0, 1, -r / 2.],
              [0, 0, 1]])

# Skew, for perspective
S = np.array([[1, 0, 0],
              [0, 1.3, 0],
              [0, 1e-3, 1]])

img_rot = transform.homography(img, H)
img_rot_center_skew = transform.homography(img, S.dot(np.linalg.inv(T).dot(H).dot(T)))

f, (ax0, ax1, ax2) = plt.subplots(1, 3)
ax0.imshow(img, cmap=plt.cm.gray, interpolation='nearest')
ax1.imshow(img_rot, cmap=plt.cm.gray, interpolation='nearest')
ax2.imshow(img_rot_center_skew, cmap=plt.cm.gray, interpolation='nearest')
plt.show()

输出：

Rotations of cameraman around origin and center+skew

Answer 2

我没有得到你构建旋转矩阵的方法。这对我来说似乎相当复杂。通常，它将通过构建零矩阵来构建，将1放在不需要的轴上，将公共sin，cos，-cos，sin放入两个使用的尺寸。然后将所有这些相乘。

你从哪里获得np.eye(3) + np.sin(angle) * skew(axis) + (1 - np.cos(angle)) * skew(axis).dot(skew(axis))构造？

尝试从基本构建块构建投影矩阵。构造旋转矩阵相当容易，“rotationmatrix dot skewmatrix”应该可以工作。

您可能需要注意旋转中心。您的图像可能位于z轴上的虚拟位置1处，因此通过在x或y上旋转，它会移动一点。所以你需要使用翻译，因此z变为0，然后旋转，然后翻译。（仿射坐标中的平移矩阵也非常简单。请参阅维基百科：https://en.wikipedia.org/wiki/Transformation_matrix）

在图像上的3d旋转

2 个答案: