Question

我正在使用cython对矩形矩阵A进行一级更新。我不能让dger按照我的意愿进行更新，所以我已经在一个函数中隔离了它：

from scipy.linalg.cython_blas cimport dger
cimport cython

def test_dger(double[:, :] A, double[:] x, double[:] y):
    cdef int inc = 1
    cdef double one = 1
    cdef int n_= A.shape[0]
    cdef int m = A.shape[1]
    dger(&n, &m, &one, &x[0], &inc, &y[0], &inc, &A[0, 0], &n)
    return np.array(A)

编译得很好。但是，做：

n = 3
m = 4
A = np.zeros([n, m])
y = np.arange(m, dtype=float)
x = np.array([1., 4, 0])
test_dger(A, x, y)

给了我

array([[  0.,   0.,   0.,   1.],
       [  4.,   0.,   2.,   8.],
       [  0.,   3.,  12.,   0.]])

具有所需的n×m形状，但值的顺序错误。我认为C命令与Fortran命令有关，但我自己无法解决问题。

我期待的结果是

给出的结果

np.dot(x[:, None], y[None, :])
array([[  0.,   1.,   2.,   3.],
       [  0.,   4.,   8.,  12.],
       [  0.,   0.,   0.,   0.]])

Answer 1

这确实是 C 与 Fortran 命令。由于 Fortran 顺序中的矩阵 A 维数是 4x3，因此 x 和 y 应该交换。解决办法是：

cimport cython
from scipy.linalg.cython_blas cimport dger

def test_dger(double[:, :] A, double[:] y, double[:] x):
    # x and y swapped!
    cdef int inc = 1
    cdef double one = 1.0
    cdef int m = A.shape[0] # Transposed shape!
    cdef int n = A.shape[1] # Transposed shape!
    dger(&n, &m, &one, &x[0], &inc, &y[0], &inc, &A[0, 0], &n)
    return A

现在它工作得很好：

n, m = 3, 4
A = np.zeros([n, m])
x = np.array([1., 4, 0], dtype=float)
y = np.arange(m, dtype=float)

test_dger(A, y, x)
A
array([[ 0.,  1.,  2.,  3.],
       [ 0.,  4.,  8., 12.],
       [ 0.,  0.,  0.,  0.]])

BLAS dger填充矩阵的命令？

1 个答案: