Question

scipy稀疏矩阵类型与普通numpy矩阵类型之间似乎存在一些差异

import scipy.sparse as sp
A = sp.dia_matrix(tri(3,4))
vec = array([1,2,3,4])

print A * vec                        #array([ 1.,  3.,  6.])

print A * (mat(vec).T)               #matrix([[ 1.],
                                     #        [ 3.],
                                     #        [ 6.]])

print A.todense() * vec              #ValueError: matrices are not aligned

print A.todense() * (mat(vec).T)     #matrix([[ 1.],
                                     #        [ 3.],
                                     #        [ 6.]])

为什么稀疏矩阵可以解释当正常矩阵不能解释时，数组应该被解释为列向量？

Answer 1

在spmatrix类（您可以在scipy / sparse / base.py中查看）__mul__()，有一组“ifs”可以回答您的问题：

class spmatrix(object):
    ...
    def __mul__(self, other):
        ...
        M,N = self.shape
        if other.__class__ is np.ndarray:
            # Fast path for the most common case
            if other.shape == (N,):
                return self._mul_vector(other)
            elif other.shape == (N, 1):
                return self._mul_vector(other.ravel()).reshape(M, 1)
            elif other.ndim == 2  and other.shape[0] == N:
                return self._mul_multivector(other)

对于1D数组，它总是从_mul_vector() compressed.py转到_cs_matrix内，使用下面给出的代码：

def _mul_vector(self, other):
    M,N = self.shape

    # output array
    result = np.zeros(M, dtype=upcast_char(self.dtype.char,
                                           other.dtype.char))

    # csr_matvec or csc_matvec
    fn = getattr(sparsetools,self.format + '_matvec')
    fn(M, N, self.indptr, self.indices, self.data, other, result)

    return result

请注意，它假定输出具有稀疏矩阵的行数。基本上，它将输入的1D数组视为符合稀疏数组的列数（没有转置或非转置）。但对于ndim==2的ndarray，它不能做这样的假设，所以如果你尝试过：

vec = np.array([[1,2,3,4],
                [1,2,3,4]])

A * vec.T将是唯一有效的选项。

对于1D矩阵，稀疏模块也不假设它符合列数。要检查您是否可以尝试：

A * mat(vec)
#ValueError: dimension mismatch

而A * mat(vec).T将是您唯一的选择。

Numpy / Scipy稀疏与密集乘法

1 个答案: