操作数不能与形状(100,)(8,8)一起广播

时间:2020-05-14 20:48:53

标签: python

我正在制作Baysien II型模型,以寻找最大可能性来估计超参数的“最可能”值。在尝试运行下面的代码后,我在compute_posterior函数中看到了一些错误。形状可能有问题。

def compute_posterior(PHI, t, alph, s2):
    M = PHI.shape[1]
    beta = 1/s2
    H = beta*(PHI.T @ PHI) + alph*np.eye(M)
    SIGMA = np.linalg.inv(H)
    Mu = beta * (SIGMA @ (PHI.T @ t))
    #
    return Mu, SIGMA


# Marginal log likelihood
#
# Version 1 Log Marginal (ideal)
#
def compute_log_marginal(PHI, t, alph, s2):
    #
    # Exploit the shape of C and the fact that M < N (usually)
    #
    N, M = PHI.shape
    beta = 1 / s2
    Mu, SIGMA = compute_posterior(PHI, t, alph, s2)
    #
    # Constant factor
    #
    logML = -N * np.log(2 * np.pi)
    #
    # log determinant factor (log|C|)
    #
    # If SIGMA becomes singular, sgn<0
    #
    sgn, logdet = np.linalg.slogdet(SIGMA)
    #
    if sgn < 0:
        print("Error with alpha={0}, s2={1}".format(alph, s2))
        raise np.linalg.LinAlgError("logdet sign is negative - something is wrong!")
    #
    logML += logdet + N*np.log(beta) + M*np.log(alph)
    #
    # data term (t'Ct)
    #
    logML -= beta * (t.T @ (t - PHI @ Mu))
    #
    logML = logML / 2.0
    #
    return logML

log_alph = np.linspace(-3, 6, n)
log_s2 = np.linspace(-4, 4, n)

x = df.values[:, 0:8]
t = df.values[:, 8]

n  = 100

Z = np.empty((n, n))
Z_max = []
al = []
rr = []
for i, a in enumerate(log_alph):
    for j, r in enumerate(log_s2):
        Z[i, j] = compute_log_marginal(x,t,log_alph,log_s2)
        Z_max.append(Z[i, j])
        maximum = max(Z_max)
        print(maximum)

#         if Z[i, j] == -11.627510277032485:
#             al.append(a)
#             rr.append(r)

# maximum = max(Z_max)
# print(al)
# print(rr)
# print(maximum)
# maximum = -11.627510277032485



plt.contourf(log_a, log_r, Z.T)
print('The maximum point is:',maximum )
print('The max log_alpha is:',al[0] )
print('The max log_r  is:',rr[0] )
plt.xlabel('log alpha')
plt.ylabel('log r')
plt.title('Contour of log alpha and log r')

编译后出现此错误,我仍然不知道如何解决

'
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-260-c7f0396b4046> in <module>
     14 for i, a in enumerate(log_alph):
     15     for j, r in enumerate(log_s2):
---> 16         Z[i, j] = compute_log_marginal(x,t,log_alph,log_s2)
     17         Z_max.append(Z[i, j])
     18         maximum = max(Z_max)

<ipython-input-233-45cfae272a38> in compute_log_marginal(PHI, t, alph, s2)
     19     N, M = PHI.shape
     20     beta = 1 / s2
---> 21     Mu, SIGMA = compute_posterior(PHI, t, alph, s2)
     22     #
     23     # Constant factor

<ipython-input-233-45cfae272a38> in compute_posterior(PHI, t, alph, s2)
      2     M = PHI.shape[1]
      3     beta = 1/s2
----> 4     H = beta*(PHI.T @ PHI) + alph*np.eye(M)
      5     SIGMA = np.linalg.inv(H)
      6     Mu = beta * (SIGMA @ (PHI.T @ t))

ValueError: operands could not be broadcast together with shapes (100,) (8,8) 

'

1 个答案:

答案 0 :(得分:0)

H = beta*(PHI.T @ PHI) + alph*np.eye(M)
在这一行中的

拆分操作,找出哪一个引发了错误 如果您的逻辑还可以,请检查所有矩阵是否都具有您期望的形状。 如果形状可以,请注意@是矩阵乘法,*是元素乘法。 在发布代码的Nex时间中,尝试包含更易读的较小部分,或者包含每个人都将能够执行的代码进行调试。