Question

我需要使用scipy来最小化上述功能。

我的输入数据

np.random.seed(1234)
m = 500    #500
n = 100    #100
A = np.asmatrix(np.random.randint(low=-10,high=1,size=(n,m)))     
b = np.asmatrix(np.random.randint(low=500,high=1000,size=(m,1)))  
c = np.asmatrix(np.random.randint(low=0.1,high=2,size=(n,1)))
x = np.asmatrix(np.random.randint(low=1,high=10,size=(n,1)))

我的功能和渐变：

def func(x, A, b, c):
    fx = np.dot(c.T, x) - np.sum(np.log10((b - np.dot(A.T, x))))
    return fx

def grad(x, A, b, c):
    gradient = A * (1.0/(b - A.transpose()*x)) + c
    return gradient

这是我试图运行的

scipy.optimize.fmin_cg(func(x + t*grad(x, A, b, c),A, b, c), x, args=(A,b,c,x))

Answer 1

我不确定你用

做什么

func(x + t*grad(x, A, b, c), A, b, c)

什么是t？

无论如何，您对fmin_cg的来电不正确 - fmin_cg的签名是

fmin_cg(f, x0, fprime=None, args=(), ...)

第一个参数需要是你的目标函数func，第二个参数需要是x的初始猜测，第三个（可选）参数是你的渐变函数，grad ，第四个是f和fprime的附加参数集（不包括 x）。

电话应该如下：

scipy.optimize.fmin_cg(func, x, fprime=grad, args=(A, b, c))

但是，由于数组维度存在问题，这仍然无法正常工作：

<ipython-input-49-bf5fa71345fe> in grad(x, A, b, c)
      1 def grad(x, A, b, c):
----> 2         gradient = A * (1.0/(b - A.transpose()*x)) + c
      3         return gradient
      4 

/home/alistair/.venvs/core/local/lib/python2.7/site-packages/numpy/matrixlib/defmatrix.pyc in __mul__(self, other)
    341         if isinstance(other, (N.ndarray, list, tuple)) :
    342             # This promotes 1-D vectors to row vectors
--> 343             return N.dot(self, asmatrix(other))
    344         if isscalar(other) or not hasattr(other, '__rmul__') :
    345             return N.dot(self, other)

ValueError: shapes (500,100) and (1,100) not aligned: 100 (dim 1) != 1 (dim 0)

要弄清楚为什么会发生这种情况，我们可以在grad中设置一个断点：

import pdb

def grad(x, A, b, c):
    pdb.set_trace()
    gradient = A * (1.0/(b - A.transpose()*x)) + c
    return gradient

在第一次致电grad时，我们会看到：

(Pdb) type(x)
<type 'numpy.ndarray'>
(Pdb) !x.shape
(100,)

fmin_cg内的某处，x正在从(100, 1) np.matrix转换为(100,) 1D np.ndarray。对于np.ndarray，*运算符执行元素乘法而不是矩阵乘法，这将失败，因为x和A.transpose()具有不兼容的维度。

基本上，你正在遇到这样一个事实：np.matrix并不完全支持numpy和scipy中许多期望np.ndarray的函数。我强烈建议您从使用np.matrix切换到np.ndarray - 使用np.matrix为officially discouraged，并且可能会在不久的将来弃用。

您的grad功能可以改写为：

def grad(x, A, b, c):
    gradient = A.dot(1.0/(b - A.T.dot(x))) + c
    return gradient

...和你的初始论点：

np.random.seed(1234)
m = 500    #500
n = 100    #100
A = np.random.randint(low=-10, high=1, size=(n, m))
b = np.random.randint(low=500, high=1000, size=m) 
c = np.random.randint(low=0.1, high=2, size=n)
x = np.random.randint(low=1, high=10, size=n)

...现在您对fmin_cg的致电应该有效：

res = scipy.optimize.fmin_cg(func, x, fprime=grad, args=(A, b, c))

使用scipy.optimize最小化函数

1 个答案: