您好,我正在尝试了解回归算法:我用梯度下降实现了线性回归,并用残差平方和(RSS)的变化来判断收敛。我注意到在迭代过程中的某些时刻,残差平方和的值反而在上升,这显然不合理,但我不知道该如何解决。我做错了什么?
import math
import numpy as num
def get_regression_predictions(input_feature, intercept, slope):
    """Return the fitted value intercept + slope * x for every x in input_feature."""
    predictions = []
    for feature_value in input_feature:
        predictions.append(intercept + slope * feature_value)
    return predictions
def get_residual_sum_of_squares(input_feature, output, intercept, slope):
    """Sum of squared residuals between output and the line intercept + slope * x.

    input_feature and output are indexed positionally via .iloc (pandas Series).
    """
    squared_residuals = []
    for idx in range(len(output)):
        fitted = intercept + slope * input_feature.iloc[idx]
        squared_residuals.append((output.iloc[idx] - fitted) ** 2)
    return num.sum(squared_residuals)
def train(input_feature, output, intercept, slope):
    """Fit a simple linear regression y ≈ intercept + slope * x by batch
    gradient descent.

    input_feature, output: pandas Series (accessed positionally via .iloc).
    intercept, slope: starting values (the question says the caller passes 0, 0).
    Returns the fitted (intercept, slope) when sqrt(RSS) changes by < 0.01
    between iterations.

    Bug fixes versus the original listing:
    - the slope gradient must multiply each error by input_feature, not by
      output (d RSS / d slope = -2 * sum(error_i * x_i));
    - the update direction was reversed: with error = y - prediction the RSS
      gradient is negative, so a descent step ADDS step * sum(errors).
      Subtracting it moved uphill, which is why RSS was rising.
    NOTE: step size 0.05 is not scaled by the sample count, so on large
    datasets the summed gradients can still overflow (the answer below
    suggests stochastic gradient descent for that reason).
    """
    step = 0.05

    def residuals(a, b):
        # error_i = y_i - (a + b * x_i)
        return [output.iloc[i] - (a + b * input_feature.iloc[i])
                for i in range(len(output))]

    def cost(a, b):
        # sqrt(RSS): the quantity whose change drives the stopping rule
        return math.sqrt(sum(e ** 2 for e in residuals(a, b)))

    last = 0
    now = cost(intercept, slope)
    while abs(last - now) >= 0.01:
        last = now
        errors = residuals(intercept, slope)
        # Descent step; the factor 2 of the true gradient is absorbed into step.
        intercept += step * sum(errors)
        slope += step * sum(errors[i] * input_feature.iloc[i]
                            for i in range(len(errors)))
        now = cost(intercept, slope)
    return intercept, slope
答案 0(得分:0):
可以尝试随机梯度下降。你现在是一次性对大量样本的误差求和,这可能导致数值溢出。