Question

我对预测（例如，使用卡尔曼滤波器）和预测（例如，使用SGDRegressor或Logistic回归）感到困惑。

我有一个数据集，其中包含一个月（MAY）内每小时的登录次数。我的变量是：_time和登录名。我使用了SGDRegressor并保存了模型（5月）。然后，我将保存的模型与JUNE数据一起应用（这是看不见的数据）。结果非常好（我不知道是否过拟合）。但正如我所说。对于使用Kalman进行预测和使用SGDRegressor进行预测（单变量），我感到困惑。

有什么帮助吗？

FORECASTING WITH KALMAN
Here, I am using 36 hours as test and I am forecasting 36 hours

| inputlookup fortigate_QC_May2019_logins.csv
| fit StandardScaler "logins" with_mean=false with_std=true
| fit DBSCAN "SS_logins"
| where NOT cluster==-1
| predict "SS_logins" as prediction algorithm=LLP5 holdback=36 future_timespan=72 upper95=upper95 lower95=lower95 
| `forecastviz(72, 36, "SS_logins", 95)`



PREDICTING WITH SGDREGRESSOR
Here, I am calling a saved model (using MAY) and applying it with JUNE.
| inputlookup fortigate_QC_June2019_logins.csv
| fit StandardScaler "logins" with_mean=false with_std=true
| apply "authentication_profiling_SGDRegressor"
| table _time, "SS_logins", "predicted(SS_logins)"

是否可以使用仅2个变量的回归算法进行预测？

0 个答案: