Question

我正在使用聚类标准错误（https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.RegressionResults.get_robustcov_results.html）使用statsmodel运行OLS回归：

import statsmodels.api as sm
import statsmodels.formula.api as smf

model = smf.ols(model_string, data=df1).fit(cov_type = 'cluster', cov_kwds={'groups': df1['correctedSE']})

我运行了一个具有不同规格的循环，并希望可视化所有迭代的系数。

我在每次迭代中使用以下方法提取系数和标准误差：

coe = model.params["variable_of_interest"] 
se = model.bse["variable_of_interest"]
List_coefficients.append(coe)
List_standard_errors.append(se)

我从列表中创建了一个简单的数据框，然后希望在matplotlib的错误栏中显示系数：

import matplotlib
import matplotlib.pyplot as plt
ax = plt.errorbar(df["loop_rounds"], df["Coefficient"], yerr=df150["Standard_e"])

但是，这样做时，置信区间的计算方式有所不同，并且图中的某些系数当前很重要。

什么是校正计算差异的最佳方法（例如，从statsmodel中提取不同的参数，手动调整标准误差，更改误差栏）？如果手动编辑是解决方案，那么在哪里可以找到聚类标准错误的公式？

谢谢

重现统计模型计算

0 个答案: