ValueError:压缩距离矩阵必须仅包含有限值

时间:2020-09-23 23:34:53

标签: python hierarchical-clustering

我正在尝试聚类并不断获取“ ValueError:压缩距离矩阵必须仅包含有限值。”

我尝试了下面的代码,但无法解决问题。

df.fillna(0)
df.replace(np.nan, 0)
plt.figure(figsize=(10,3))
%matplotlib inline
plt.style.use('seaborn-whitegrid')
columns = ['hr_starting','coal','gas','hydro','nuclear','oil','other','solar','storage','wind','flywheel','total_gen','thermal_load','total_renewables','gas_percentage_of_thermal_load','philadelphia','congestion_price','energy_price','lmp_price','marginal_loss_price']
x = df[['coal','gas','hydro','nuclear','oil','other','solar','storage','wind','total_gen','thermal_load','total_renewables','congestion_price','energy_price','lmp_price','marginal_loss_price']]

y = df['total_gen']

linked = linkage(x, 'single')

labelList = range(1, 11)

plt.figure(figsize=(10, 7))
dendrogram(linked,
            orientation='top',
            labels=labelList,
            distance_sort='descending',
            show_leaf_counts=True)

plt.xlabel('cluster size')
plt.ylabel('distance')
plt.show()```

0 个答案:

没有答案