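I am working on a project that counts the total number of crimes in each San Francisco neighbourhood. I used the code below, but when I check the result with crime_count.dtypes, the neighbourhood column is not recognized as a column.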
import pandas as pd
file_name='https://cocl.us/sanfran_crime_dataset'
df=pd.read_csv(file_name)
crime_count = df["PdDistrict"].value_counts().to_frame('C').rename_axis('Neighbourhood')
crime_count = crime_count.rename(columns = {"PdDistrict":"Neighbourhood"})
crime_count = crime_count.rename(columns = {"C":"Counts"})
crime_count
I want to create a DataFrame in which both Neighbourhood and Counts are recognized as columns.
Answer 0: (score: 1)
First change the index name with rename_axis, then use Series.reset_index with the name parameter to convert the Series into a two-column DataFrame:
crime_count = (df["PdDistrict"].value_counts()
                               .rename_axis('Neighbourhood')
                               .reset_index(name='Counts'))
print(crime_count)
  Neighbourhood  Counts
0      SOUTHERN   28445
1      NORTHERN   20100
2       MISSION   19503
3       CENTRAL   17666
4       BAYVIEW   14303
5     INGLESIDE   11594
6       TARAVAL   11325
7    TENDERLOIN    9942
8      RICHMOND    8922
9          PARK    8699
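
To see why dtypes did not list the neighbourhood in the question, here is a minimal sketch that compares both approaches. It assumes a small made-up DataFrame in place of the real dataset, and the names as_index and as_columns are only illustrative. The point is that dtypes reports columns only, so values held in the index never appear there.

```python
import pandas as pd

# Made-up sample data (an assumption, not taken from the San Francisco dataset)
df = pd.DataFrame({"PdDistrict": ["SOUTHERN", "SOUTHERN", "MISSION",
                                  "SOUTHERN", "MISSION", "PARK"]})

# Approach from the question: the district names stay in the index,
# so the frame has a single column and dtypes lists only Counts
as_index = (df["PdDistrict"].value_counts()
                            .to_frame("Counts")
                            .rename_axis("Neighbourhood"))
print(as_index.dtypes)

# Approach from the answer: reset_index moves the index into a regular column,
# so dtypes lists both Neighbourhood and Counts
as_columns = (df["PdDistrict"].value_counts()
                              .rename_axis("Neighbourhood")
                              .reset_index(name="Counts"))
print(as_columns.dtypes)
```

If the frame from the question already exists, calling reset_index() on it should give the same two-column layout, because the index is already named Neighbourhood.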