Question

I am learning pandas. To practice, I downloaded a .csv file from UN Data.

http://data.un.org/Data.aspx?q=gdp&d=SNAAMA&f=grID%3a101%3bcurrID%3aUSD%3bpcFlag%3a1

However, the CSV file is not in the format I expected. Loaded into a DataFrame, it looks like this:

      Country or Area  Year                          Item       Value
0         Afghanistan  2016  Gross Domestic Product (GDP)  583.882867
1         Afghanistan  2015  Gross Domestic Product (GDP)  610.854517
2         Afghanistan  2014  Gross Domestic Product (GDP)  651.158326
3         Afghanistan  2013  Gross Domestic Product (GDP)  681.033974
4         Afghanistan  2012  Gross Domestic Product (GDP)  694.885886
...
9441         Zimbabwe  1974  Gross Domestic Product (GDP)  721.921573
9442         Zimbabwe  1973  Gross Domestic Product (GDP)  620.261471
9443         Zimbabwe  1972  Gross Domestic Product (GDP)  521.701845
9444         Zimbabwe  1971  Gross Domestic Product (GDP)  437.030171
9445         Zimbabwe  1970  Gross Domestic Product (GDP)  390.776187

I would like to see each country's data in a single row, with the years turned into column names.

Is there a way to do this?

Answer 1

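You can try unstack. Set the country and the year as the index, select the Value column, and unstack the Year level so that each year becomes a column: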

df.set_index(['Country or Area', 'Year'])['Value'].unstack()
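For anyone who wants to try this without downloading the file, here is a minimal sketch. The four-row DataFrame is a stand-in built by hand from rows shown in the question (the names df, wide and wide_pivot are just illustrative), and it assumes each (country, year) pair appears only once. It also shows DataFrame.pivot, which should give the same layout.

```python
import pandas as pd

# Hand-built stand-in for the UN file: same columns, a few rows from the question.
df = pd.DataFrame({
    "Country or Area": ["Afghanistan", "Afghanistan", "Zimbabwe", "Zimbabwe"],
    "Year": [2016, 2015, 1971, 1970],
    "Item": ["Gross Domestic Product (GDP)"] * 4,
    "Value": [583.882867, 610.854517, 437.030171, 390.776187],
})

# Put country and year in the index, keep only Value,
# then move the innermost index level (Year) into the columns.
wide = df.set_index(["Country or Area", "Year"])["Value"].unstack()

# Alternative spelling of the same reshape using pivot.
wide_pivot = df.pivot(index="Country or Area", columns="Year", values="Value")

print(wide)
```

Years that are missing for a country show up as NaN. If the real file contains duplicate (country, year) pairs, both unstack and pivot raise an error; in that case pivot_table with an aggregation function is worth a look.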
