排序和排序列Python

时间:2017-12-11 14:19:53

标签: python python-2.7 csv sorting

我有一个代码可以使用其他CSV文件中的信息创建CSV。在我的新CSV文件中,我只想保存从row ['impressions']

的最高到最低排序的20行

我读了一些关于熊猫的事,但我找不到任何关于如何做的事情!

为了更清楚,我分享了一些图片:

之前: enter image description here

后: enter image description here

代码:

import csv
input_file = 'report_2017_12_11_12_31_19UTC.csv'
output_file= "All_Data_Tags.csv"

with open(input_file) as csvfile, open(output_file,  "w") as output:
    reader = csv.DictReader(csvfile)
    cols = ("domain","ddomain","opportunities", "impressions", "fillRate", "DATA")
    writer = csv.DictWriter(output, fieldnames=cols, extrasaction='ignore')

    writer.writeheader()
    for row in reader:
        row['fillRate'] = '{:.2f}'.format(float(row['fillRate']) * 100)
        if row['ddomain']  == "":
            if row['domain']  == "":
                row['ddomain'] = "App"
                row['domain'] = " "
        if row['domain'] == row['ddomain']:
            row['domain'] = "Real Site"    
        if row['domain']  == "":
            row['domain'] = "Detected Only"
        if row['ddomain']  == "":
            row['ddomain'] = "Vast Media"
        if row['ddomain'] != row['domain']:
            if row['ddomain'] != "Vast Media":
                if row['domain'] != "Real Site":
                    if row['domain'] != "Detected Only":
                        if row['ddomain'] != "App":
                            row['DATA'] = "FAKE"
                        else:
                            row['DATA'] = "OK"
                    else:
                        row['DATA'] = "OK"
                else:
                    row['DATA'] = "OK"
            else:
                row['DATA'] = "OK"

        writer.writerow(row)

2 个答案:

答案 0 :(得分:0)

以下是答案:

代码:

import pandas as pd 


movies = pd.read_csv('Top20_Media_Yesterday.csv')

movies = movies.sort_values(['impressions'], ascending=False)

movies = movies.to_csv("Top20_Media_Yesterday.csv")

movies = pd.read_csv('Top20_Media_Yesterday.csv', nrows=21)

movies = movies.to_csv("Top20_Media_Yesterday.csv")

答案 1 :(得分:0)

使用pandas框架的DataFrame.sort_values函数,将要排序的列名称传递给by参数,并将axis设置为1。

您可以找到类似的示例here