Question

我正在尝试运行此代码，其中字典的数据保存在单独的csv文件中。这是字典：

body = {
        'dont-ask-for-email': 0,
        'action': 'submit_user_review',
        'post_id': 76196,
        'email': email_random(),
        'subscribe': 1,
        'previous_hosting_id': prev_hosting_comp_random(),
        'fb_token': '',
        'title': review_title_random(),
        'summary': summary_random(),
        'score_pricing': star_random(),
        'score_userfriendly': star_random(),
        'score_support': star_random(),
        'score_features': star_random(),
        'hosting_type': hosting_type_random(),
        'author': name_random(),
        'social_link': '',
        'site': '',
        'screenshot[image][]': '',
        'screenshot[description][]': '',
        'user_data_process_agreement': 1,
        'user_email_popup': '',
        'subscribe_popup': 1,
        'email_asked': 1
}

现在这是写入CSV文件并最终保存的代码：

columns = []
rows = []
chunks = body.split('}')
for chunk in chunks:
    row = []
    if len(chunk)>1:
        entry = chunk.replace('{','').strip().split(',')
        for e in entry:
            item = e.strip().split(':')
            if len(item)==2:
                row.append(item[1])
                if chunks.index(chunk)==0:
                    columns.append(item[0])
        rows.append(row)
df = pd.DataFrame(rows, columns = columns)
df.head()

df.to_csv ('r3edata.csv', index = False, header = True)

但这是我得到的错误：

Traceback (most recent call last):
  File "codeOffshoreupdated.py", line 125, in <module>
    chunks = body.split('}')
AttributeError: 'dict' object has no attribute 'split'

我知道字典没有名为split的属性，但是我该如何解决？

编辑：我想要的CSV格式：

dont-ask-for-email, action, post_id, email, subscribe, previous_hosting_id, fb_token, title, summary, score_pricing, score_userfriendly, score_support, score_features, hosting_type,author, social_link, site, screenshot[image][],screenshot[description][],user_data_process_agreement,user_email_popup,subscribe_popup,email_asked
0,'submit_user_review',76196,email_random(),1,prev_hosting_comp_random(),,review_title_random(),summary_random(),star_random(),star_random(),star_random(),star_random(),hosting_type_random(),name_random(),,,,,1,,1,1

注意：提到的所有这些函数都是返回值

Edit2：

我正在像这样从email_random（）函数中选择电子邮件：

def email_random():
        with open('emaillist.txt') as emails:
                read_emails = csv.reader(emails, delimiter = '\n')
                return random.choice(list(read_emails))[0]

和emaillist.txt就像这样：

xyz@gmail.com
xya@gmail.com
xyb@gmail.com
xyc@gmail.com
xyd@gmail.com

其他功能也从这样的文件中提取数据。

Answer 1

由于body是字典，因此您无需进行任何手动解析即可将其转换为CSV格式。

如果您希望将函数调用（例如email_random()）这样写入CSV，则需要将其包装在引号中（如我在下面所做的那样）。如果希望它们解析为函数调用并编写结果，则可以保持它们不变。

import csv


def email_random():
    return "john@example.com"


body = {
    'dont-ask-for-email': 0,
    'action': 'submit_user_review',
    'post_id': 76196,
    'email': email_random(),
    'subscribe': 1,
    'previous_hosting_id': "prev_hosting_comp_random()",
    'fb_token': '',
    'title': "review_title_random()",
    'summary': "summary_random()",
    'score_pricing': "star_random()",
    'score_userfriendly': "star_random()",
    'score_support': "star_random()",
    'score_features': "star_random()",
    'hosting_type': "hosting_type_random()",
    'author': "name_random()",
    'social_link': '',
    'site': '',
    'screenshot[image][]': '',
    'screenshot[description][]': '',
    'user_data_process_agreement': 1,
    'user_email_popup': '',
    'subscribe_popup': 1,
    'email_asked': 1
}

with open('example.csv', 'w') as fhandle:
    writer = csv.writer(fhandle)
    items = body.items()
    writer.writerow([key for key, value in items])
    writer.writerow([value for key, value in items])

我们在这里做的是：

with open('example.csv', 'w') as fhandle:

这将打开一个具有写入权限（example.csv）的新文件（名为'w'），并将引用存储到变量fhandle中。如果您不熟悉使用with，则可以从this PEP了解有关它们的更多信息。

body.items()将返回一个可迭代的元组（这样做是为了确保以相同的顺序返回字典项）。这样的输出看起来像[('dont-ask-for-email', 0), ('action', 'submit_user_review'), ...]。

然后我们可以使用列表理解功能首先写入所有键，然后在下一行中写入所有值。

这导致

dont-ask-for-email,action,post_id,email,subscribe,previous_hosting_id,fb_token,title,summary,score_pricing,score_userfriendly,score_support,score_features,hosting_type,author,social_link,site,screenshot[image][],screenshot[description][],user_data_process_agreement,user_email_popup,subscribe_popup,email_asked
0,submit_user_review,76196,john@example.com,1,prev_hosting_comp_random(),,review_title_random(),summary_random(),star_random(),star_random(),star_random(),star_random(),hosting_type_random(),name_random(),,,,,1,,1,1

AttributeError：'dict'对象没有属性'split'

1 个答案: