AttributeError:'dict'对象没有属性'split'

时间:2020-05-23 14:34:12

标签: python json python-3.x python-2.7

我正在尝试运行此代码,其中字典的数据保存在单独的csv文件中。 这是字典:

body = {
        'dont-ask-for-email': 0,
        'action': 'submit_user_review',
        'post_id': 76196,
        'email': email_random(),
        'subscribe': 1,
        'previous_hosting_id': prev_hosting_comp_random(),
        'fb_token': '',
        'title': review_title_random(),
        'summary': summary_random(),
        'score_pricing': star_random(),
        'score_userfriendly': star_random(),
        'score_support': star_random(),
        'score_features': star_random(),
        'hosting_type': hosting_type_random(),
        'author': name_random(),
        'social_link': '',
        'site': '',
        'screenshot[image][]': '',
        'screenshot[description][]': '',
        'user_data_process_agreement': 1,
        'user_email_popup': '',
        'subscribe_popup': 1,
        'email_asked': 1
}

现在这是写入CSV文件并最终保存的代码:

columns = []
rows = []
chunks = body.split('}')
for chunk in chunks:
    row = []
    if len(chunk)>1:
        entry = chunk.replace('{','').strip().split(',')
        for e in entry:
            item = e.strip().split(':')
            if len(item)==2:
                row.append(item[1])
                if chunks.index(chunk)==0:
                    columns.append(item[0])
        rows.append(row)
df = pd.DataFrame(rows, columns = columns)
df.head()

df.to_csv ('r3edata.csv', index = False, header = True)

但这是我得到的错误:

Traceback (most recent call last):
  File "codeOffshoreupdated.py", line 125, in <module>
    chunks = body.split('}')
AttributeError: 'dict' object has no attribute 'split'

我知道字典没有名为split的属性,但是我该如何解决?

编辑: 我想要的CSV格式:

dont-ask-for-email, action, post_id, email, subscribe, previous_hosting_id, fb_token, title, summary, score_pricing, score_userfriendly, score_support, score_features, hosting_type,author, social_link, site, screenshot[image][],screenshot[description][],user_data_process_agreement,user_email_popup,subscribe_popup,email_asked
0,'submit_user_review',76196,email_random(),1,prev_hosting_comp_random(),,review_title_random(),summary_random(),star_random(),star_random(),star_random(),star_random(),hosting_type_random(),name_random(),,,,,1,,1,1

注意:提到的所有这些函数都是返回值

Edit2:

我正在像这样从email_random()函数中选择电子邮件:

def email_random():
        with open('emaillist.txt') as emails:
                read_emails = csv.reader(emails, delimiter = '\n')
                return random.choice(list(read_emails))[0]

和emaillist.txt就像这样:

xyz@gmail.com
xya@gmail.com
xyb@gmail.com
xyc@gmail.com
xyd@gmail.com 

其他功能也从这样的文件中提取数据。

1 个答案:

答案 0 :(得分:1)

由于body是字典,因此您无需进行任何手动解析即可将其转换为CSV格式。

如果您希望将函数调用(例如email_random())这样写入CSV,则需要将其包装在引号中(如我在下面所做的那样)。如果希望它们解析为函数调用并编写结果,则可以保持它们不变。

import csv


def email_random():
    return "john@example.com"


body = {
    'dont-ask-for-email': 0,
    'action': 'submit_user_review',
    'post_id': 76196,
    'email': email_random(),
    'subscribe': 1,
    'previous_hosting_id': "prev_hosting_comp_random()",
    'fb_token': '',
    'title': "review_title_random()",
    'summary': "summary_random()",
    'score_pricing': "star_random()",
    'score_userfriendly': "star_random()",
    'score_support': "star_random()",
    'score_features': "star_random()",
    'hosting_type': "hosting_type_random()",
    'author': "name_random()",
    'social_link': '',
    'site': '',
    'screenshot[image][]': '',
    'screenshot[description][]': '',
    'user_data_process_agreement': 1,
    'user_email_popup': '',
    'subscribe_popup': 1,
    'email_asked': 1
}

with open('example.csv', 'w') as fhandle:
    writer = csv.writer(fhandle)
    items = body.items()
    writer.writerow([key for key, value in items])
    writer.writerow([value for key, value in items])

我们在这里做的是:

with open('example.csv', 'w') as fhandle:

这将打开一个具有写入权限(example.csv)的新文件(名为'w'),并将引用存储到变量fhandle中。如果您不熟悉使用with,则可以从this PEP了解有关它们的更多信息。

body.items()将返回一个可迭代的元组(这样做是为了确保以相同的顺序返回字典项)。这样的输出看起来像[('dont-ask-for-email', 0), ('action', 'submit_user_review'), ...]

然后我们可以使用列表理解功能首先写入所有键,然后在下一行中写入所有值。

这导致

dont-ask-for-email,action,post_id,email,subscribe,previous_hosting_id,fb_token,title,summary,score_pricing,score_userfriendly,score_support,score_features,hosting_type,author,social_link,site,screenshot[image][],screenshot[description][],user_data_process_agreement,user_email_popup,subscribe_popup,email_asked
0,submit_user_review,76196,john@example.com,1,prev_hosting_comp_random(),,review_title_random(),summary_random(),star_random(),star_random(),star_random(),star_random(),hosting_type_random(),name_random(),,,,,1,,1,1