Question

背景：

我有一个从数据库中获取一堆属性的函数。这是函数：

def getData(key, full_name, address, city, state, zipcode):
    try:
        url = 'https://personator.melissadata.net/v3/WEB/ContactVerify/doContactVerify'
        payload={
                'TransmissionReference': "test", # used by you to keep track of reference
                'Actions': 'Check',
                'Columns': 'Gender','DateOfBirth','DateOfDeath','EthnicCode','EthnicGroup','Education','PoliticalParty','MaritalStatus','HouseholdSize','ChildrenAgeRange','PresenceOfChildren','PresenceOfSenior','LengthOfResidence','OwnRent','CreditCardUser','Occupation','HouseholdIncome',
                'CustomerID': key,# key
                'Records': [{'FullName': str(full_name), 'AddressLine1': str(address), 'City': str(city), 'State': str(state), 'PostalCode': str(zipcode)}]
                }
        headers = {'Content-Type': 'application/json; charset=utf-8', 'Accept':'application/json', 'Host':'personator.melissadata.net','Expect': '100-continue', 'Connection':'Keep-Alive'}
        r = requests.post(url, data=json.dumps(payload), headers=headers)
        dom = json.loads(r.text)

        Gender = dom['Records'][0]['Gender']
        DateOfBirth = dom['Records'][0]['DateOfBirth']
        DateOfDeath = dom['Records'][0]['DateOfDeath']
        EthnicCode = dom['Records'][0]['EthnicCode']
        EthnicGroup = dom['Records'][0]['EthnicGroup']
        Education = dom['Records'][0]['Education']
        PoliticalParty = dom['Records'][0]['PoliticalParty']
        MaritalStatus = dom['Records'][0]['MaritalStatus']
        HouseholdSize = dom['Records'][0]['HouseholdSize']
        ChildrenAgeRange = dom['Records'][0]['ChildrenAgeRange']
        PresenceOfChildren = dom['Records'][0]['PresenceOfChildren']
        PresenceOfSenior = dom['Records'][0]['PresenceOfSenior']
        LengthOfResidence = dom['Records'][0]['LengthOfResidence']
        OwnRent = dom['Records'][0]['OwnRent']
        CreditCardUser = dom['Records'][0]['CreditCardUser']
        Occupation = dom['Records'][0]['Occupation']
        HouseholdIncome = dom['Records'][0]['HouseholdIncome']

        return Gender
    except:
        return None

要创建“性别”列，我将函数包装为lambda

df['Gender'] = df.apply(lambda row: getData(key, row['Full Name'], row['Address'], row['City'], row['State'], row['Zipcode']))

目标：我想针对您在“性别”下面看到的所有其他属性同时执行此过程，如何在Python中执行此操作。

Answer 1

您可以返回字典，然后展开一系列字典对象：

fields = ['Gender', 'DateOfBirth', etc.]

def getData(key, full_name, address, city, state, zipcode):
    try:
        # your code as before
        dom = json.loads(r.text)
        return {k: dom['Records'][0][k] for k in fields}
    # modify below: good practice to specify exactly which error(s) to catch
    except:
        return {}

然后扩展您的词典系列：

dcts = df.apply(lambda row: getData(key, row['Full Name'], row['Address'], row['City'],
                                    row['State'], row['Zipcode']), axis=1)

df = df.join(pd.DataFrame(dcts.tolist()))

根据@spaniard的评论，如果要所有可用的字段，则可以简单地使用：

return json.loads(r.text)['Records'][0]

在python中返回多个值，并将它们附加到数据框的唯一列中

1 个答案: