如何将数据附加到python和beutifulsoup中循环生成的数据框中

时间:2017-10-19 03:28:19

标签: python python-3.x beautifulsoup

下面的循环假设在一个数据帧中添加多个表的行(html页面)。循环工作正常,它逐个为每个表创建一个数据帧,但它也从我想要修复的数据帧中替换以前表的数据。它应该将每个表的数据附加到同一个数据帧中,它不应该从数据帧中替换先前表的数据。 Plase帮我解决这个问题。

column_headers = ['state', 'sr_no', 'district_name', 'country']

headers = ['district_id']

    district_link = [[li.get('href') for li in data_rows_link[i].findAll('a')]
               for i in range(len(data_rows))]

district_data_02 = []  # create an empty list to hold all the data

for i in range(len(data_rows)):  # for each table row
    district_row = []  # create an empty list for each pick/player
    district_row.append("a")

    # for each table data element from each table row
    for li in data_rows[i].findAll('li'):
        # get the text content and append to the district_row
        district_row.append(li.getText())

    # then append each pick/player to the district_data matrix
    district_data_02.append(district_row)

district_data == district_data_02

#dataframe - districtlist
districtlist = pd.DataFrame(district_data ,columns=column_headers)

districtid = pd.DataFrame(district_link, columns=headers)

#df_row_merged = pd.concat([df, df1])

#dataframe - districtid
final_districtlist =pd.concat([districtlist, districtid], axis=1)

0 个答案:

没有答案