Pandas - loop and write multiple rows

Time: 2019-04-10 17:09:23

Tags: python pandas for-loop

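Right now, the loop in my code keeps overwriting the same row. How do I move on to the next row?

Desired result: for each link the user enters, the data is written to its own row.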

from bs4 import BeautifulSoup
import urllib.request
import pandas as pd


def get_bullets(urls):
    urls = urls.split(",")
    for url in urls:
        page = urllib.request.urlopen(url)
        soup = BeautifulSoup(page, 'lxml')
        sku = url.split('/')[5]
        content = soup.find('div', class_='js-productHighlights product-highlights c28 fs14 js-close')
        bullets = content.find_all('li', class_='top-section-list-item')
        bullets_text = '\n'.join([bullet.text for bullet in bullets])

        temp_df = pd.DataFrame([[sku, bullets_text]], columns=['sku', 'bullets'])
        temp_df.to_csv('book2.csv', index=False)

get_bullets(input('enter urls'))

The user input is: https://www.bhphotovideo.com/c/product/1473086-REG/canon_3453c001_eos_rebel_sl3_dslr.html,https://www.bhphotovideo.com/c/product/1346734-REG/canon_eos_6d_mark_ii.html

Thanks!

1 answer:

Answer 0 (score: 0)

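You are writing to the CSV on every pass through the loop. Since to_csv opens the file in write mode by default, each call replaces whatever the previous iteration wrote, so only the last URL's row survives. Maybe store the result of each iteration in a list, then concatenate the results and write to disk once?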

def get_bullets(urls):
    urls = urls.split(",")
    dfs = []
    for url in urls:
        # do loop stuff
        temp_df = ...
        dfs.append(temp_df)
    df = pd.concat(dfs, ignore_index=True)
    df.to_csv('book2.csv', index=False)
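
For completeness, here is a minimal runnable sketch of the same idea that skips the network calls. It assumes the scraping step has already produced a list of (sku, bullets) pairs; the function name rows_to_csv, the sample list, and the placeholder bullet strings are made up for illustration and are not from the question. The SKUs are the ones that url.split('/')[5] yields for the two URLs above.

```python
import pandas as pd


def rows_to_csv(scraped, path):
    """Collect one row per product, then write the CSV a single time.

    `scraped` is a hypothetical stand-in for the per-URL scraping results:
    a list of (sku, list_of_bullet_strings) pairs.
    """
    rows = []
    for sku, bullets in scraped:
        # One dict per URL; bullets are joined with newlines as in the question.
        rows.append({"sku": sku, "bullets": "\n".join(bullets)})

    # Build the DataFrame once, after the loop, so every URL keeps its own row.
    df = pd.DataFrame(rows, columns=["sku", "bullets"])
    df.to_csv(path, index=False)
    return df


# Placeholder data instead of live requests (bullet text is invented).
sample = [
    ("1473086-REG", ["first bullet", "second bullet"]),
    ("1346734-REG", ["another bullet"]),
]

if __name__ == "__main__":
    print(rows_to_csv(sample, "book2.csv"))
```

If you would rather keep writing inside the loop, to_csv also accepts mode='a' to append, in which case you would pass header=False on every call after the first so the column names are not repeated.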