使用python scrapy将项目输出到csv文件 - 如何在csv文件中输出问题

时间:2011-07-10 13:14:16

标签: python scrapy

有一个问题,我想将输出添加到csv文件,但它没有从字段名称下面开始,它按顺序放在下一行,而不是在填充csv中的playerMins项目时将其放在第2行文件。有人可以告诉我我的代码出错了吗?这是:

class EspnSpider3(BaseSpider):
    name = "espn3.org"
    allowed_domains = ["espn3.org"]
    start_urls = [
        "http://scores.espn.go.com/nba/boxscore?gameId=310502004"

    ]

    def parse(self, response):
        hxs = HtmlXPathSelector(response)
        item = EspnItem()
        rows = []
        playerName = []
        playerMins = []

        # player names 
        p_names = hxs.select('(//table[@class="mod-data"][1]/tbody/tr)//a/text()').extract()
        for p_name in p_names:
            print p_name
            yield EspnItem(playerName=p_name)

        # minutes
        p_minutes = hxs.select('(//table[@class="mod-data"][1]/tbody/tr)/td[2]').extract()
        for p_minute in p_minutes:
            print p_minute
            yield EspnItem(playerMins=p_minute)

1 个答案:

答案 0 :(得分:2)

经过大量的谷歌搜索和rtfm之后能够解决我的问题:Trying to Use an ItemExporter in Scrapy

这是我的工作代码:

def parse(self, response):
    hxs = HtmlXPathSelector(response)
    player_names = hxs.select('(//table[@class="mod-data"][1]/tbody/tr)')
    for p_name in player_names:
        l = XPathItemLoader(item=EspnItem(), selector=p_name )
        l.add_xpath('playerName', 'td[1]/a/text()')
        l.add_xpath('playerMins', 'td[2]')
        yield l.load_item()