当蜘蛛运行时,Scrapy电子邮件发送一次

时间:2014-08-08 08:41:35

标签: python email scrapy scrapy-spider

当我完成抓取页面时,我试图在gmail中发送电子邮件..当我定义函数send_mail并将其传递给下面时,在日志中,它说send_mail("一些消息",& #34;刮板报告") NameError:name' send_mail'没有定义..当蜘蛛完成抓取时,我可以发送gmail。当我在def parse(self,response)方法中传递send_mail函数时,它会尝试阻止我的gmail,因为循环抓取..

 class ekantipurSpider(XMLFeedSpider):
    name= "ekantipur"
    allowed_domains = ["ekantipur.com"]
    start_urls = [
    'http://www.ekantipur.com/archive/'

  ]


send_mail("some message", "Scraper Report")

        def parse(self, response):

          hxs = HtmlXPathSelector(response) # The xPath selector
          titles=hxs.select('//div[@id = "archive-content-wrapper"]//ul/li')
          items = []
          for titles in titles:
           item = NewsItem()
           item['title']=titles.select('h6/a/text()').extract()[0]
           item['link']=titles.select('h6/a/@href').extract()[0]
           item['description']=titles.select('p/text()').extract()[0]
           item = Request(item['link'],meta={'item':item},callback=self.parse_detail)
           items.append(item)



        return items


     def parse_detail(self,response):
      item = response.meta['item']
      sel = HtmlXPathSelector(response)
      detail = sel.select('//div[@class = "main_wrapper_left"]')
      item['details'] = escape(''.join(detail.select('p/text()').extract()))
      locationDate = item['details'].split(':',1)[0]
      item['location']= locationDate.split(",",1)[0]
      item['published_date'] =    escape(''.join(detail.select('p[last()]/text()').extract()))


      return item


def send_mail(self, message, title):
  print "Sending mail..........."
  gmailUser = 'manthali2014@gmail.com'
  gmailPassword = 'rameshkc8  '
  recipient = 'kcramesh8@gmail.com'

 msg = MIMEMultipart()
 msg['From'] = gmailUser
 msg['To'] = recipient
 msg['Subject'] = title
 msg.attach(MIMEText(message))

 mailServer = smtplib.SMTP('smtp.gmail.com', 587)
 mailServer.ehlo()
 mailServer.starttls()
 mailServer.ehlo()
 mailServer.login(gmailUser, gmailPassword)
 mailServer.sendmail(gmailUser, recipient, msg.as_string())
 mailServer.close()
 print "Mail sent"

1 个答案:

答案 0 :(得分:1)

如果他们在同一个班级,您需要添加self.来发送邮件(self.send_mail(args))。如果它们不是,则需要在定义函数后调用send_mail()。