Downloaded files are corrupted

Time: 2019-09-14 06:46:20

Tags: python selenium-webdriver beautifulsoup

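I wrote a script to download certain files from multiple pages of a website. The downloads appear to complete normally, but every file is corrupted. I have tried other ways of downloading the files, but I always end up with corrupted files, and every one of them is only 4 KB in size.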
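
A uniform size of about 4 KB suggests that the same small response is being saved each time, possibly an HTML error or login page rather than the file itself. That is an assumption, not something the post confirms. A minimal sketch for checking one of the saved files, where `looks_like_html` is a hypothetical helper and the file name in the usage comment is a placeholder:

```python
def looks_like_html(path, probe_size=512):
    """Return True if the file at `path` appears to start with HTML markup.

    Hypothetical diagnostic helper: it only inspects the first bytes of the
    file and does not validate the real file format.
    """
    with open(path, "rb") as f:
        head = f.read(probe_size)
    # Skip a UTF-8 byte order mark if present, then leading whitespace.
    if head.startswith(b"\xef\xbb\xbf"):
        head = head[3:]
    head = head.lstrip().lower()
    return head.startswith((b"<!doctype html", b"<html"))


# Usage (placeholder file name):
#   if looks_like_html("report.pdf"):
#       print("The saved file is an HTML page, not the expected document")
```

If the check returns True, opening the saved file in a text editor usually shows what the server actually sent.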

What do I need to change in the code to fix the download problem?

import re

import requests
from bs4 import BeautifulSoup
from selenium.webdriver.common.by import By

# driver, url and pageCounter are set up earlier in the script (not shown).
while pageCounter < 3:
    soup_level1 = BeautifulSoup(driver.page_source, 'lxml')
    # Each report block holds the links to its attachments.
    for div in soup_level1.find_all('div', attrs={'class': 'financial-report-download ng-scope'}):
        links = div.find_all('a', attrs={'class': 'ng-binding'}, href=re.compile("FinancialStatement"))
        for a in links:
            driver.find_element(By.XPATH, "//div[@ng-repeat = 'attachments in res.Attachments']").click()
            files = [url + a['href']]
            for file in files:
                file_name = file.split('/')[-1]

                print("Downloading file:%s" % file_name)

                # create response object
                r = requests.get(file, stream=True)

                # download started
                with open(file_name, 'wb') as f:
                    for chunk in r.iter_content(chunk_size=1024*1024):
                        if chunk:
                            f.write(chunk)

                print("%s downloaded!\n" % file_name)
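
One possible cause, offered as an assumption rather than a confirmed diagnosis: `requests.get` starts a fresh HTTP session that knows nothing about the cookies held by the Selenium browser, so the server may answer with a small HTML page instead of the file. The sketch below copies the browser cookies into a `requests.Session` and refuses to save a response whose `Content-Type` is HTML. `copy_selenium_cookies` and `save_if_not_html` are hypothetical helper names; `driver`, `file` and `file_name` are the variables from the script above.

```python
def copy_selenium_cookies(session, selenium_cookies):
    """Copy cookies from Selenium's driver.get_cookies() into a requests.Session.

    `selenium_cookies` is the list of dicts returned by driver.get_cookies().
    """
    for cookie in selenium_cookies:
        session.cookies.set(
            cookie["name"],
            cookie["value"],
            domain=cookie.get("domain", ""),
            path=cookie.get("path", "/"),
        )
    return session


def save_if_not_html(response, file_name, chunk_size=1024 * 1024):
    """Stream `response` to `file_name` unless the server sent an HTML page.

    Returns True if the file was written, False if the response was HTML.
    Assumption: the expected downloads are never HTML documents themselves.
    """
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    content_type = response.headers.get("Content-Type", "")
    if "text/html" in content_type.lower():
        return False
    with open(file_name, "wb") as f:
        for chunk in response.iter_content(chunk_size=chunk_size):
            if chunk:  # skip keep-alive chunks
                f.write(chunk)
    return True


# Usage inside the download loop (requires `import requests`):
#   session = copy_selenium_cookies(requests.Session(), driver.get_cookies())
#   r = session.get(file, stream=True)
#   if not save_if_not_html(r, file_name):
#       print("Server returned HTML instead of a file for %s" % file_name)
```

If the site also checks headers such as `User-Agent` or `Referer`, those would need to be set on the session as well; whether this site does so is not known from the post.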

0 answers:

No answers