网站抓取到CSV项目问题

时间:2019-12-05 03:37:23

标签: python pandas web-scraping

我正在尝试开发一款应用程序,该应用程序可以抓取我最喜欢的十大与太空相关的股票价格。但是

  • 我的代码遇到了一些麻烦,我是新手。
  • 一旦我可以使用它,我想将其放入一个csv文件中
  • 并使用它制作条形图,我希望获得一些帮助和建议。
  • 我也在Anaconda中这样做:

我的代码:

from bs4 import BeautifulSoup 
#grequests is a unique library that allows you to use many urls with ease
#must install qrequest in annacode use : conda install -c conda-forge grequests
#if you know a better way to do this, please let me know
import grequests

#scraping my top ten favorite space companies, attempted to pick companies with pure play interest in space


urls = ['https://finance.yahoo.com/quote/GILT/', 'https://finance.yahoo.com/quote/LORL?p=LORL&.tsrc=fin-srch', 'https://finance.yahoo.com/quote/I?p=I&.tsrc=fin-srch' , 'https://finance.yahoo.com/quote/VSAT?p=VSAT&.tsrc=fin-srch', 'https://finance.yahoo.com/quote/RTN?p=RTN&.tsrc=fin-srch', 'https://finance.yahoo.com/quote/UTX?ltr=1', 'https://finance.yahoo.com/quote/TDY?ltr=1', 'https://finance.yahoo.com/quote/ORBC?ltr=1', 'https://finance.yahoo.com/quote/SPCE?p=SPCE&.tsrc=fin-srch', 'https://finance.yahoo.com/quote/BA?p=BA&.tsrc=fin-srch',]  
unsent_request = (grequests.get(url) for url in urls)

results = grequests.map(unsent_request)


def  parsePrice():
    soup = BeautifulSoup(r.text,"html")
    price=soup.find_all('div',{'class':'Trsdu(0.3s) Fw(b) Fz(36px) Mb(-4px) D(ib)" data-reactid="52">4.1500'})[0].find('span').text
    return price

    #Trying to figure out how to add this 
    # Random delay
    duration = random.uniform(1, 4)
    print('Random delay for %.2f seconds' % duration)
    time.sleep(duration)

    print('Scraping website for', urls)
    urls = hrefs[stock]
    soup = BeautifulSoup(page.text, 'html.parser')

此后,我得到此错误:

File "<ipython-input-17-834b9d6c20e5>", line 3
    duration = random.uniform(1, 4)
    ^
IndentationError: unexpected indent

while True:
    print('current stock price: '+str(parsePrice()))

#add to csv file 
df_indu = pd.DataFrame(
    L['Top Ten Space Stocks'],
    columns=['stock name', 'stock price', 'date of listing'])
df_indu.to_csv('spacestocks.csv', index=False, sep='|')

这是到目前为止我的整个程序。我正在尝试添加时间延迟并抓取所有这些股票的价格以放置在csv文件中以图形显示...谢谢

1 个答案:

答案 0 :(得分:0)

您的parsePrice()函数定义的代码缩进。在函数结尾处,您声明return price,从而结束了函数定义。现在,您的以下代码不应再缩进。这就是为什么之后出现所有意外的缩进错误的原因。