使用beautifulsoup获取电子邮件ID

时间:2014-02-21 07:22:19

标签: python beautifulsoup

我正在研究python。我正在学习beautifulsoup&我正在解析一个链接。

我的网址:

http://www.dtemaharashtra.gov.in/approvedinstitues/StaticPages/frmInstituteSummary.aspx?InstituteCode=1002

我想从该网址解析电子邮件ID 我怎么能这样做?

2 个答案:

答案 0 :(得分:1)

import requests
from bs4 import BeautifulSoup
r = requests.get("http://www.dtemaharashtra.gov.in/approvedinstitues/StaticPages/frmInstituteSummary.aspx?InstituteCode=1002")
soup = BeautifulSoup(r.text)
soup.find("span", {"id":"ctl00_rightContainer_ContentBox1_lblEMailAddress"}).text

答案 1 :(得分:1)

import urllib2
from bs4 import BeautifulSoup

html = urllib2.urlopen('http://www.dtemaharashtra.gov.in/approvedinstitues/StaticPages/frmInstituteSummary.aspx?InstituteCode=1002').read()
soup = BeautifulSoup(html)
print soup.find(id='ctl00_rightContainer_ContentBox1_lblEMailAddress').text