如何使用Idealista API获取房地产数据?

时间:2016-10-13 14:32:37

标签: python oauth2

我一直在尝试使用网站Idealista(https://www.idealista.com/)的API来检索房地产数据的信息。

由于我不熟悉OAuth2,到目前为止还没能获得令牌。我刚刚获得了api密钥,秘密以及如何挂载http请求的一些基本信息。

我很感激这个API功能的一个例子(最好是Python),或者一些关于处理OAuth2和Python的更一般的信息。

4 个答案:

答案 0 :(得分:3)

经过几天的研究,我想出了一个基本的python代码,用于从Idealista API中检索房地产数据。

def get_oauth_token():
http_obj = Http()
url = "https://api.idealista.com/oauth/token"
apikey= urllib.parse.quote_plus('Provided_API_key')
secret= urllib.parse.quote_plus('Provided_API_secret')
auth = base64.encode(apikey + ':' + secret)
body = {'grant_type':'client_credentials'}
headers = {'Content-Type': 'application/x-www-form-urlencoded;charset=UTF-8','Authorization' : 'Basic ' + auth}
resp, content = http_obj.request(url,method='POST',headers=headers, body=urllib.parse.urlencode(body))
return content

此函数将返回带有OAuth2令牌的JSON和以秒为单位的会话时间。然后,要查询API,它将非常简单:

def search_api(token):
http_obj = Http()
url = "http://api.idealista.com/3.5/es/search?center=40.42938099999995,-3.7097526269835726&country=es&maxItems=50&numPage=1&distance=452&propertyType=bedrooms&operation=rent"
headers = {'Authorization' : 'Bearer ' + token}
resp, content = http_obj.request(url,method='POST',headers=headers)
return content

这次我们会在内容var中找到我们正在寻找的数据,再次作为JSON。

答案 1 :(得分:0)

这是我的代码,改进了#3 ......运行正常!为了我!!!! 只把你的apikey和你的密码(秘密)......

import pandas as pd
import json
import urllib
import requests as rq
import base64

def get_oauth_token():
    url = "https://api.idealista.com/oauth/token"    
    apikey= 'your_api_key' #sent by idealista
    secret= 'your_password'  #sent by idealista
    auth = base64.b64encode(apikey + ':' + secret)
    headers = {'Content-Type': 'application/x-www-form-urlencoded;charset=UTF-8' ,'Authorization' : 'Basic ' + auth}
    params = urllib.urlencode({'grant_type':'client_credentials'})
    content = rq.post(url,headers = headers, params=params)
    bearer_token = json.loads(content.text)['access_token']
    return bearer_token

def search_api(token, url):  
    headers = {'Content-Type': 'Content-Type: multipart/form-data;', 'Authorization' : 'Bearer ' + token}
    content = rq.post(url, headers = headers)
    result = json.loads(content.text)['access_token']
    return result

country = 'es' #values: es, it, pt
locale = 'es' #values: es, it, pt, en, ca
language = 'es' #
max_items = '50'
operation = 'sale' 
property_type = 'homes'
order = 'priceDown' 
center = '40.4167,-3.70325' 
distance = '60000'
sort = 'desc'
bankOffer = 'false'

df_tot = pd.DataFrame()
limit = 10

for i in range(1,limit):
    url = ('https://api.idealista.com/3.5/'+country+'/search?operation='+operation+#"&locale="+locale+
           '&maxItems='+max_items+
           '&order='+order+
           '&center='+center+
           '&distance='+distance+
           '&propertyType='+property_type+
           '&sort='+sort+ 
           '&numPage=%s'+
           '&language='+language) %(i)  
    a = search_api(get_oauth_token(), url)
    df = pd.DataFrame.from_dict(a['elementList'])
    df_tot = pd.concat([df_tot,df])

df_tot = df_tot.reset_index()

答案 2 :(得分:0)

我发现了一些错误。至少我不能运行它。 我相信,我对此有所改善:

import pandas as pd
import json
import urllib
import requests as rq
import base64

def get_oauth_token(): 

    url = "https://api.idealista.com/oauth/token"    

    apikey= 'your_api_key' #sent by idealist
    secret= 'your_password' #sent by idealista
    apikey_secret = apikey + ':' + secret
    auth = str(base64.b64encode(bytes(apikey_secret, 'utf-8')))[2:][:-1]

    headers = {'Authorization' : 'Basic ' + auth,'Content-Type': 'application/x-www-form- 
    urlencoded;charset=UTF-8'}
    params = urllib.parse.urlencode({'grant_type':'client_credentials'}) #,'scope':'read'
    content = rq.post(url,headers = headers, params=params)
    bearer_token = json.loads(content.text)['access_token']

    return bearer_token


def search_api(token, URL):  
    headers = {'Content-Type': 'Content-Type: multipart/form-data;', 'Authorization' : 'Bearer ' + token}
    content = rq.post(url, headers = headers)
    result = json.loads(content.text)

    return result

答案 3 :(得分:0)

从那以后就不能将其标记为正确答案

auth = base64.encode(apikey + ':' + secret)
body = {'grant_type':'client_credentials'}
headers = {'Content-Type': 'application/x-www-form-urlencoded;charset=UTF-8','Authorization' : 'Basic ' + auth}

会给您TypeError:

can only concatenate str (not "bytes") to str 

因为base64encode返回一个字节类型的对象...

确实,Idealista API在文档方面非常有限,但是我认为这是一种更好的方法,因为我不使用不必要的库(仅本机):

#first request
message = API_KEY + ":" + SECRET
auth = "Basic " + base64.b64encode(message.encode("ascii")).decode("ascii")

headers_dic = {"Authorization" : auth, 
               "Content-Type" : "application/x-www-form-urlencoded;charset=UTF-8"}

params_dic = {"grant_type" : "client_credentials",
              "scope" : "read"}



r = requests.post("https://api.idealista.com/oauth/token", 
                  headers = headers_dic, 
                  params = params_dic)

这仅适用于python请求和base64模块...

致谢