读取熊猫的json文件的网址无效

时间:2020-07-02 15:38:48

标签: json pandas

我想在熊猫中使用URL读取json文件,但这会引发一些错误,我认为这是我给出的相关路径,请查看代码和url。我没有在堆栈溢出中找到它,所以问了它。这可能是重复的,但请帮助我。

https://bugs.chromium.org/p/chromium/issues/detail?id=1101804

json_url在下面:

https://github.com/jackiekazil/data-wrangling/blob/master/data/chp3/data-text.json

错误消息:

data_df = pd.read_json('https://github.com/jackiekazil/data-wrangling/blob/master/data/chp3/data-text.json')

2 个答案:

答案 0 :(得分:1)

您需要以下URL:https://raw.githubusercontent.com/jackiekazil/data-wrangling/master/data/chp3/data-text.json

raw.githubusercontent.com提供在Github存储库中找到的未处理文件。您在问题中发布的链接不会为您提供您感兴趣的原始JSON文件;相反,它会加载网页本身。 this answer.

中也有介绍

使用上述URL,您的代码对我来说效果很好。

答案 1 :(得分:1)

  • 您将无法直接通过熊猫读取文件
  • 您将需要下载它
  • 顺便说一句,下面的功能可用于从正确的URL下载任何文件。
    • 该功能需要将URL和目录保存到其中。
import request
from pathlib import Path
import pandas as pd

def create_dir_save_file(dir_path: Path, url: str):
    """
    Check if the path exists and create it if it does not.
    Check if the file exists and download it if it does not.
    """
    if not dir_path.parents[0].exists():  # if directory doesn't exist
        dir_path.parents[0].mkdir(parents=True)  # create directory
        print(f'Directory Created: {dir_path.parents[0]}')
    else:
        print('Directory Exists')
        
    if not dir_path.exists():  # if file doesn't exist
        r = requests.get(url, allow_redirects=True)  # get file
        open(dir_path, 'wb').write(r.content)  # write file
        print(f'File Created: {dir_path.name}')
    else:
        print('File Exists')
        

data_dir = Path.cwd()  # current working dir or use Path('e:/some_path') to specify a location
url = 'https://raw.githubusercontent.com/jackiekazil/data-wrangling/master/data/chp3/data-text.json'
file_name = url.split('/')[-1]
data_path = data_dir / file_name  # local path to data once downloaded
create_dir_save_file(data_path, url)  # call function to download file

df = pd.json_normalize(data_path)  create dataframe

# display(df.head())

                           Indicator PUBLISH STATES  Year             WHO region World Bank income group               Country         Sex  Display Value  Numeric Low High Comments
0   Life expectancy at birth (years)      Published  1990                 Europe             High-income               Andorra  Both sexes             77     77.0                  
1   Life expectancy at birth (years)      Published  2000                 Europe             High-income               Andorra  Both sexes             80     80.0                  
2  Life expectancy at age 60 (years)      Published  2012                 Europe             High-income               Andorra      Female             28     28.0                  
3  Life expectancy at age 60 (years)      Published  2000                 Europe             High-income               Andorra  Both sexes             23     23.0                  
4   Life expectancy at birth (years)      Published  2012  Eastern Mediterranean             High-income  United Arab Emirates      Female             78     78.0