Question

我有200多个json格式的报废文件，然后我想分析。我可以单独打开它们，但想循环以节省时间，因为我会经常这样做。

可以打开每个文件，但希望能够以某种格式进行循环例如

with codecs.open('c:\\project\\input*.json','r','utf-8') as f:

其中“ *”是数字.....

import codecs, json, csv, re

#read a json file downloaded with twitterscraper

with codecs.open('c:\\project\\input1.json','r','utf-8') as f:
    tweets = json.load(f,encoding='utf-b')

Answer 1

只需将文件放入文件夹中，然后像这样遍历文件夹中的文件即可。

import codecs
import json
import csv
import re
import os

files = []
for file in os.listdir("/mydir"):
    if file.endswith(".json"):
        files.append(os.path.join("/mydir", file))

for file in files:
    with codecs.open(file,'r','utf-8') as f: 
        tweets = json.load(f,encoding='utf-b')

Answer 2

添加并使用glob遍历具有特定文件模式的文件。

import glob
import codecs
import json
# ... more packages here

for file in glob.glob('c:\\project\\input*.json'):
    with codecs.open(file, 'r','utf-8') as f: 
        tweets = json.load(f, encoding='utf-b')
        #... whatever you do next with `tweets`

BTW：utf-b而不是utf-8？

遍历多个json输入文件

2 个答案: