Question

我有一个用例，我从SPARQL端点提取数据如下：

SPARQL = SPARQLWrapper(endpointURL)  
SPARQL.setQuery(queryString) 
SPARQL.setReturnFormat(JSON) 
results = SPARQL.query().convert()

'results'变量保存现在我想在CSV文件中写入的数据。首先，我按如下方式为CSV创建了标题：

schema = ['Age', 'Sex', 'Chest_Pain_Type', 'Trestbps', 'Chol', 'Fasting_Glucose_Level', 'Resting_ECG_Type', 'ThalachD', 'Exercise_Induced_Angina', 'OldpeakD', 'CaD', 'Slope', 'Thallium_Scintigraphy', 'Diagnosis']

然后，我使用以下代码在CSV文件中写入每行的数据：

with open('DIC.csv', 'w+') as csvfile: 
        writer = csv.writer(csvfile, delimiter=',') 
        writer.writerow([g for g in schema]) 
        for result in results["results"]["bindings"]: 
                    writer.writerow([result["Age"]["value"], 
                    result["SexTypes"]["value"],
                    result["Chest_Pain_Type"]["value"], 
                    result["trestbpsD"]["value"],
                    result["cholD"]["value"], 
                    result["Fasting_Glucose_Level"]["value"],
                    result["Resting_ECG_Type"]["value"],
                    result["thalachD"]["value"],
                    result["Exercise_Induced_Angina"]["value"],
                    result["oldpeakD"]["value"],
                    result["caD"]["value"],
                    result["Slope"]["value"],
                    result["Thallium_Scintigraphy"]["value"],
                    result["Diagnosis"]["value"]])
        file_name = csvfile.name
        csvfile.close()

在前面的代码块中， result [“SexTypes”] [“value”] 用于写入列“SexTypes”的值（即“值”）。这意味着，第一个索引是可变的，但第二个索引始终是相同的。

虽然，上面的代码工作正常但是这几乎是硬编码的，并且一旦我的SPARQL查询发生变化（例如，如果模式不同）就会失败。

我现在想让它更灵活，以便我迭代列表中的所有列，但“值”是固定的。为此，我尝试使用以下代码，最终失败：

with open('DIC.csv', 'w+') as csvfile: 
        writer = csv.writer(csvfile, delimiter=',') 
        writer.writerow([g for g in schema]) 
        r = "value"
        for result in results["results"]["bindings"]: 
            for i in range(len(schema)):
                writer.writerow([result[schema[i]][r]])
                i = i + 1       
        file_name = csvfile.name
        csvfile.close()
        return file_name

我知道，我做错了。还有什么更好的建议吗？

[也许问题值得一个更好的标题，但我找不到任何。抱歉，我的英语不好。]

Answer 1

第一个示例代码中writer.writerow()的输入是一个列表，长度与schema相同，而在第二个中，它是长度为1的列表（而不是调用writerow()一旦你在schema中每个元素调用一次。

要生成您首先使用列表理解的列表：

row = [result[column]["value"] for column in schema]

相当于：

row = [] # create empty list
for column in schema:
    row.append(result[column]["value"])

最终，只需要在results：

的循环内修改代码

...
for result in results["results"]["bindings"]: 
    row = [result[column]["value"] for column in schema]
    writer.writerow(row)
file_name = csvfile.name
...

如何迭代并将列表写为CSV标头？

1 个答案: