需要一个用于Django tastypie的递归CSV序列化器

时间:2014-05-27 21:41:40

标签: python django csv serialization tastypie

我使用tastypie为我的Django应用程序创建了一个简单的GET only api。我需要提供平板CSV数据,但我的数据库结构已规范化。 Per the documentation我已使用Serializer方法实施了自定义的to_csv()课程,如下所示。

def to_csv(self, data, options=None):
    options = options or {}
    data = self.to_simple(data, options)

    raw_data = StringIO.StringIO()
    writer = csv.writer(raw_data, quotechar="'", quoting=csv.QUOTE_NONNUMERIC)

    if "meta" in data.keys():#if multiple objects are returned
        objects = data.get("objects")
        writer.writerow(objects[0].keys())

        for object in objects:
            test = object.values()
            writer.writerow(test)
    else:
        writer.writerow(data.values())

    CSVContent=raw_data.getvalue()
    return CSVContent

这很好用,除了默认情况下将任何资源呈现为JSON(当我在ModelResource ForeignKey规范中包含full = True时),所以我最终得到包含看起来像这样的嵌套JSON数据的CSV数据。

foodID,foodName,related_details
1,"apricot","{'type':'fruit', 'cost':'medium'}"
2,"beef","{'type':'animal', 'cost':'high'}"
3,"celery","{'type':'vegetable', 'cost':'low'}"

我想要的输出是

foodID,foodName,type,cost
1,"apricot","fruit","medium"
2,"beef","animal","high"
3,"celery","vegetable","low"

我知道我需要递归地应用我的序列化程序,然后在写入CSV之前合并结果,但到目前为止还没有成功。

2 个答案:

答案 0 :(得分:1)

def to_csv(self, data, options=None):
    options = options or {}
    data = self.to_simple(data, options)

    raw_data = StringIO.StringIO()
    first = True

    if "meta" in data.keys():#if multiple objects are returned
        objects = data.get("objects")

        for value in objects:
            test = {}
            self.flatten(value, test)
            if first:
                writer = csv.DictWriter(raw_data, test.keys(), quotechar="'", quoting=csv.QUOTE_NONNUMERIC)
                writer.writeheader()
                writer.writerow(test)
                first=False
            else:
                writer.writerow(test)
    else:
        test = {}
        self.flatten(data, test)
        if first:
            writer = csv.DictWriter(raw_data, test.keys(), quotechar="'", quoting=csv.QUOTE_NONNUMERIC)
            writer.writeheader()
            writer.writerow(test)
            first=False
        else:
            writer.writerow(test)
    CSVContent=raw_data.getvalue()
    return CSVContent

def flatten(self, data, odict = {}):
    if isinstance(data, list):
        for value in data:
            self.flatten(value, odict)
    elif isinstance(data, dict):
        for (key, value) in data.items():
            if not isinstance(value, (dict, list)):
                odict[key] = value
            else:
                self.flatten(value, odict)

答案 1 :(得分:0)

您可以在每行上使用以下函数to_list

def to_list(line):
    idx = -1
    for i, l in enumerate(line):
        if type(l) is str and '{' in l and '}' in l:
            idx = i
            break
    if idx != -1:
        result = line[:idx] + eval(line[idx]).values() + line[idx+1:]
    else:
        result = line
    return result

if __name__ == "__main__":
    lst = [[1,"apricot","{'type':'fruit', 'cost':'medium'}"],
           ["beef","{'type':'animal', 'cost':'high'}", 3],
           ["meat", "sugar"],
           ["{'type':'car', 'cost':'nothing'}", "something"]]
    for line in lst:
        print to_list(line)

所以对于以下列表:

1, "apricot", "{'type':'fruit', 'cost':'medium'}"
"beef", "{'type':'animal', 'cost':'high'}", 3
"meat", "sugar"
"{'type':'car', 'cost':'nothing'}", "something"

你会得到:

1, 'apricot', 'medium', 'fruit'
'beef', 'high', 'animal', 3
'meat', 'sugar'
'nothing', 'car', 'something'

正如您所看到的那样,它不依赖于JSON字符串位置上的元素数量。