Question

MongoDB中输出格式的聚合函数有多灵活？

数据格式：

{
        "_id" : ObjectId("506ddd1900a47d802702a904"),
        "port_name" : "CL1-A",
        "metric" : "772.0",
        "port_number" : "0",
        "datetime" : ISODate("2012-10-03T14:03:00Z"),
        "array_serial" : "12345"
}

现在我正在使用此聚合函数返回DateTime数组，指标数组和计数：

{$match : { 'array_serial' : array, 
                            'port_name' : { $in : ports},
                            'datetime' : { $gte : from, $lte : to}
                        }
                },
               {$project : { port_name : 1, metric : 1, datetime: 1}},
               {$group : { _id : "$port_name", 
                            datetime : { $push : "$datetime"},
                            metric : { $push : "$metric"},
                            count : { $sum : 1}}}

哪个好，而且非常快，但有没有办法格式化输出，所以每个日期时间/指标有一个数组？像这样：

[
    {
      "_id" : "portname",
      "data" : [
                ["2012-10-01T00:00:00.000Z", 1421.01],
                ["2012-10-01T00:01:00.000Z", 1361.01],
                ["2012-10-01T00:02:00.000Z", 1221.01]
               ]
    }
]

这将极大地简化前端，因为这是图表代码所期望的格式。

Answer 1

使用聚合框架将两个字段组合成一个值数组是可能的，但绝对不是那么简单（至少在MongoDB 2.2.0中）。

以下是一个例子：

db.metrics.aggregate(

    // Find matching documents first (can take advantage of index)
    { $match : {
        'array_serial' : array, 
        'port_name' : { $in : ports},
        'datetime' : { $gte : from, $lte : to}
    }},

    // Project desired fields and add an extra $index for # of array elements
    { $project: {
        port_name: 1,
        datetime: 1,
        metric: 1,
        index: { $const:[0,1] }
    }},

    // Split into document stream based on $index
    { $unwind: '$index' },

    // Re-group data using conditional to create array [$datetime, $metric]
    { $group: {
        _id: { id: '$_id', port_name: '$port_name' },
        data: {
            $push: { $cond:[ {$eq:['$index', 0]}, '$datetime', '$metric'] }
        },
    }},

    // Sort results
    { $sort: { _id:1 } },

    // Final group by port_name with data array and count
    { $group: {
        _id: '$_id.port_name',
        data: { $push: '$data' },
        count: { $sum: 1 }
    }}
)

Answer 2

MongoDB 2.6通过引入$map使这更容易，这允许更简单的数组转换形式：

db.metrics.aggregate([
   { "$match": {
       "array_serial": array, 
       "port_name": { "$in": ports},
       "datetime": { "$gte": from, "$lte": to }
    }},
    { "$group": {
        "_id": "$port_name",
        "data": {
            "$push": {
                "$map": {
                    "input": [0,1],
                    "as": "index",
                    "in": {
                        "$cond": [
                            { "$eq": [ "$$index", 0 ] },
                            "$datetime",
                            "$metric"
                        ]
                    }
                }
            }
        },
        "count": { "$sum": 1 }
    }}
])

与$unwind的方法非常相似，您将数组作为“输入”提供给包含两个值的地图操作，然后基本上通过$cond将这些值替换为您想要的字段值操作

这实际上删除了以前版本中所需的转换文档所需的所有管道杂耍，并将实际聚合留给了手头的作业，这基本上是按“port_name”值累积的，并且转换为数组是no更长的问题区域。

Answer 3

在没有$ push和$ addToSet的聚合框架中构建数组似乎缺乏。我之前试图让它工作，然后失败了。如果你能这样做会很棒：

data : {$push: [$datetime, $metric]}

$group中的

，但这不起作用。

此外，构建像这样的“文字”对象不起作用：

data : {$push: {literal:[$datetime, $metric]}}
or even data : {$push: {literal:$datetime}}

我希望他们最终想出一些更好的方法来按摩这类数据。

MongoDB聚合框架$ group可以返回一组值吗？

3 个答案: