我有一个MongoDB查询,根据日期和返回计数(这是使用count: { $sum: 1 }
的5分钟窗口中的文档总数)按5分钟窗口进行分组。
如果该组中没有文档,我希望查询还为特定的5分钟窗口返回0。但是,目前看起来只返回具有正数的组。
当前查询:
const cursor = await collection.aggregate([
{ $sort : { time : 1 } },
{
$match: {
$and: [
{selector: string },
{time: {$gte: timestamp }}
]
}
},
{
$group: {
_id: {
$subtract: [
{ $subtract: [ "$time", 0 ] },
{ $mod: [
{ $subtract: [ "$time", 0 ] },
1000 * 60 * 5
]}
],
},
count: { $sum: 1 }
}
}
])
预期回复:包含总和0的文件数量的时间戳
{ _id: 1525162000000, count: 314 }
{ _id: 1523144100000, count: 0 }
{ _id: 1512155500000, count: 54 }
提前致谢!
答案 0 :(得分:1)
免责声明:我不建议在服务器端执行此操作(因此在MongoDB内部),而是在客户端处理该情况。
也就是说,这是一个针对您的问题的通用解决方案,应该可以轻松适应您的具体情况。
想象一下,您有以下文档(或示例中的聚合管道输出):
{
"category" : 1
}
{
"category" : 1
}
// note the missing { category: 2 } document here
{
"category" : 3
}
以下管道将创建空桶(因此&{34;间隙"在category
字段中的值范围中缺少的值为0的文档 - 在这种情况下为2):
var bucketSize = 1;
db.getCollection('test').aggregate({
$group: {
_id: null, // throw all documents into the same bucket
"min": { $min: "$category" }, // just to calculate the lowest
"max": { $max: "$category" }, // and the highest "category" value
"docs": { $push: "$$ROOT" } // and also keep the root documents
}
}, {
$addFields: {
"docs": { // modify the existing docs array - created in the previous stage
$concatArrays: [ // by concatenating
"$docs", // the existing docs array
{
$map: { // with some other array that will be generated
input: {
$range: [ "$min", "$max", bucketSize ] // based on the min and max values and the bucket size
},
as: "this",
in: { // but represented not as a plain number but as a document that effectively creates a bogus document
"category": "$$this", // the bogus category will be set to the respective value
"bogus": 1 // marker that allows us not to count this document in the next stage and still get a bucket from $group
}
}
}
]
}
}
}, {
$unwind: "$docs" // flatten the "docs" array which will now contain the bogus documents, too
}, {
$group: {
_id: "$docs.category", // group by category
"count": { // this is the result we are interested in
$sum: { // which will be aggregated by calculating the sum for each document of
$cond: [ // either 0 or 1 per document
{ $eq: [ "$docs.bogus", 1 ] }, // depending on whether the document should count as a result or not
0,
1
]
}
}
}
})
以上查询的输出将为:
{
"_id" : 2,
"count" : 0.0 // this is what we wanted to achieve
}
{
"_id" : 3,
"count" : 1.0 // correct number of matches
}
{
"_id" : 1,
"count" : 2.0 // correct number of matches
}