我使用的是MongoDB v3.2,我正在使用native nodejs驱动程序v2.1。在大型数据集(1mil +文档)上运行聚合管道时,我遇到以下错误:
'aggregation result exceeds maximum document size (16MB)'
这是我的聚合管道代码:
var eventCollection = myMongoConnection.db.collection('events');
var cursor = eventCollection.aggregate([
{
$match: {
event_type_id: {$eq: 89012}
}
},
{
$group: {
_id: "$user_id",
score: {$sum: "$points"}
}
},
{
$sort: {
score: -1
}
}
],
{
cursor: {
batchSize: 500
},
allowDiskUse: true,
explain: false
}, function () {
});
我尝试的事情:
//Using cursor event listeners. None of the on listeners seem to work. Always get error about 16mb.
cursor.on("data", function (data) {
console.log("Some data: ", data);
});
cursor.on("end", function (data) {
console.log("End of data: ", data);
});
//Using forEach. Which I thought would allow for >16mb because it's used in conjunction with the batchSize and cursor.
cursor.forEach(function (item) {
})
我在其他答案(How could I write aggregation without exceeds maximum document size?)中看到我需要通过游标返回结果,那么我该如何正确地做到这一点呢?我似乎无法让它发挥作用。有关batchSize应该是什么的任何建议吗?
我使用本机mongodb包 - https://github.com/mongodb/node-mongodb-native用于nodejs项目而不是mongo命令行。
答案 0 :(得分:2)
好的我明白了。它没有工作,因为我传入一个回调函数作为聚合方法中的最后一个参数。通过传递null,它允许流按预期工作。更改如下所示:
var cursor = eventCollection.aggregate([
{
$match: {
event_type_id: {$eq: 89012}
}
},
{
$group: {
_id: "$user_id",
score: {$sum: "$points"}
}
},
{
$sort: {
score: -1
}
}
],
{
cursor: {
batchSize: 500
},
allowDiskUse: true,
explain: false
}, null);