我有这样的文件收藏:
{
"_id" : ObjectId("5c0685fd6afbd73b80f45338"),
"page_id" : "1234",
"category_list" : [
"football",
"sport"
],
"time_broadcast" : "09:13"
}
{
"_id" : ObjectId("5c0685fd6afbd7355f45338"),
"page_id" : "1234",
"category_list" : [
"sport",
"handball"
],
"time_broadcast" : "09:13"
}
{
"_id" : ObjectId("5c0694ec6afbd74af41ea4af"),
"page_id" : "123456",
"category_list" : [
"news",
"updates"
],
"time_broadcast" : "09:13"
}
....
now = datetime.datetime.now().time().strftime("%H:%M")
我想要的是:当“ time_broadcast”等于“ now”时,我得到每个“ page_id”的不同“ category_list”的列表。
输出内容如下:
{
{
"page_id" : "1234",
"category_list" : ["football", "sport", "handball"]
},
{
"page_id" : "123456",
"category_list" : ["news", "updates"]
}
}
我曾这样尝试过:
category_list = db.users.find({'time_broadcast': now}).distinct("category_list")
但这给我作为不同值的输出列表,但是
所有“ page_id”中的
: ["football", "sport", "handball","news", "updates"]
不是按page_id分类的列表。
请帮忙吗?
谢谢
答案 0 :(得分:1)
您需要编写汇总管道
$match
-按条件过滤文档$group
-按关键字段对文档进行分组$addToSet
-聚集唯一元素$project
-以要求的格式进行项目$reduce
-通过$concatArrays
将数组的数组减少为数组汇总查询
db.tt.aggregate([
{$match : {"time_broadcast" : "09:13"}},
{$group : {"_id" : "$page_id", "category_list" : {$addToSet : "$category_list"}}},
{$project : {"_id" : 0, "page_id" : "$_id", "category_list" : {$reduce : {input : "$category_list", initialValue : [], in: { $concatArrays : ["$$value", "$$this"] }}}}}
]).pretty()
结果
{ "page_id" : "123456", "category_list" : [ "news", "updates" ] }
{
"page_id" : "1234",
"category_list" : [
"sport",
"handball",
"football",
"sport"
]
}
如果需要,您可以通过$sort
管道添加page_id