使用Django ORM,可以执行queryset.objects.annotate(Count('queryset_objects', gte=VALUE))
之类的操作。抓住我的漂移?
这是一个用于说明可能答案的简单示例:
在Django网站中,内容创建者提交文章,并且普通用户查看(即阅读)所述文章。文章可以发表(即可供所有人阅读),也可以草稿模式。描述这些要求的模型是:
class Article(models.Model):
author = models.ForeignKey(User)
published = models.BooleanField(default=False)
class Readership(models.Model):
reader = models.ForeignKey(User)
which_article = models.ForeignKey(Article)
what_time = models.DateTimeField(auto_now_add=True)
我的问题是:如何获取所有发布的文章,按照过去30分钟的独特读者排序?即我想要计算每个发表的文章在过去半小时内获得的不同(独特)视图的数量,然后生成按这些不同视图排序的文章列表。
我试过了:
date = datetime.now()-timedelta(minutes=30)
articles = Article.objects.filter(published=True).extra(select = {
"views" : """
SELECT COUNT(*)
FROM myapp_readership
JOIN myapp_article on myapp_readership.which_article_id = myapp_article.id
WHERE myapp_readership.reader_id = myapp_user.id
AND myapp_readership.what_time > %s """ % date,
}).order_by("-views")
这引发了错误:语法错误在或附近" 01" (其中" 01"是额外的日期时间对象)。继续下去并不多。
答案 0 :(得分:88)
from django.db.models import Count, Case, When, IntegerField
Article.objects.annotate(
numviews=Count(Case(
When(readership__what_time__lt=treshold, then=1),
output_field=IntegerField(),
))
)
<强>说明:强>
通过您的文章进行的正常查询将使用numviews
字段进行注释。该字段将被构造为CASE / WHEN表达式,由Count包装,对于读者匹配标准将返回1,对于不匹配标准的读者将返回NULL
。 Count将忽略空值并仅计算值。
对于最近未被查看的文章,您将获得零,并且您可以使用该numviews
字段进行排序和过滤。
PostgreSQL背后的查询将是:
SELECT
"app_article"."id",
"app_article"."author",
"app_article"."published",
COUNT(
CASE WHEN "app_readership"."what_time" < 2015-11-18 11:04:00.000000+01:00 THEN 1
ELSE NULL END
) as "numviews"
FROM "app_article" LEFT OUTER JOIN "app_readership"
ON ("app_article"."id" = "app_readership"."which_article_id")
GROUP BY "app_article"."id", "app_article"."author", "app_article"."published"
如果我们只想跟踪唯一查询,我们可以将区别添加到Count
中,并使我们的When
子句返回值,我们希望区分开来。
from django.db.models import Count, Case, When, CharField, F
Article.objects.annotate(
numviews=Count(Case(
When(readership__what_time__lt=treshold, then=F('readership__reader')), # it can be also `readership__reader_id`, it doesn't matter
output_field=CharField(),
), distinct=True)
)
这会产生:
SELECT
"app_article"."id",
"app_article"."author",
"app_article"."published",
COUNT(
DISTINCT CASE WHEN "app_readership"."what_time" < 2015-11-18 11:04:00.000000+01:00 THEN "app_readership"."reader_id"
ELSE NULL END
) as "numviews"
FROM "app_article" LEFT OUTER JOIN "app_readership"
ON ("app_article"."id" = "app_readership"."which_article_id")
GROUP BY "app_article"."id", "app_article"."author", "app_article"."published"
您可以使用raw
来执行由较新版本的django创建的SQL语句。显然,没有使用raw
查询数据的简单优化方法(即使使用extra
,注入所需的JOIN
子句也存在一些问题。
Articles.objects.raw('SELECT'
' "app_article"."id",'
' "app_article"."author",'
' "app_article"."published",'
' COUNT('
' DISTINCT CASE WHEN "app_readership"."what_time" < 2015-11-18 11:04:00.000000+01:00 THEN "app_readership"."reader_id"'
' ELSE NULL END'
' ) as "numviews"'
'FROM "app_article" LEFT OUTER JOIN "app_readership"'
' ON ("app_article"."id" = "app_readership"."which_article_id")'
'GROUP BY "app_article"."id", "app_article"."author", "app_article"."published"')
答案 1 :(得分:19)
对于django&gt; = 2.0,您可以在聚合函数中使用Conditional aggregation with a filter
argument:
from datetime import timedelta
from django.utils import timezone
from django.db.models import Count, Q # need import
Article.objects.annotate(
numviews=Count(
'readership__reader__id',
filter=Q(readership__what_time__gt=timezone.now() - timedelta(minutes=30)),
distinct=True
)
)