获取Django最近租借的电影

时间:2012-01-13 17:01:36

标签: python sql django

所以想象你有以下两个表:

CREATE movies (
    id int,
    name varchar(255),
    ...
    PRIMARY KEY (id)
);

CREATE movieRentals (
    id int,
    movie_id int,
    customer varchar(255),
    dateRented datetime,
    ...
    PRIMARY KEY (id)
    FOREIGN KEY (movie_id) REFERENCES movies(id)
);

直接使用SQL,我将此查询视为:

(
    SELECT movie_id, count(movie_id) AS rent_count
    FROM movieRentals
    WHERE  dateRented > [TIME_ARG_HERE]
    GROUP BY movie_id
)
UNION
(
    SELECT id AS movie_id, 0 AS rent_count
    FROM movie
    WHERE movie_id NOT IN
    (
        SELECT movie_id
        FROM movieRentals
        WHERE dateRented > [TIME_ARG_HERE]
        GROUP BY movie_id
    )
)

(按照给定日期的ID,计算所有电影租赁的数量)

显然,这些表的Django版本是简单的模型:

class Movies(models.Model):
    name = models.CharField(max_length=255, unique=True)

class MovieRentals(models.Model):
    customer = models.CharField(max_length=255)
    dateRented = models.DateTimeField()
    movie = models.ForeignKey(Movies)

但是,将其转换为等效查询似乎很困难:

timeArg = datetime.datetime.now() - datetime.timedelta(7,0)
queryset = models.MovieRentals.objects.all()
queryset = queryset.filter(dateRented__gte=timeArg)
queryset = queryset.annotate(rent_count=Count('movies'))

querysetTwo = models.Movies.objects.all()
querysetTwo = querysetTwo.filter(~Q(id__in=[val["movie_id"] for val in queryset.values("movie_id")]))
# Somehow need to set the 0 count. For now force it with Extra:
querysetTwo.extra(select={"rent_count": "SELECT 0 AS rent_count FROM app_movies LIMIT 1"})

# Now union these - for some reason this doesn't work:
# return querysetOne | querysetTwo
# so instead
set1List = [_getMinimalDict(model) for model in queryset]
# Where getMinimalDict just extracts the values I am interested in.
set2List = [_getMinimalDict(model) for model in querysetTwo]
return sorted(set1List + set2List, key=lambda x: x['rent_count'])

然而,虽然这种方法似乎有效,但速度非常慢。我错过了更好的方式吗?

2 个答案:

答案 0 :(得分:1)

我必须遗漏一些明显的东西。为什么不能做以下工作:

queryset = models.MovieRentals.filter(dateRented__gte=timeArg).values('movies').annotate(Count('movies')).aggregate(Min('movies__count'))

此外,子句可以链接(如上面的代码所示),因此没有理由不断地将queryset变量设置为中间查询集。

答案 1 :(得分:1)

使用直接SQL,这将更容易表达如下:

SELECT movie.id, count(movieRentals.id) as rent_count
FROM movie
LEFT JOIN movieRentals ON (movieRentals.movie_id = movie.id AND dateRented > [TIME_ARG_HERE])
GROUP BY movie.id

自[TIME_ARG_HERE]以来,左连接将为每个未启用的电影生成一行,但在这些行中,movieRentals.id列将为NULL。

然后,COUNT(movieRentals.id)将计算它们存在的所有租约,如果只有NULL值则返回0.