使用窗口函数在Postgres上使用SqlAlchemy限制查询

时间:2013-07-02 23:14:40

标签: python sql sqlalchemy

我正在尝试使用sqlalchemy ORM编写以下sql查询:

SELECT * FROM
   (SELECT *, row_number() OVER(w)
    FROM (select distinct on (grandma_id, author_id) * from contents) as c
    WINDOW w AS (PARTITION BY grandma_id ORDER BY RANDOM())) AS v1
WHERE row_number <= 4;

这是我到目前为止所做的:

s = Session()

unique_users_contents = (s.query(Content).distinct(Content.grandma_id,
                                                  Content.author_id)
                         .subquery())

windowed_contents = (s.query(Content,
                             func.row_number()
                             .over(partition_by=Content.grandma_id,
                                   order_by=func.random()))
                     .select_from(unique_users_contents)).subquery()

contents = (s.query(Content).select_from(windowed_contents)
            .filter(row_number >= 4)) ##  how can I reference the row_number() value?

result = contents
for content in result:
    print "%s\t%s\t%s" % (content.id, content.grandma_id,
                          content.author_id)

正如您所看到的那样,它几乎是建模的,但我不知道如何从外部查询中引用子查询的row_number()结果。我试过像windowed_contents.c.row_number这样的东西,并在窗口函数上添加label()调用,但是它没有用,在官方文档或stackoverflow中找不到任何类似的例子。

如何实现这一目标?而且,你能建议一个更好的方法来进行这个查询吗?

1 个答案:

答案 0 :(得分:19)

windowed_contents.c.row_number针对label()你是怎么做的,对我有用(注意select_entity_from()方法是SQLA 0.8.2中的新方法,这里需要0.9 vs vs 。select_from()):

from sqlalchemy import *
from sqlalchemy.orm import *
from sqlalchemy.ext.declarative import declarative_base

Base = declarative_base()

class Content(Base):
    __tablename__ = 'contents'

    grandma_id = Column(Integer, primary_key=True)
    author_id = Column(Integer, primary_key=True)


s = Session()

unique_users_contents = s.query(Content).distinct(
                            Content.grandma_id, Content.author_id).\
                            subquery('c')

q = s.query(
        Content,
        func.row_number().over(
                partition_by=Content.grandma_id,
                order_by=func.random()).label("row_number")
    ).select_entity_from(unique_users_contents).subquery()

q = s.query(Content).select_entity_from(q).filter(q.c.row_number <= 4)

print q