Question

提前感谢您的帮助。

我有两个实体，Human和Chimp。每个都有一组指标，可以包含MetricBlock的子类，例如CompleteBloodCount（包含字段WHITE_CELLS，RED_CELLS，PLATELETS）。

所以我的对象模型看起来像（原谅ASCII艺术）：

---------   metrics    ---------------      ----------------------
| Human | ---------->  | MetricBlock | <|-- | CompleteBloodCount |
---------              ---------------      ----------------------
                        ^
---------    metrics    |
| Chimp | --------------
---------

使用以下表格实现：

Chimp (id, …)
Human (id, …)

MetricBlock (id, dtype)
CompleteBloodCount (id, white_cells, red_cells, platelets)
CholesterolCount (id, hdl, ldl)

ChimpToMetricBlock(chimp_id, metric_block_id)
HumanToMetricBlock(human_id, metric_block_id)

所以人类知道它的度量块，但是度量块不知道它的人或黑猩猩。

我想在SQLAlchemy中编写一个查询来查找特定人类的所有CompleteBloodCounts。在SQL中，我可以编写如下内容：

SELECT cbc.id
FROM complete_blood_count cbc
WHERE EXISTS (
   SELECT 1
   FROM human h
       INNER JOIN human_to_metric_block h_to_m on h.id = h_to_m.human_id
   WHERE
       h_to_m.metric_block_id = cbc.id
)

我正在努力在SQLAlchemy中写这个。我相信correlate（），any（）或别名连接可能会有所帮助，但是MetricBlock不知道它的Human或Chimp是我的绊脚石。

有没有人对如何编写此查询有任何建议？或者，是否还有其他策略可以使用SQLAlchemy更好地定义模型？

感谢您的协助。

Python 2.6
SQLAlchemy 0.7.4
Oracle 11g

编辑：

HumanToMetricBlock定义为：

humanToMetricBlock = Table(
    "human_to_metric_block",
    metadata,
    Column("human_id", Integer, ForeignKey("human.id"),
    Column("metric_block_id", Integer, ForeginKey("metric_block.id")
)

per the manual

Answer 1

每个灵长类动物都应该有一个唯一的ID，无论它们是什么类型的灵长类动物。我不确定为什么每组属性（MB，CBC，CC）都是单独的表，但我认为它们有多个维度（灵长类动物），如时间，否则我只会有一个巨大的表。

因此，我将以下列方式构建此问题：创建一个父对象灵长类动物并从中获取人类和黑猩猩。此示例使用单表继承，但您可能希望根据其属性使用联接表继承。

class Primate(Base):
    __tablename__ = 'primate'
    id = Column(Integer, primary_key=True)
    genus = Column(String)
    ...attributes all primates have...
    __mapper_args__ = {'polymorphic_on': genus, 'polymorphic_identity': 'primate'}

class Chimp(Primate):
    __mapper_args__ = {'polymorphic_identity': 'chimp'}
    ...attributes...

class Human(Primate):
    __mapper_args__ = {'polymorphic_identity': 'human'}
    ...attributes...

class MetricBlock(Base):
    id = ...

然后创建一个多对多表（您可以使用关联代理）：

class PrimateToMetricBlock(Base):
    id = Column(Integer, primary_key=True) # primary key is needed!
    primate_id = Column(Integer, ForeignKey('primate.id'))
    primate = relationship('Primate') # If you care for relationships. 
    metricblock_id = Column(Integer, ForeignKey('metric_block.id')
    metricblock = relationship('MetricBlock')

然后我会像这样构造查询（注意on子句是没有必要的，因为SQLAlchemy可以自动推断关系，因为没有歧义）：

query = DBSession.query(CompleteBloodCount).\
    join(PrimateToMetricBlock, PrimateToMetricBlock.metricblock_id == MetricBlock.id)

如果您想按灵长类动物类型进行过滤，请加入灵长类表并过滤：

query = query.join(Primate, Primate.id == PrimateToMetricBlock.primate_id).\
    filter(Primate.genus == 'human')

否则，如果您知道灵长类动物的ID（primate_id），则无需其他连接：

query = query.filter(PrimateToMetricBlock.primate_id == primate_id)

如果您只检索一个对象，请使用以下命令结束查询：

return query.first()

否则：

return query.all()

形成这样的模型应该可以消除任何混淆，实际上使一切变得更简单。如果我遗失了什么，请告诉我。

SQLAlchemy：单向关系，相关子查询

1 个答案: