Question

我有两个SQLite数据库，包含我需要使用SQLalchemy加入的表。由于原因，我无法将所有表组合到一个SQLite数据库中。我正在使用SQLalchemy ORM。我无法在网上找到符合我特定情况的任何解决方案。

我的问题原则上与SQLAlchemy error query join across database相同，但原始海报的问题是使用与我的用例不符的其他解决方案解决的。

我向Stackoverflow的聪明人提问：

我想模拟以下SQL查询：

SELECT DISTINCT g.gene_symbol, o.orthofinder_id FROM eukarya.genes AS g JOIN annotations.orthofinder AS o ON g.gene_id=o.gene_id;

使用附加了两个数据库文件的SQliteStudio，此查询可以正常工作。

我目前用于描述元数据的代码：

eukarya_engine = create_engine('sqlite:///eukarya_db.sqlite3')
annotations_engine = create_engine('sqlite:///eukarya_annotations_db.sqlite3')

meta = MetaData()  # This allows me to define cross database foreign keys

Eukarya = declarative_base(bind=eukarya_engine, metadata=meta)
Annotations = declarative_base(bind=annotations_engine, metadata=meta)
# I did the above in the hopes that by binding the engines this way,
# would percolate through the schema, and sqlalchemy would be able
# figure out which engine to use for each table.

class Genes(Eukarya):
  """SQLalchemy object representing the Genes table in the Eukarya database."""
  __tablename__ = 'genes'
  gene_id = Column(Integer, primary_key=True, unique=True)
  gene_symbol = Column(String(16), index=True)
  taxonomy_id = Column(Integer, ForeignKey(Species.taxonomy_id), index=True)
  original_gene_id = Column(String)

class Orthofinder(Annotations):
  """SQLalchemy object representing the Orthofinder table in the Annotations database."""
  __tablename__ = 'orthofinder'
  id = Column(Integer,primary_key=True, autoincrement=True)
  gene_id = Column(Integer, ForeignKey(Genes.gene_id), index=True)
  orthofinder_id = Column(String(10), index=True)

Session = sessionmaker()
session = Session(bind=eukarya_engine)

print(session.query(Genes.gene_symbol,Orthofinder.orthofinder_id).
      join(Orthofinder).all().statement)

最后一个print语句返回：

sqlalchemy.exc.OperationalError: (sqlite3.OperationalError) no such table: orthofinder [SQL: 'SELECT genes.gene_symbol AS genes_gene_symbol, orthofinder.orthofinder_id AS orthofinder_orthofinder_id \nFROM genes JOIN orthofinder ON genes.gene_id = orthofinder.gene_id']

如果我能以某种方式将两个数据库引擎绑定到一个会话，我相信我的麻烦已经结束了。但是怎么样？对于连接到同一引擎的两个数据库（例如MySQL数据库中的两个数据库），我可以添加__table_args__ = {'schema': 'annotations'}（根据Cross database join in sqlalchemy），但我无法在SQLite的情况下解决这个问题。

我更喜欢一种解决方案，允许我的代码用户构建查询，而不必知道每个表所在的数据库。

请帮忙！非常感谢提前！

Answer 1

回答我自己的问题（感谢Ilja寻找解决方案）：

我可以像这样定义引擎：

engine  = create_engine('sqlite://',echo=True)  # generate in mem database to attach mutitple sqlite databases to.
engine.execute("attach database 'eukarya_db.sqlite3' as eukarya;")
engine.execute("attach database 'eukarya_annotations_db.sqlite3' as annotations;")

然后添加

__table_args__ = {'schema': 'eukarya'}

或

__table_args__ = {'schema': 'annotations'}

到我的班级。

像魅力一样！

使用SQLalchemy ORM和SQLite

1 个答案: