带有梁的子查询

时间:2018-06-12 14:43:18

标签: haskell haskell-beam

我试图在表的每一行上运行子查询。这是一个最简单的工作示例,其中包含一个表"学生"。

data StudentT  f
  = StudentT
  { _studentId   :: C f Int
  , _studentName :: C f String
  , _score       :: C f Int
  } deriving Generic

type Student = StudentT Identity
type StudentId = PrimaryKey StudentT Identity

deriving instance Show Student

instance Beamable StudentT
instance Beamable (PrimaryKey StudentT)

instance Table StudentT where
  data PrimaryKey StudentT f = StudentId (Columnar f Int) deriving Generic

data SchoolDb f
  = SchoolDb
  { _students :: f (TableEntity StudentT)
  } deriving Generic

instance Database be SchoolDb

schoolDb :: DatabaseSettings be SchoolDb
schoolDb = defaultDbSettings

我想要实现的是这样的查询:

SELECT s.id,
       s.name,
       s.score,
       (SELECT COUNT(*) FROM students AS t where s.score >= t.score) AS percentile 
FROM students as S

我的尝试如下:

main = do
  conn <- open "test.db"
  runBeamSqliteDebug putStrLn conn $ do
    (students :: [(Student, Int)]) <- runSelectReturningList $ select tablePercentile
    liftIO $ mapM_ print students


tablePercentile :: Q _ _ _ _
tablePercentile = do
  student <- all_ (_students schoolDb)
  let percentile =  subquery_ $ aggregate_ (const countAll_) $ filter_ (\s -> _score s <=. (_score student)) (all_ (_students schoolDb))
  return (student, percentile)

有人能指出我正确的方向吗?

编辑:这是完整的错误消息。我认为subquery_返回QGenExpr,因此我将其放入let语句中,而不是绑定它(<-)。这简化了错误消息。

src/Main.hs:52:71: error:
    • Couldn't match type ‘Database.Beam.Query.Internal.QNested s0’
                     with ‘Database.Beam.Query.QueryInaccessible’
      Expected type: Q SqliteSelectSyntax
                       SchoolDb
                       Database.Beam.Query.QueryInaccessible
                       (StudentT
                          (QExpr
                             Database.Beam.Sqlite.Syntax.SqliteExpressionSyntax
                             (Database.Beam.Query.Internal.QNested s0)),
                        QGenExpr
                          QValueContext
                          (Database.Beam.Backend.SQL.SQL92.Sql92SelectTableExpressionSyntax
                             (Database.Beam.Backend.SQL.SQL92.Sql92SelectSelectTableSyntax
                                SqliteSelectSyntax))
                          s0
                          Int)
        Actual type: Q SqliteSelectSyntax
                       SchoolDb
                       (Database.Beam.Query.Internal.QNested s0)
                       (StudentT
                          (QExpr
                             (Database.Beam.Backend.SQL.SQL92.Sql92SelectTableExpressionSyntax
                                (Database.Beam.Backend.SQL.SQL92.Sql92SelectSelectTableSyntax
                                   SqliteSelectSyntax))
                             (Database.Beam.Query.Internal.QNested s0)),
                        QGenExpr
                          QValueContext
                          (Database.Beam.Backend.SQL.SQL92.Sql92SelectTableExpressionSyntax
                             (Database.Beam.Backend.SQL.SQL92.Sql92SelectSelectTableSyntax
                                SqliteSelectSyntax))
                          s0
                          Int)
    • In the first argument of ‘select’, namely ‘tablePercentile’
      In the second argument of ‘($)’, namely ‘select tablePercentile’
      In a stmt of a 'do' block:
        (students :: [(Student, Int)]) <- runSelectReturningList
                                            $ select tablePercentile
   |
52 |     (students :: [(Student, Int)]) <- runSelectReturningList $ select tablePercentile
   |                                                                       ^^^^^^^^^^^^^^^

1 个答案:

答案 0 :(得分:1)

这是我第一次使用Beam,我发现更容易,而不是在这里修复代码,从头开始,使用Your Headers like that作为参考:

tablePercentile =
  aggregate_ (\(student, student') -> (group_ (_studentId student), countAll_))
    . filter_ (\(student, student') -> (_score student <=. _score student'))
    $ (,) <$> all_ (_students schoolDb) <*> all_ (_students schoolDb)

这相当于表格与其自身的内部联接,filter_设置了连接条件,aggregate_处理分组和计数。请注意,此查询仅检索学生ID,而不是完整记录。这是因为通常不可能从GROUP BY - 使用查询获得聚合和用于分组的列。解决这个问题的一种方法是使用子查询来传递ID:

tablePercentile = do
  (sid, cou) <- aggregate_ (\(student, student') -> (group_ (_studentId student), countAll_))
    . filter_ (\(student, student') -> (_score student <=. _score student'))
    $ (,) <$> all_ (_students schoolDb) <*> all_ (_students schoolDb)
  (\student -> (student, cou))
    <$> filter_ (\student -> _studentId student ==. sid) (all_ (_students schoolDb))
-- N.B.: The last line of the do-block might be written as
-- (,) <$> filter_ (\student -> _studentId student ==. sid) (all_ (_students schoolDb)) <*> pure cou

这可以按预期工作:

sqlite> SELECT * from Students;
Id|Name|Score
1|Alice|9
2|Bob|7
3|Carol|6
4|David|8
5|Esther|10
6|Francis|6
GHCi> :main
SELECT "t1"."id" AS "res0", "t1"."name" AS "res1", "t1"."score" AS "res2", "t0"."res1" AS "res3" FROM (SELECT "t0"."id" AS "res0", COUNT(*) AS "res1" FROM "students" AS "t0" INNER JOIN "students" AS "t1" WHERE ("t0"."score")<=("t1"."score") GROUP BY "t0"."id") AS "t0" INNER JOIN "students" AS "t1" WHERE ("t1"."id")=("t0"."res0");
-- With values: []
(StudentT {_studentId = 1, _studentName = "Alice", _score = 9},2)
(StudentT {_studentId = 2, _studentName = "Bob", _score = 7},4)
(StudentT {_studentId = 3, _studentName = "Carol", _score = 6},6)
(StudentT {_studentId = 4, _studentName = "David", _score = 8},3)
(StudentT {_studentId = 5, _studentName = "Esther", _score = 10},1)
(StudentT {_studentId = 6, _studentName = "Francis", _score = 6},6)

在结束语中,根据我的理解,代码中的错误与尝试比较(<=.)条件中无法比较的内容有关。如果percentile被注释掉,您的原始代码(使用filter_的monadic绑定)将编译。它可能与我提到的GROUP BY问题有关,但我不确定。