优化hsqldb

时间:2016-07-15 13:23:19

标签: hsqldb

我有一个700K +记录表,一个简单的GROUP BY查询执行时间超过35秒。我对如何优化它没有想法。

SELECT TOP 10 called_dn, COUNT(called_dn) FROM reportview.calls_out GROUP BY called_dn;
  • 我在此处添加TOP 10以限制网络传输引起的延迟。
  • 我有called_dn的索引(hsqldb好像没有使用它)。
  • called_dn不可为空。
  • reportview.calls_out是一个缓存表。

这是表格脚本:

CREATE TABLE calls_out (
  pk_global_call_id INTEGER GENERATED BY DEFAULT AS SEQUENCE seq_global_call_id NOT NULL,
  sys_global_call_id VARCHAR(65),
  call_start TIMESTAMP WITH TIME ZONE NOT NULL,
  call_end TIMESTAMP WITH TIME ZONE NOT NULL,
  duration_interval INTERVAL HOUR TO SECOND(0),
  duration_seconds INTEGER,
  call_segments INTEGER,
  calling_dn VARCHAR(25) NOT NULL,
  called_dn VARCHAR(25) NOT NULL,
  called_via_dn VARCHAR(25),
  fk_end_status INTEGER NOT NULL,
  fk_incoming_queue INTEGER,
  call_start_year INTEGER,
  call_start_month INTEGER,
  call_start_week INTEGER,
  call_start_day INTEGER,
  call_start_hour INTEGER,
  call_start_minute INTEGER,
  call_start_second INTEGER,
  utc_created TIMESTAMP WITH TIME ZONE,
  created_by VARCHAR(25),
  utc_modified TIMESTAMP WITH TIME ZONE,
  modified_by VARCHAR(25),
  PRIMARY KEY (pk_global_call_id),
  FOREIGN KEY (fk_incoming_queue)
    REFERENCES lookup_incoming_queue(pk_id),
  FOREIGN KEY (fk_end_status)
    REFERENCES lookup_end_status(pk_id));

我是否仍然坚持这种表现,或者有什么我可能试图加快这个问题?

编辑:如果有帮助,这是查询计划:

isDistinctSelect=[false]
isGrouped=[true]
isAggregated=[true]
columns=[  COLUMN: REPORTVIEW.CALLS_OUT.CALLED_DN not nullable
  COUNT  arg=[   COLUMN: REPORTVIEW.CALLS_OUT.CALLED_DN nullable]
  [range variable 1
    join type=INNER
    table=CALLS_OUT
    cardinality=771855
    access=FULL SCAN
    join condition = [index=SYS_IDX_SYS_PK_10173_10177]]]
groupColumns=[COLUMN: REPORTVIEW.CALLS_OUT.CALLED_DN]
offset=[VALUE = 0, TYPE = INTEGER]
limit=[VALUE = 10, TYPE = INTEGER]
PARAMETERS=[]
SUBQUERIES[]

1 个答案:

答案 0 :(得分:0)

好吧,因为在这种情况下似乎没有办法避免全列扫描。

仅仅是为了参考这个问题的未来灵魂,这是我最终采取的行动:

在原始表中创建由INSERT / DELETE个触发器维护的摘要表。这与合适的索引相结合,并在我的查询中使用LIMIT USING INDEX子句可以产生非常好的性能。