BigQuery给出了#34;响应太大"对小表的COUNT(*)

时间:2013-01-14 16:23:42

标签: google-bigquery

我有一个包含以下7个每日汇总表的数据集,每个汇总表都很小:

summary_broad_20130104_hourly         199 k rows    8395 KB
summary_broad_20130105_hourly         198 k rows    8347 KB
summary_broad_20130106_hourly         197 k rows    8319 KB
summary_broad_20130107_hourly         194 k rows    8190 KB
summary_broad_20130108_hourly         197 k rows    8338 KB
summary_broad_20130109_hourly         201 k rows    8490 KB
summary_broad_20130110_hourly         202 k rows    8534 KB

在所有这些表(04-10)上执行简单的select count(*)时,我得到“响应太大”:

$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly,summary_broad_20130107_hourly,summary_broad_20130108_hourly,summary_broad_20130109_hourly,summary_broad_20130110_hourly"
BigQuery error in query operation: Error processing job '987928363675:job_bb504ce6ea31416db3f8762d3c244d93': Response too large to return.

但是,如果我将查询限制为表04-06:

,我会得到成功的响应
$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly"
+--------+
|  f0_   |
+--------+
| 594721 |
+--------+

但如果我延伸到04-07表,那么我可以再次重新发出错误:

$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly,summary_broad_20130107_hourly"
BigQuery error in query operation: Error processing job '987928363675:job_22ea222b4b0549a98bb9e90f1a2ac539': Response too large to return.

这是一个错误,还是我做错了什么?查询的数据很小(04-07案例中约为40MB),我希望select count(*)是一个节点本地聚合,中间数据量可以忽略不计。

1 个答案:

答案 0 :(得分:0)

事实上,这是一个错误。我们正在调查它,可以在本地重复它,并希望很快得到修复。