Problem with a join in BigQuery on 2-million-row tables

Time: 2014-04-23 16:23:50

Tags: google-bigquery

I posted this issue in the BigQuery issue tracker (please star the issue if it affects you): https://code.google.com/p/google-bigquery/issues/detail?id=89&q=join%20each

What steps will reproduce the problem?

  1. See the job job_up2I9A31Bo8NSvwD0XTWG2tBoVA in the personal real estate project.
  2. I run:

SELECT *
FROM
  (SELECT *, INTEGER(AD_STREET_NO_PROP) AS str_no_prop, INTEGER(CD_ADDR_ZIP_PROP) AS CD_ADDR_ZIP_PROP1
   FROM [acris_nyc.nyc_dof_SOA]
   WHERE NM_RECIPIENT_1 LIKE '%THE MICHAEL R. BLOOMBERG REVOCABLE%') AS s
JOIN EACH
  (SELECT *, INTEGER(hnum_lo) AS str_num, INTEGER(zip) AS zip1
   FROM [acris_nyc.nyc_dof_tc_Tentative_Assessment_Roll]
   WHERE owner LIKE '%BLOOM%' AND txcl = '1') AS a
ON s.str_no_prop = a.str_num AND s.ad_street_1_prop = a.str_name
ORDER BY NEW_FV_T DESC
LIMIT 100

What is the expected output? What do you see instead?

I would expect one record to be returned, containing 17 as str_num and "EAST 79 STREET" as str_name.

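What version of the product are you using? On what operating system?

BigQuery from the Chrome browser, as of April 22, 2014.

Please provide any additional information below.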

I tried a very similar query on a much smaller set of tables, and it works as expected:

SELECT *
FROM
  (SELECT *, INTEGER(number) AS inumber FROM [test_1.table1] WHERE owner LIKE '%BLOOM%') AS a
JOIN EACH
  (SELECT *, INTEGER(number) AS inumber FROM [test_1.table2] WHERE owner LIKE '%BLOOM%') AS b
ON a.inumber = b.inumber AND a.street = b.street

returns

Row  a_number  a_street        a_owner               a_inumber  b_number  b_street        b_owner                             b_inumber
1    00000017  EAST 79 STREET  BLOOMBERG, MICHAEL R  17         17        EAST 79 STREET  THE MICHAEL R. BLOOMBERG REVOCABLE  17

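If I query the individual tables, each with 1 million rows, they contain the data that should match when the join is done.

Is there a way to debug the actual join operation?

Thanks.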

1 Answer:

Answer 0: (score: 2)

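Just to close the loop on this question: after investigation, it turned out that an error in the data was being masked by the automatic whitespace removal done in the browser. For details, see https://code.google.com/p/google-bigquery/issues/detail?id=89&q=join%20each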
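
To illustrate how this kind of problem can hide, here is a minimal sketch in Python using an in-memory SQLite database rather than BigQuery. The tables, the sample rows, and the trailing space in `table2.street` are all hypothetical; the tracker issue has the actual details of the data error. The point is only that a join on a padded string key silently returns nothing, while comparing raw and trimmed lengths makes the padding visible even when a rendered result table would not show it.

```python
import sqlite3

# Hypothetical miniature tables. The trailing space in table2.street is an
# assumed example of a hidden data error, not the confirmed one from the issue.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table1 (number TEXT, street TEXT, owner TEXT);
    CREATE TABLE table2 (number TEXT, street TEXT, owner TEXT);
    INSERT INTO table1 VALUES ('00000017', 'EAST 79 STREET', 'BLOOMBERG, MICHAEL R');
    INSERT INTO table2 VALUES ('17', 'EAST 79 STREET ', 'THE MICHAEL R. BLOOMBERG REVOCABLE');
""")

# Same shape as the join in the question, written in SQLite's dialect.
JOIN_SQL = """
    SELECT a.street, b.street
    FROM table1 AS a
    JOIN table2 AS b
      ON CAST(a.number AS INTEGER) = CAST(b.number AS INTEGER)
     AND {left} = {right}
"""

# Exact comparison: the padded key does not match.
exact_rows = conn.execute(
    JOIN_SQL.format(left="a.street", right="b.street")
).fetchall()

# Trimmed comparison: the match appears.
trimmed_rows = conn.execute(
    JOIN_SQL.format(left="TRIM(a.street)", right="TRIM(b.street)")
).fetchall()

# Diagnostic: list values whose raw length differs from their trimmed length.
padded = conn.execute(
    "SELECT street, LENGTH(street), LENGTH(TRIM(street)) FROM table2 "
    "WHERE LENGTH(street) <> LENGTH(TRIM(street))"
).fetchall()

print(len(exact_rows), len(trimmed_rows))
print([repr(row[0]) for row in padded])  # repr() keeps the whitespace visible
```

In BigQuery's legacy SQL, the LENGTH, LTRIM and RTRIM functions could play the same diagnostic role against the real tables, for example by selecting the join columns together with their raw and trimmed lengths.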