Date difference less than 15 minutes in Hive

Asked: 2012-07-21 02:45:29

Tags: hadoop mapreduce hive hiveql

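Below is my query. In its last line I am trying to check whether the difference between the two dates is within 15 minutes, but the query fails every time I run it.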

SELECT TT.BUYER_ID , COUNT(*) FROM
(SELECT testingtable1.buyer_id, testingtable1.item_id, testingtable1.created_time from (select user_id, prod_and_ts.product_id as product_id, prod_and_ts.timestamps as timestamps from testingtable2 LATERAL VIEW explode(purchased_item) exploded_table as prod_and_ts where to_date(from_unixtime(cast(prod_and_ts.timestamps as BIGINT))) = '2012-07-09') prod_and_ts RIGHT OUTER JOIN (SELECT buyer_id, item_id, rank(buyer_id), created_time, UNIX_TIMESTAMP(created_time)
FROM (
    SELECT buyer_id, item_id, created_time
    FROM testingtable1
    where to_date(from_unixtime(cast(UNIX_TIMESTAMP(created_time) as int))) = '2012-07-09'
    DISTRIBUTE BY buyer_id
    SORT BY buyer_id, created_time desc
) a
WHERE rank(buyer_id) < 5) testingtable1 ON (testingtable1.item_id = prod_and_ts.product_id AND testingtable1.BUYER_ID = prod_and_ts.USER_ID 
AND abs(datediff(testingtable1.created_time,FROM_UNIXTIME(cast(prod_and_ts.timestamps as BIGINT)))) <= 15) where prod_and_ts.product_id IS NULL ORDER BY testingtable1.buyer_id, testingtable1.created_time desc) TT GROUP BY TT.BUYER_ID;

I always get this exception:

FAILED: Error in semantic analysis: line 10:144 Both Left and Right Aliases
Encountered in Join 15

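Is there something wrong with my query? Or is it not possible in Hive to compute the difference between two dates in minutes? Any suggestions would be appreciated.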

1 Answer:

Answer 0: (score: 0)

I think the problem is in your join. From the Hive language manual:

  

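  Only equality joins, outer joins, and left semi joins are supported in Hive. Hive does not support join conditions that are not equality conditions as it is very difficult to express such conditions as a map/reduce job.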
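
One possible workaround, as a rough sketch rather than a tested fix for the full query: keep only equality conditions in the ON clause and apply the time window in a WHERE clause. Because the original query uses the outer join to find rows with no match (product_id IS NULL), simply moving the condition to WHERE would change the result, so the sketch first collects the rows that do match within 15 minutes and then excludes them. Note also that datediff returns a difference in days, not minutes, so the sketch compares epoch seconds instead. The names t1, t2 and ts are simplified stand-ins for the subqueries in the question, and created_time is assumed to be a 'yyyy-MM-dd HH:mm:ss' string.

```sql
-- Hypothetical simplified tables (assumptions, not the asker's schema):
--   t1(buyer_id, item_id, created_time)  created_time as 'yyyy-MM-dd HH:mm:ss'
--   t2(user_id, product_id, ts)          ts as epoch seconds
SELECT t1.buyer_id, t1.item_id, t1.created_time
FROM t1
LEFT OUTER JOIN (
    -- Rows of t1 that have a t2 match within 15 minutes.
    -- The ON clause holds equality conditions only; the range test is in WHERE.
    SELECT DISTINCT a.buyer_id, a.item_id, a.created_time
    FROM t1 a
    JOIN t2 b
      ON (a.item_id = b.product_id AND a.buyer_id = b.user_id)
    WHERE abs(unix_timestamp(a.created_time) - cast(b.ts AS BIGINT)) <= 15 * 60
) m
  ON (t1.buyer_id = m.buyer_id
      AND t1.item_id = m.item_id
      AND t1.created_time = m.created_time)
-- Keep only the t1 rows that had no match inside the window.
WHERE m.buyer_id IS NULL;
```

The second join is again a pure equality join, so it stays within what the manual describes as supported. The date filters, the rank limit and the final GROUP BY from the question would still need to be layered on top.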