PySpark数据框在自定义联接条件下联接

时间:2019-09-17 23:04:38

标签: dataframe join pyspark

我有df_a, df_b,我想按自定义条件加入他们:coalesce(df_a.id, 0) + 1 == df_b.id,我应该如何编写代码?

我尝试了df_joined = df_a.join(df_b, coalesce(df_a.id, 0) + 1 == df_b.id

但是出现错误:Invalid argument, not a string or column: 0 of type <type 'int'>. For column literals, use 'lit', 'array', 'struct' or 'create_map' function

0 个答案:

没有答案