pyspark:类型对象'Row'没有属性'fromSeq'

时间:2016-05-27 03:26:30

标签: python dataframe pyspark

我有以下代码:

from pyspark.sql import Row
z1=["001",1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,30,41,42,43]
print z1
r1 = Row.fromSeq(z1)
print (r1)

然后我收到了错误:

AttributeError                            Traceback (most recent call last)
<ipython-input-6-fa5cf7d26ed0> in <module>()
      2 z1=["001",1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,30,41,42,43]
      3 print z1
----> 4 r1 = Row.fromSeq(z1)
      5 
      6 print (r1)

AttributeError: type object 'Row' has no attribute 'fromSeq'

任何人都知道我可能错过了什么?谢谢!

1 个答案:

答案 0 :(得分:0)

如果您不提供姓名,请使用tuple

tuple(z1)

这就是构建正确的DataFrame

所需的全部内容