Question

I am new to PySpark, and I get an AttributeError when I run the following code.

I am using Apache Spark 2.4.3.

t=spark.read.format("hdfs:\\test\a.txt")
t.take(1)

I expected the output to be 1, but the code raises this error instead:

AttributeError: 'DataFrameReader' object has no attribute 'take'

Answer 1

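You are not using the API correctly. format() expects the name of a data source (such as "text" or "csv"), not a path, and it returns a DataFrameReader, which has no take method. A DataFrame is produced only once you call load() or a shortcut such as text().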

Here you are reading a text file, so all you need is:

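# No "val": this is Python, not Scala. text() returns a DataFrame with one row per line.
t = spark.read.text("hdfs://test/a.txt")
t.collect()  # returns every row; t.take(1) returns only the first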

See the relevant documentation.
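
If you would rather keep the format(...) style from the question, the chain can be spelled out step by step. This is only a minimal sketch: first_rows is a hypothetical helper name, not part of Spark, and it assumes you pass in an already created SparkSession.

```python
def first_rows(spark, path, n=1):
    """Return the first n rows of a text file as a list.

    Hypothetical helper for illustration; `spark` is assumed to be an
    existing SparkSession and `path` a location Spark can read.
    """
    # format() takes the data source name, not the path,
    # and returns a DataFrameReader (which has no take()).
    reader = spark.read.format("text")
    # load() is the step that actually produces a DataFrame.
    df = reader.load(path)
    # take(n) is a DataFrame method and returns a list of rows.
    return df.take(n)
```

With the session from the question, first_rows(spark, "hdfs://test/a.txt") should return a list holding one Row, with the line's content in a column named value.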