用于火花箱类的scala通用编码器

时间:2017-05-29 17:36:10

标签: scala apache-spark generics apache-spark-dataset apache-spark-encoders

如何编译此方法?奇怪的是,隐含的火花已经被导入了。

def loadDsFromHive[T <: Product](tableName: String, spark: SparkSession): Dataset[T] = {
    import spark.implicits._
    spark.sql(s"SELECT * FROM $tableName").as[T]
  }

这是错误:

Unable to find encoder for type stored in a Dataset.  Primitive types (Int, String, etc) and Product types (case classes) are supported by importing spark.implicits._  Support for serializing other types will be added in future releases.
[error]     spark.sql(s"SELECT * FROM $tableName").as[T]

1 个答案:

答案 0 :(得分:14)

根据org.apache.spark.sql.SQLImplicits的源代码,您需要类型的类型TypeTag,以便隐式Encoder存在:

import scala.reflect.runtime.universe.TypeTag
def loadDsFromHive[T <: Product: TypeTag](tableName: String, spark: SparkSession): Dataset[T] = ...