My PySpark DataFrame has the following schema:
schema = spark_df.printSchema()
I want to add one more StructField to the schema, so that the new schema looks like:
root
|-- field_1: double (nullable = true)
|-- field_2: double (nullable = true)
|-- field_3 (nullable = true)
|-- field_4: double (nullable = true)
|-- field_5: double (nullable = true)
|-- field_6: double (nullable = true)
I know I can manually create a new_schema like the one below:
root
|-- field_1: double (nullable = true)
|-- field_1: double (nullable = true)
|-- field_2: double (nullable = true)
|-- field_3 (nullable = true)
|-- field_4: double (nullable = true)
|-- field_5: double (nullable = true)
|-- field_6: double (nullable = true)
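This works for a handful of fields, but it is not practical if I have hundreds of them. So I am wondering whether there is a more elegant way to add a new field to the beginning of the schema. Thanks!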
Answer 0: (score: 7)
You can copy the existing fields and prepend the new field: