Pyspark - 将数据帧写入csv

时间:2018-06-10 05:25:25

标签: python-3.x csv apache-spark pyspark apache-spark-sql

我正在尝试将pyspark数据帧写入csv文件,但我面临的问题是datetype字段被转换为IntergerType。

Spark version : 2.0.0

代码:

  df.repartition(1).write.format("com.databricks.spark.csv").mode("overwrite")\
 .option("dateFormat", "mm/dd/yyyy hh:mm:ss").option("escape", "\\").option("header", "true").save(temp_csv_file)  


**Dataframe:**

Please note Date_1 field

+---------+-----+----+----------+------------+---------+
|Store_seq|Store|Dept|    Date_1|Weekly_Sales|IsHoliday|
+---------+-----+----+----------+------------+---------+
|   273048|   28|  49|2010-03-05|    16845.12|    FALSE|
|   323620|   34|  24|2010-06-25|     2123.00|    FALSE|
|   318053|   33|  60|2012-01-20|      176.00|    FALSE|
|   149167|   16|   9|2010-02-12|    11518.21|     TRUE|
|   292767|   30|  92|2011-07-08|    47044.41|    FALSE|
+---------+-----+----+----------+------------+---------+

**csv file:**

+---------+-----+----+------+------------+---------+
|Store_seq|Store|Dept|Date_1|Weekly_Sales|IsHoliday|
+---------+-----+----+------+------------+---------+
|   273048|   28|  49| 14673|    16845.12|    false|
|   323620|   34|  24| 14785|      2123.0|    false|
|   318053|   33|  60| 15359|       176.0|    false|
|   149167|   16|   9| 14652|    11518.21|     true|
|   292767|   30|  92| 15163|    47044.41|    false|
+---------+-----+----+------+------------+---------+

0 个答案:

没有答案