如何在不插入任何记录的情况下通过配置单元表架构推断实木复合地板架构?

时间:2018-11-05 03:15:55

标签: hadoop hive parquet

现在给定一个带有其架构的配置单元表,即:

>>> df
        Date  Transaction
0  6/15/2006        -4.27
1  6/16/2006        -2.27
2  6/19/2006        -6.35

df['Date'] = pd.to_datetime(df['Date'],format='%m/%d/%Y').dt.strftime('%Y%m%d')

>>> df
       Date  Transaction
0  20060615        -4.27
1  20060616        -2.27
2  20060619        -6.35

如何在不插入任何记录的情况下推断其实木复合地板架构?

实木复合地板架构如下:

hive> show create table nba_player;
OK
CREATE TABLE `nba_player`(
  `id` bigint, 
  `player_id` bigint, 
  `player_name` string, 
  `admission_time` timestamp, 
  `nationality` string)
ROW FORMAT SERDE 
  'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' 
STORED AS INPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat' 
OUTPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
LOCATION
  'hdfs://endpoint:8020/user/hive/warehouse/nba_player'
TBLPROPERTIES (
  'transient_lastDdlTime'='1541140811')
Time taken: 0.022 seconds, Fetched: 16 row(s)

0 个答案:

没有答案