Python CSV分组列可列出混合元素的字典

时间:2019-02-06 20:29:56

标签: python pandas jython-2.7

我正在为Websphere开发一个jython脚本,它将接受sys.argv作为list of dict来执行进一步的处理-

在将CSV数据转换为元组列表的字典混合列表时,我需要帮助-

输入CSV-

cluster_name,pool_name,min,max,inactive_time,description,action
Clst1,WebContainer,25,25,60000,Revisit,modify
Clst3,WebContainer,50,50,60000,revisit,modify
Clst6,WebContainer,50,50,60000,revisit,modify
Clst1,ORB.thread.pool,,,,,delete
Clst3,ORB.thread.pool,,,,,delete`

我正在尝试使用熊猫对列进行分组,但是无法创建混合元素字典

需要在对象下方(混合元素的字典列表)

[
 {cluster_name:'Clst1',
  pool_name:[
         (WebContainer,25,25,60000,Revisit,modify),
         (ORB.thread.pool,,,,,delete)]},
 {cluster_name:'Clst3',
  pool_name:[
         (WebContainer,50,50,60000,revisit,modify), 
         (ORB.thread.pool,,,,,delete)]},
 {cluster_name:'Clst6',
  pool_name:[
         (WebContainer,50,50,60000,revisit,modify)
        ]}
]

这样我就可以将此对象作为sys.argv来使用jython脚本。

1 个答案:

答案 0 :(得分:1)

尝试:

from io import StringIO
import pandas as pd

csvfile = StringIO("""cluster_name,pool_name,min,max,inactive_time,description,action
Clst1,WebContainer,25,25,60000,Revisit,modify
Clst3,WebContainer,50,50,60000,revisit,modify
Clst6,WebContainer,50,50,60000,revisit,modify
Clst1,ORB.thread.pool,,,,,delete
Clst3,ORB.thread.pool,,,,,delete""")

df = pd.read_csv(csvfile)

s = df.set_index(['cluster_name']).apply(tuple, axis=1).rename('pool_name').groupby(level=0).agg(list).reset_index()

s.to_json(orient='records')

输出:

[{"cluster_name":"Clst1","pool_name":[["WebContainer",25.0,25.0,60000.0,"Revisit","modify"],["ORB.thread.pool",null,null,null,null,"delete"]]},{"cluster_name":"Clst3","pool_name":[["WebContainer",50.0,50.0,60000.0,"revisit","modify"],["ORB.thread.pool",null,null,null,null,"delete"]]},{"cluster_name":"Clst6","pool_name":[["WebContainer",50.0,50.0,60000.0,"revisit","modify"]]}]