pandas .str.extract does not work on a string

Time: 2019-12-26 10:02:28

Tags: python-3.x string pandas extract

The content of my cell looks like this:

{'Code':'YewrH','Owner':'yyy.xxx@cccc.com','Environment_Type':'Prod','ID':'23000','InfoSeC':'Internal', 'PDDA':'Basic','Level':'Basic','Contact':'czcxc.zxcxc@yyy.com'}

The cell is in a column called tags. So I tried:

df = pd.read_excel('input.xlsx', header=0)

# Data as it is (DateTime objects and Timestamp objects)
data = df.to_numpy() 
# Or convert all entries to strings
data = df.astype(str).to_numpy()

df['counter'] = 1
counter = df[['counter','Date']].groupby('Date').transform(lambda x: x.cumsum()).to_numpy()

I expected the regular expression to return 23000, but all I get is NaN. Can anyone help?

I found that this works:

tmp['tags'] = tmp['tags'].astype(str)
tmp['APM_id'] = tmp['tags'].str.extract(r'ID\':\s\'(\d*)\'', expand=True)
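A minimal sketch of why the string conversion may matter, assuming the cells hold Python dict objects rather than text (the question does not confirm this). The one-row frame below is hypothetical sample data. Converting a dict with astype(str) yields text such as 'ID': '23000', with a space after the colon, which is what the \s in the pattern matches. The sketch uses \s* and expand=False as illustrative choices, not the asker's exact code.

```python
import pandas as pd

# Hypothetical one-row frame; the cell is a dict object, not a string (assumption).
tmp = pd.DataFrame(
    {'tags': [{'Code': 'YewrH', 'Environment_Type': 'Prod', 'ID': '23000'}]}
)

# Convert each cell to its string form, e.g. "{'Code': 'YewrH', ..., 'ID': '23000'}".
tmp['tags'] = tmp['tags'].astype(str)

# \s* tolerates zero or more spaces after the colon;
# expand=False returns a Series, which assigns cleanly to one column.
tmp['APM_id'] = tmp['tags'].str.extract(r"'ID':\s*'(\d+)'", expand=False)

print(tmp['APM_id'].iloc[0])
```

The extracted value is a string; if a number is needed, it can be converted afterwards, for example with pd.to_numeric.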

1 Answer:

Answer 0: (score: 0)

You did not use an escape character for the colon (:). Simply do:

tmp['tags'].str.extract(r'ID\'\:\s\'(\d*)\'')
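As a quick check of the pattern outside pandas, here is a small sketch using Python's standard re module. The sample text is a shortened, hypothetical version of the cell content. In Python's regex syntax the colon is not a special character, so escaping it should not change what matches; the whitespace between the colon and the quote is the part more likely to make the difference.

```python
import re

# Shortened sample: the string form of a dict, as produced by str() or astype(str).
text = str({'Code': 'YewrH', 'ID': '23000', 'Level': 'Basic'})

# Colon escaped, as in the answer above.
escaped = re.search(r"ID'\:\s'(\d*)'", text)
# Colon left unescaped; it is an ordinary character in a regex.
unescaped = re.search(r"ID':\s'(\d*)'", text)

# \s requires exactly one whitespace character, so text without the space fails...
strict = re.search(r"ID':\s'(\d*)'", "{'ID':'23000'}")
# ...while \s* accepts zero or more, so both spellings match.
tolerant = re.search(r"ID':\s*'(\d+)'", "{'ID':'23000'}")

print(escaped.group(1), unescaped.group(1), strict, tolerant.group(1))
```

If the source text may or may not contain a space after the colon, \s* is the more forgiving choice.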