我的单元格内容如下:
{'Code':'YewrH','Owner':'yyy.xxx@cccc.com','Environment_Type':'Prod','ID':'23000','InfoSeC':'Internal', 'PDDA':'Basic','Level':'Basic','Contact':'czcxc.zxcxc@yyy.com'}
单元格位于称为标签的列中。所以我尝试:
df = pd.read_excel('input.xlsx', header=0)
# Data as it is (DateTime objects and Timestamp objects)
data = df.to_numpy()
# Or convert all entries to strings
data = df.astype(str).to_numpy()
df['counter'] = 1
counter = df[['counter','Date']].groupby('Date').transform(lambda x: x.cumsum()).to_numpy()
我希望得到正则表达式以返回23000,但我得到的唯一信息是:NaN。有人可以帮忙吗?
我发现这可行: tmp ['tags'] = tmp ['tags']。astype(str) tmp ['APM_id'] = tmp ['tags']。str.extract(r'ID \':\ s \'(\ d *)\'',expand = True)
答案 0 :(得分:0)
您没有为:
使用转义字符。简单地做,
tmp['tags'].str.extract(r'ID\'\:\s\'(\d*)\'')