Question

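I am merging several Excel files into a single CSV file. One column (Length) in the source files contains values with a leading single quote (e.g. '200, '50). Some values also end with a period or contain a decimal point (e.g. '200., '50., '10.3). I want to remove only the single quote from the values.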

Input

Length
=======
'2000

'100.

'10.3

Desired output

Length
=======
2000

100.

10.3

I am using the following code, but somehow it also removes the period (.) from the values. Please help.

import pandas as pd
import glob

path= input("Enter the location of files ")

GLB_DM_VER = input("Enter global DM version")

GLB_DM_ENV = input("Enter the global DM version environment")

file_list = glob.glob(path+"\*.xls")

excels = [pd.ExcelFile(name) for name in file_list] 

frames = [x.parse(x.sheet_names[2], header=0,index_col=None) for x in excels]

combined = pd.concat(frames)

combined['LENGTH'].replace(regex=True,inplace=True,to_replace=r'\'',value=r'')

combined.to_csv("STAND_2.csv", header=['Global_DM_VERSION_ID','Global_DM_VERSION_ENV','TARGET_DOMAIN','SOURCE_DOMAIN','DOMAIN_LABEL','SOURCE_VARIABLE','RAVE_LABEL','TYPE','VARIABLE_LENGTH','CONTROL_TYPE','CODELIST_OID','TARGET_VARIABLE','MANDATORY','RAVE_ORIGIN'], index=False)
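For reference, here is a minimal sketch of removing only the single quote, assuming the LENGTH values are held as text in the DataFrame. The helper name `strip_single_quotes` and the sample Series are hypothetical and used only for illustration. It does a literal (non-regex) replacement on string cells and leaves every other character, including periods, untouched.

```python
import pandas as pd


def strip_single_quotes(series: pd.Series) -> pd.Series:
    """Remove single quotes from string cells; leave other cells as they are."""
    # Hypothetical helper: only str values are modified, so numbers and
    # missing values pass through unchanged.
    return series.map(lambda v: v.replace("'", "") if isinstance(v, str) else v)


# Sample data mirroring the input shown in the question.
lengths = pd.Series(["'2000", "'100.", "'10.3"], name="LENGTH")
cleaned = strip_single_quotes(lengths)
print(cleaned.tolist())
```

In the script above, this would be applied by assigning the result back, e.g. `combined['LENGTH'] = strip_single_quotes(combined['LENGTH'])`, rather than relying on an in-place replace on the column.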

How to remove selected special characters from a DataFrame column in Python

0 answers: