with open('sentence.txt') as txtfile:
string = str(txtfile.read())
list_of_str = string.split()
new_list = []
for d in range(len(list_of_str)):
if not (list_of_str[d].isalpha() or list_of_str[d].isalnum()):
print(list_of_str[d-1], list_of_str[d:])
new_list.append(str(list_of_str[d-1]) + str(list_of_str[d]) + str(list_of_str[d+1]))
else:
new_list.append(list_of_str[d])
print(new_list)
Output: ['OnlineMyAwesome', 'Awesome', 'Company', 'billing', 'billing@example', 'example', 'example.com', 'com', 'Contractor', 'Invoice', 'Invoice#000015', '000015', 'Acme', 'Projects', 'Projects-Taxable', 'Taxable', 'Product', 'Contractor', 'Invoice', 'Summary', 'Account', 'Information', 'Don', 'Test', 'don', 'don@example', 'example', 'example.com', 'com', 'Contractor', 'Invoice', 'Date', 'Date:10', '10', '10/26', '26', '26/2016', '2016', 'Amount', 'Due', 'Due$21', '21']
简单来说,我需要删除特殊字符前后的空格。您也可以分享一个很好的资源来学习
Expanded widgets must be placed directly inside Flex widgets.
起初我尝试使用它,但我认为正则表达式可以提供帮助
谢谢
答案 0 :(得分:0)
是的,您可以使用正则表达式轻松地解决此问题,而不是使用当前代码。
您可以使用此正则表达式,
([@.#$\/:-]) ?
(空格后跟具有特殊字符的字符集,后跟一个可选空格。您可以根据需要在该字符集中添加更多字符。)
此正则表达式在字符集中捕获一个空格,后跟一个字符,然后是可选空格,并将其替换为在组1中捕获的字符。
示例python代码,
import re
s = 'My Awesome Company billing @ example . com Contractor Invoice # 000015 Acme Projects - Taxable Product Contractor Invoice Summary Account Information Don Test don @ example . com Contractor Invoice Date : 10 / 26 / 2016 Amount Due $ 21 .'
s = re.sub(' ([@.#$\/:-]) ?',r'\1', s)
print(s)
给出以下输出,
My Awesome Company billing@example.com Contractor Invoice#000015 Acme Projects-Taxable Product Contractor Invoice Summary Account Information Don Test don@example.com Contractor Invoice Date:10/26/2016 Amount Due$21.
让我知道这是否适合您。