Question

with open('sentence.txt') as txtfile:
    string = str(txtfile.read())
list_of_str = string.split()
new_list = []
for d in range(len(list_of_str)):
    if not (list_of_str[d].isalpha() or list_of_str[d].isalnum()):
        print(list_of_str[d-1], list_of_str[d:])
        new_list.append(str(list_of_str[d-1]) + str(list_of_str[d]) + str(list_of_str[d+1]))
    else:
        new_list.append(list_of_str[d])
print(new_list)

Output: ['Onlineï»¿MyAwesome', 'Awesome', 'Company', 'billing', 'billing@example', 'example', 'example.com', 'com', 'Contractor', 'Invoice', 'Invoice#000015', '000015', 'Acme', 'Projects', 'Projects-Taxable', 'Taxable', 'Product', 'Contractor', 'Invoice', 'Summary', 'Account', 'Information', 'Don', 'Test', 'don', 'don@example', 'example', 'example.com', 'com', 'Contractor', 'Invoice', 'Date', 'Date:10', '10', '10/26', '26', '26/2016', '2016', 'Amount', 'Due', 'Due$21', '21']

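Simply put, I need to remove the spaces before and after special characters. Could you also share a good resource for learning regular expressions?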


I initially tried the code above, but I think a regular expression could help.

Thanks

Answer 1

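Yes, you can solve this easily with a regular expression instead of your current code.

You can use this regex,

' ([@.#$\/:-]) ?' (a space, followed by a character set containing the special characters, followed by an optional space. You can add more characters to the character set as needed.)

This regex matches a space, then one character from the character set (captured in group 1), then an optional space. The whole match is replaced with the character captured in group 1, which drops the surrounding spaces.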
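To make the matching behaviour concrete, here is a small sketch that lists what the pattern matches in a short sample string and what group 1 holds for each match. The sample string and the variable names are only for illustration.

```python
import re

# Same pattern as above: a space, one special character (group 1), an optional space.
pattern = re.compile(r' ([@.#$\/:-]) ?')

sample = 'don @ example . com'

# Collect (full match, group 1) pairs to see what gets replaced by what.
matches = [(m.group(0), m.group(1)) for m in pattern.finditer(sample)]
for whole, special in matches:
    print(repr(whole), '->', repr(special))

# The leading space is required, so this string is left unchanged.
unchanged = pattern.sub(r'\1', 'a@ b')
print(unchanged)
```

Note that, as written, the pattern only fires when a space precedes the special character. If you also need to handle input such as 'a@ b', the pattern would have to be adjusted.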


Sample Python code,

import re
s = 'My Awesome Company billing @ example . com Contractor Invoice # 000015 Acme Projects - Taxable Product Contractor Invoice Summary Account Information Don Test don @ example . com Contractor Invoice Date : 10 / 26 / 2016 Amount Due $ 21 .'
# Raw strings keep the backslashes intact for the regex engine
s = re.sub(r' ([@.#$\/:-]) ?', r'\1', s)
print(s)

which gives the following output,

My Awesome Company billing@example.com Contractor Invoice#000015 Acme Projects-Taxable Product Contractor Invoice Summary Account Information Don Test don@example.com Contractor Invoice Date:10/26/2016 Amount Due$21.

Let me know if this works for you.
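If you want to apply this to the file from the question, a minimal sketch could look like the following. The function names tighten and tighten_file are made up for this example. The 'ï»¿' in your output is a UTF-8 byte-order mark that was decoded with the wrong encoding, so the sketch assumes the file is UTF-8 and opens it with 'utf-8-sig', which drops a leading BOM if one is present.

```python
import re

# Pattern from the answer: a space, one special character (group 1), an optional space.
SPECIAL = re.compile(r' ([@.#$\/:-]) ?')


def tighten(text):
    """Remove the space before and the optional space after each special character."""
    return SPECIAL.sub(r'\1', text)


def tighten_file(path='sentence.txt'):
    """Read a text file (assumed to be UTF-8) and return its cleaned-up contents."""
    # 'utf-8-sig' strips a leading byte-order mark if the file has one.
    with open(path, encoding='utf-8-sig') as txtfile:
        return tighten(txtfile.read())
```

Calling tighten_file() returns a single string. If you still need a list of words as in your original code, call .split() on the result.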

Remove whitespace before and after special characters and join in Python
