Question

我有一个文本很多的文件。不幸的是，在这些点之前或之后还存在一些带有一些空格的URL。示例：http://www .test27d .com/site1

如何替换这些空格，以便仅纠正URL（而不纠正其他文本，因为有时必须在一个点之前或之后有一个空格）。

Answer 1

找到所有与网址条件匹配的字符串，以http开头，然后通过删除空格进行翻译

import re
a='http://www .test27d .com/site1'
for i in re.findall('(^http://[\w\s\.\/]*)',a):
    print(i.translate(None,' '))

用于测试

list_with_statements=['http://www .test27d .com/site1', 'string_with_no_spaces', 'string has spaces']
new_list=[]
for stat in list_with_statements:
    if re.search('(^http://[\w\s\.\/]*)',stat): # can also use str.startswith()
        stat=i.translate(None,' ')
    new_list.append(stat)

没有正则表达式

list_with_statements=['http://www .test27d .com/site1', 'string_with_no_spaces', 'string has spaces .']
new_list=[]
for stat in list_with_statements:
    if stat.startswith('http'):
        stat=i.translate(None,' ')
    new_list.append(stat)
print(new_list)

输出

['http://www.test27d.com/site1', 'string_with_no_spaces', 'string has spaces']

Answer 2

尝试一下：

newstring = string.replace(' ', '')

仅删除URL中的空格

2 个答案: