Question

使用Python 2.7，我遇到了以下问题：我有想要清理的网址，特别是我想摆脱“http：//”。

这有效：

>>> url = 'http://www.party.com'
>>> url.lstrip('http://')
'www.party.com'

但为什么这不起作用？

>>> url = 'http://party.com'
>>> url.lstrip('http://')
'arty.com'

它摆脱了'派对'中的'p'。

感谢您的帮助。

Answer 1

将lstrip的参数视为字符，而不是字符串。

url.lstrip('http://')从h删除所有前导t，:，/，url。

改为使用str.replace：

>>> url = 'http://party.com'
>>> url.replace('http://', '', 1)
'party.com'

如果你真正想要的是从网址获取主机名，你也可以使用urlparse.urlparse：

>>> urlparse.urlparse('http://party.com').netloc
'party.com'
>>> urlparse.urlparse('http://party.com/path/to/some-resource').netloc
'party.com'