从python中的一组链接中提取数字

时间:2019-08-21 11:58:40

标签: python regex

我有一个包含以下链接的数组:

https://www.example.com/category/brand_listing/soap-bodywash-shampoo-and-conditioner/1106
https://www.example.com/category/brand_listing/soap-bodywash-shampoo-and-conditioner/1106
https://www.example.com/category/brand_listing/face-care-beauty-and-makeup/1098
https://www.example.com/category/brand_listing/face-care-beauty-and-makeup/1098
https://www.example.com/category/brand_listing/deo-talc-and-wipes/1097
https://www.example.com/category/brand_listing/deo-talc-and-wipes/1097
https://www.example.com/category/brand_listing/grocery-and-gourmet/1120
https://www.example.com/category/brand_listing/grocery-and-gourmet/1120
https://www.example.com/category/brand_listing/health-drinks/1117
https://www.example.com/category/brand_listing/health-drinks/1117
https://www.example.com/category/brand_listing/oral-and-dental-care/1104
https://www.example.com/category/brand_listing/oral-and-dental-care/1104

我要提取这些网址中的最后4位数字

1106
1098
1097
1120
1117
1104

我正在使用python。

最简单的方法是什么? 正则表达式是要走的路吗?

1 个答案:

答案 0 :(得分:1)

x=['https://www.example.com/category/brand_listing/soap-bodywash-shampoo-and-conditioner/1106',
'https://www.example.com/category/brand_listing/soap-bodywash-shampoo-and-conditioner/1106']
[i[-4:] for i in x]

考虑目标的特定位置,正则表达式不是一种有效的方法。