提取数字后跟和前面的单词

时间:2013-06-17 20:28:47

标签: python html beautifulsoup

提取数字后跟并以词语开头:

String q = 'Consumer spending in the US rose to about 62% of GDP in 1960, where it stayed until about 1981, and has since risen to 71% in 2013'
q = re.findall(r'^([^\d]+)\s(\d+)\s*,\s*([^\d]+)\s(\d+)',s)

它给出了q中所有单词和数字的列表。 所以现在我想要方法来获得数字和单词

1 个答案:

答案 0 :(得分:1)

根据您的描述,我猜你需要这样的东西:

>>> import re
>>> strs = 'Consumer spending in the US rose to about 62% of GDP in 1960, where it stayed until about 1981, and has since risen to 71% in 2013'
>>> re.findall(r'\w+\s\d+.*?\s\w+',strs)
['about 62% of', 'in 1960, where', 'about 1981, and', 'to 71% in']