我正在尝试使用以下代码:
for x in story:
var1 = str(x)
var1 = var1.replace("<p>", "\n")
var1 = var1.replace("</p>", "")
story[x] = var1
删除段落标记,插入换行符,然后将它们重新插入变量中。字符串如下:
Panera Bread (NASDAQ: <a class="ticker" href="/stock/pnra#NASDAQ">PNRA</a>) is down 6 percent today over expectations of food inflation of 4.5% in Q3 and 5% for Q4. In addition, Panera Will Raise Menu Prices in Q4.
PNRA recently posted second quarter 2011 earnings of $1.18 per share. Reported earnings also outpaced the prior-year quarter earnings of 85 cents per share.
But shares were also lower ahead of the opening bell after the company reported weaker-than-expected same-store sales figures for its recent quarter late Tuesday. Its profit of $1.18 a share topped analysts' consensus call by a penny.
For the twenty-six weeks ended June 28, 2011, net income was $68 million, or $2.27 per diluted share. These results compare to net income of $53 million, or $1.67 per diluted share, for the twenty-six weeks ended June 29, 2010, and represent a 36% year-over-year increase in diluted earnings per share.
我收到的错误消息是:
Traceback (most recent call last):
File "C:\Python27\Sample Programs\Get Stuff from Pages\Pages and Stuff 0.1.py", line 34, in <module>
story[x] = var1
TypeError: list indices must be integers, not Tag
答案 0 :(得分:2)
for
循环迭代地将列表story
的元素替换为x
变量,而[]
列表指令需要元素索引。这会导致错误。
l = ['a','b']
print l[0]
print l['a'] // type error
编辑:我错过了该故事不包含字符串。这种变化可以完成这项工作:
story = [str(x).replace("<p>","\n").replace("<\p>","") for x in story]
注意:现在story
由字符串组成,而不是标记。
答案 1 :(得分:0)
您可能希望将这些项目附加到列表中:
story.append(var1)