Question

我需要从一个字符串中拆分一个子字符串，正是这个源文本：

文章发表于：教程

我要删除“发表于：”的文章，仅保留

教程

，这样我就可以保存我尝试：

category = items[1]
category.split('Article published on:','')

并

for p in articles:
            bodytext = p.xpath('.//text()').extract()
            joined_text = ''
            # loop in categories
            for each_text in text:
                stripped_text = each_text.strip()
                if stripped_text:
                    # all the categories together
                    joined_text += ' ' + stripped_text
            joined_text = joined_text.split('Article published on:','')
    items.append(joined_text)
            if not is_phrase:
                title = items[0]
                category = items[1]
                print('title = ', title)
                print('category = ', category)

这不起作用，我还缺少什么？

此代码错误：

TypeError：“ str”对象不能解释为整数

Answer 1

您可能只是忘了分配结果：

category = category.replace('Article published on:', '')

同样，您似乎打算使用replace而不是split。后者也可以工作：

category = category.split(':')[1]

分割并保存文本字符串

1 个答案: