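How can I do text classification on the following text file? I want each paragraph to be one row in a pandas DataFrame, but I have not been able to do this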

Time: 2019-01-26 02:48:04

Tags: python machine-learning

https://i.stack.imgur.com/AbOLA.jpg

                        1

Loren ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum ipem sumsum ipsum lorsum ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum lorem ipsum

There are 154 paragraphs like this one, and I want to read each paragraph as a single row in Python. Please see the image for a clear example.

1 answer:

Answer 0 (score: 0)

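# Read the whole file; the with-block closes it automatically.
with open('sample_text.txt', 'r') as f:
    data = f.read()
# A blank line separates paragraphs, so split on two consecutive newlines.
paragraphs = data.split("\n\n")
# Drop chunks that are empty or whitespace-only (for example a lone tab).
paragraphs = [p.strip() for p in paragraphs if p.strip()]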
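
The snippet above stops at a list of strings, so here is a minimal sketch of the remaining step: putting one paragraph per row into a pandas DataFrame. The function names `split_paragraphs` and `paragraphs_to_frame`, the column name `text`, and the in-memory `sample` string are all made up for illustration. The sketch also assumes that the standalone numbers in the file (the "1" shown in the question) are paragraph numbers rather than content, and drops them; pass `drop_numeric_headings=False` if that assumption does not hold for your file.

```python
import re

import pandas as pd


def split_paragraphs(text):
    """Split text on blank lines and return the non-empty, stripped chunks."""
    # "\n\s*\n" also treats lines containing only spaces or tabs as blank.
    chunks = re.split(r"\n\s*\n", text)
    return [c.strip() for c in chunks if c.strip()]


def paragraphs_to_frame(text, drop_numeric_headings=True):
    """Return a DataFrame with one paragraph per row in a 'text' column."""
    paragraphs = split_paragraphs(text)
    if drop_numeric_headings:
        # Assumption: chunks such as "1" or "2" are paragraph numbers.
        paragraphs = [p for p in paragraphs if not p.isdigit()]
    return pd.DataFrame({"text": paragraphs})


# Small in-memory sample standing in for the contents of sample_text.txt.
sample = (
    "    1\n\nLorem ipsum first paragraph.\n\n"
    "    2\n\nLorem ipsum second paragraph.\n\t\n"
)
df = paragraphs_to_frame(sample)
print(df)
```

For the real file, you would read it first (as in the answer) and call `paragraphs_to_frame(data)`; with 154 paragraphs, `len(df)` should then be 154 if the layout matches the assumption above.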