Question

我试图从长文本文件中提取完整句子，并将它们作为字符串添加到Python 2.7中的列表中。我希望自动化这个，而不仅仅是剪切和粘贴在列表中。

这就是我所拥有的：

Counter.most_common()

没有任何内容加载到列表中。

Answer 1

您可以使用while循环执行此操作：

listed = []
with open(filename,"r") as text:
    Line = text.readline()


    while Line!='':
        listed.append(Line)
        Line = text.readline()

print listed

在前面的例子中，我假设每个句子都写在不同的行上，如果不是这样的话，请使用这个代码：

listed = []


with open(filename,"r") as text:
    Line = text.readline()


    while Line!='':
        Line1 = Line.split(".")
        for Sentence in Line1:
            listed.append(Sentence)
        Line = text.readline()

print listed

在旁注中，请尝试使用with open(...) as text:代替text = open(...)

Answer 2

通常句子由'. '分隔，而不是'\n'。在这种情况下，请使用period+space（不使用return-enter）拆分：

listed = []

fd = open(filename,"r") 
try:
    data = fd.read()
    sentences = data.split(". ")
    for sentence in sentences:
        listed.append(sentence)

    print listed
finally:
    fd.close()

从文本文件中复制完整的句子并添加到列表中

2 个答案: