如何从文件中的单词创建两个不同的列表?

时间:2017-02-18 05:19:43

标签: python python-3.x

我有一个具有以下结构的文件

word1a word2a
word1b word2b
word1c word2c
word1d word2d
        \n -> Empty Line
word11a word21a
word11b word21b
word11c word21c
        \n -> Empty Line
word12a word22a
word12b word22b
word12c word22c
word12d word22d
         \n -> Empty Line
         \n -> Empty Line

我需要创建两个单独的列表,看起来像

wordList_1 = [[word1a,word1b,word1c,word1d],[word11a,word11b,word11c],[word12a,word12b,word12c,word12d]]
wordList_2 = [[word2a,word2b,word2c,word2d],[word21a,word21b,word21c],[word22a,word22b,word22c,word22d]]

我该如何有效地做到这一点?

我已经提出了如下解决方案,但我知道我在实现目标方面做得不好。因此,请查看下面的代码,让我知道如何更改它以达到预期的效果。

def fun(fName):
   Create two empty lists, words1 = [] and words2 = []
   with open(fname) as f:
       all_the_lines_in_file = f.read()
   lines = all_the_lines_in_file.split("\n\n") //Split The line based on new line
   for line in lines:
        l = line.split("\n")
        Create two empty lists, w1 = [] and w2 = []
        for words in l:
          if(len(words)>1):
            w = words.split()
            w1.append(w[0])
            w2.append(w[1])
        words1.append(w1)
        words2.append(w2)

1 个答案:

答案 0 :(得分:1)

可能比你的尝试更有效率:

words1, words2 = [], []
with open(fname) as f:
    w1, w2 = [], []
    for line in f:
        if line.strip(): # line is not empty
            words = line.split()
            w1.append(words[0])
            w2.append(words[1])
        else:
            words1.append(w1)
            words2.append(w2)
            w1, w2 = [], []
    # At end of the file
    if w1: # Checking lists are not empty, only need to check w1 or w2
        words1.append(w1) 
        words2.append(w2)

print(words1)
print(words2)

输出

[['word1a', 'word1b', 'word1c', 'word1d'], ['word11a', 'word11b', 'word11c'], ['word12a', 'word12b', 'word12c', 'word12d']]
[['word2a', 'word2b', 'word2c', 'word2d'], ['word21a', 'word21b', 'word21c'], ['word22a', 'word22b', 'word22c', 'word22d']]