Question

Simple question here: I have a list of stop words in a CSV text file named stop_words.txt.

I am using this code to add them to a list:

>>> import csv
>>> stops = []
>>> with open('/stop_words.txt', 'rU') as f:
    reader = csv.reader(f)
    for row in reader:
        stops.append(row)

The problem is that when I run

>>> len(stops)
1

I get a length of 1. The contents look like this:

>>> stops
[['a', 'able', 'about', 'across', 'after', 'all', 'almost', 'also', 'am', 'among', 'an', 'and', 'any', 'are', 'as', 'at', 'be', 'because', 'been', 'but', 'by', 'can', 'cannot', 'could', 'dear', 'did', 'do', 'does', 'either', 'else', 'ever', 'every', 'for', 'from', 'get', 'got', 'had', 'has', 'have', 'he', 'her', 'hers', 'him', 'his', 'how', 'however', 'i', 'if', 'in', 'into', 'is', 'it', 'its', 'just', 'least', 'let', 'like', 'likely', 'may', 'me', 'might', 'most', 'must', 'my', 'neither', 'no', 'nor', 'not', 'of', 'off', 'often', 'on', 'only', 'or', 'other', 'our', 'own', 'rather', 'said', 'say', 'says', 'she', 'should', 'since', 'so', 'some', 'than', 'that', 'the', 'their', 'them', 'then', 'there', 'these', 'they', 'this', 'tis', 'to', 'too', 'twas', 'us', 'wants', 'was', 'we', 'were', 'what', 'when', 'where', 'which', 'while', 'who', 'whom', 'why', 'will', 'with', 'would', 'yet', 'you', 'your']]

So there is a list inside the list, but I don't understand why.

Many thanks.

Answer 1

csv.reader returns a list for each row. Since you append row to stops, you are adding a list to a list. To prevent that, you can use:

stops.extend(row)

Or even better, use a list comprehension:

stops = [item for row in reader for item in row]
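
To make the difference concrete, here is a minimal sketch comparing append and extend on the same row. The in-memory data is a hypothetical stand-in for a one-line stop_words.txt, and the names appended and extended are only for illustration:

```python
import csv
import io

# Hypothetical in-memory stand-in for a one-line stop_words.txt
data = io.StringIO("a,able,about\n")

appended = []
extended = []
for row in csv.reader(data):
    appended.append(row)   # adds the whole row as a single element
    extended.extend(row)   # adds each word in the row separately

print(len(appended))  # 1
print(len(extended))  # 3
```

With append the result is a list containing one list; with extend it is a flat list of words.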

Answer 2

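It looks like your stop_words.txt file contains just one long line. You can use that row directly as your list:

import csv

with open('/stop_words.txt', 'rU') as f:
    stops = next(csv.reader(f))  # the first (and only) row, already a flat list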

Answer 3

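Your csv reader splits each line on the commas and returns a list. You then add that list (as a single element) to your stops list. Instead, iterate over the returned row and add each entry to your stop word list.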
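
A rough sketch of that loop, assuming the file may hold one or more comma-separated lines. The helper name flatten_rows and the sample data are made up for illustration and are not from the question:

```python
import csv
import io

def flatten_rows(f):
    """Collect every field from every CSV row into one flat list."""
    stops = []
    for row in csv.reader(f):
        for word in row:
            stops.append(word)  # append is fine here: word is a string, not a list
    return stops

# Hypothetical sample data standing in for the real file
sample = io.StringIO("a,able\nabout,across\n")
stops = flatten_rows(sample)
print(stops)
```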

Answer 4

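I assume your CSV file contains only one line, which is the list of all the stop words. You are building a list of "rows", where each row is itself a list. That is exactly what happened; there is just only one row. Since there is only one row, you can simply assign stops to the first row of the csv.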
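
One possible way to write that, assuming you also want an empty file to give an empty list rather than raise StopIteration. The helper name first_row is hypothetical, and the in-memory strings stand in for the real file:

```python
import csv
import io

def first_row(f):
    # next() with a default returns [] when the reader has no rows at all
    return next(csv.reader(f), [])

stops = first_row(io.StringIO("a,able,about\n"))
empty = first_row(io.StringIO(""))
print(stops)
print(empty)
```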

Why does this list insert a nested list?
