Question

问题

代码无法正确识别输入（项目）。它只是转储到我的失败消息，即使CSV文件中存在这样的值。谁能帮我确定我做错了什么？

背景

我正在开发一个小程序，询问用户输入（此处未给出的功能），搜索CSV文件（Item）中的特定列并返回整行。 CSV数据格式如下所示。我缩短了实际数量的数据（49个字段名称，18000多行）。

代码

import csv
from collections import namedtuple
from contextlib import closing

def search():
    item = 1000001
    raw_data = 'active_sanitized.csv'
    failure = 'No matching item could be found with that item code. Please try again.'
    check = False

    with closing(open(raw_data, newline='')) as open_data:
        read_data = csv.DictReader(open_data, delimiter=';')
        item_data = namedtuple('item_data', read_data.fieldnames)
        while check == False:
            for row in map(item_data._make, read_data):
                if row.Item == item:
                    return row
                else:
                    return failure

CSV结构

active_sanitized.csv
Item;Name;Cost;Qty;Price;Description
1000001;Name here:1;1001;1;11;Item description here:1
1000002;Name here:2;1002;2;22;Item description here:2
1000003;Name here:3;1003;3;33;Item description here:3
1000004;Name here:4;1004;4;44;Item description here:4
1000005;Name here:5;1005;5;55;Item description here:5
1000006;Name here:6;1006;6;66;Item description here:6
1000007;Name here:7;1007;7;77;Item description here:7
1000008;Name here:8;1008;8;88;Item description here:8
1000009;Name here:9;1009;9;99;Item description here:9

备注

我使用Python的经验相对较少，但我认为从一开始就是一个很好的问题，以便了解更多信息。

我确定了打开（并在close函数中包装）CSV文件的方法，通过DictReader读取数据（获取字段名称），然后创建一个命名元组，以便能够快速选择所需的列输出（项目，成本，价格，名称）。列顺序很重要，因此使用DictReader和namedtuple。

虽然有可能对每个字段名称进行硬编码，但我觉得如果程序可以在文件打开时读取它们，那么在处理具有相同列名但不同的类似文件时会更有帮助专栏组织。

研究

CSV标头和命名元组： What is the pythonic way to read CSV file data as rows of namedtuples?
将CSV数据转换为元组：How to split a CSV row so row[0] is the name and any remaining items are a tuple?
还有其他研究链接，但我不能发布超过两个。

Answer 1

你有三个问题：

你在第一次失败时返回，所以永远不会越过第一行。
您正在从文件中读取字符串，并与int进行比较。
_make遍历字典键而不是值，产生错误的结果（item_data(Item='Name', Name='Price', Cost='Qty', Qty='Item', Price='Cost', Description='Description')）。
```
for row in (item_data(**data) for data in read_data):
    if row.Item == str(item):
        return row
return failure
```

这解决了手头的问题 - 我们检查一个字符串，我们只返回没有匹配的项目（尽管你可能想要开始将字符串转换为数据中的int而不是字符串的这个hackish修复/ int issue）。

我还改变了循环的方式 - 使用生成器表达式使用更自然的语法，使用dict命名属性的常规构造语法。与使用_make和map()相比，这更清晰，更易读。它还解决了问题3。

在单个列中搜索CSV文件中的特定值并返回整行

1 个答案: