Question

我已经坚持了几天的问题了，如果有人可以提供帮助，我将不胜感激。

我有一个数据框，其中有一列填充整数列表项。

例如，单列数据框：

>>> df=pd.DataFrame({'a':[[1, 2, 3], [2, 4, 5], [1, 7, 8]]})

>>> df
           a
0  [1, 2, 3]
1  [2, 4, 5]
2  [1, 7, 8]

我想在数据框上运行查询，以选择其元素包含特定值的行。＆＃34; in＆＃34;操作员不为此操作工作。我定义了一个我在查询中调用的函数func

>>> def func(l, v):
...     return l.apply(lambda val: v in val)

然后当我调用查询时，它在python 3.6.3上运行正常（xubuntu默认安装，通过pip3进行一些更新）。它返回唯一包含值7的行，例如

>>> df.query('@func(a, 7)')
           a
2  [1, 7, 8]

然而，当我在最后一个anaconda版本中包含的python 3.6.4上运行它时，它失败并显示以下消息：＆＃39; Series＆＃39;对象是可变的，因此它们不能被散列。

>>> df.query('@func(a, 7)') Traceback (most recent call last):   File "<stdin>", line 1, in <module>   File
"/home/cedric/.local/lib/python3.6/site-packages/pandas/core/frame.py",
line 2297, in query
    res = self.eval(expr, **kwargs)   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/frame.py",
line 2366, in eval
    return _eval(expr, inplace=inplace, **kwargs)   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/computation/eval.py",
line 295, in eval
    ret = eng_inst.evaluate()   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/computation/engines.py",
line 76, in evaluate
    res = self._evaluate()   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/computation/engines.py",
line 122, in _evaluate
    _check_ne_builtin_clash(self.expr)   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/computation/engines.py",
line 31, in _check_ne_builtin_clash
    names = expr.names   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/computation/expr.py",
line 755, in names
    return frozenset([self.terms.name])   File "/home/cedric/.local/lib/python3.6/site-packages/pandas/core/generic.py",
line 1045, in __hash__
    ' hashed'.format(self.__class__.__name__)) TypeError: 'Series' objects are mutable, thus they cannot be hashed

我希望我的函数能够运行我使用的python3（＆gt; = 3.6）。也许我做错了。任何帮助将不胜感激。

编辑1 ：我在两种情况下都使用pandas 0.22.0。

解：我找到了解决方案。出现此问题的原因是默认引擎=＆＃39; numexpr＆＃39;与anaconda的查询功能。设置engine =＆＃39; python＆＃39;时，它会再次运行。

Answer 1

似乎问题是由于anaconda的查询功能的默认引擎='numexpr'而发生的。设置engine ='python'时，它会再次运行。

我仍然无法弄清楚为什么它不适用于numexpr引擎，但我可以接受这个。

如何使用整数列表元素查询python3数据帧

1 个答案: