考虑以下代码:
class Foo1(dict):
def __getattr__(self, key): return self[key]
def __setattr__(self, key, value): self[key] = value
class Foo2(dict):
__getattr__ = dict.__getitem__
__setattr__ = dict.__setitem__
o1 = Foo1()
o1.x = 42
print(o1, o1.x)
o2 = Foo2()
o2.x = 42
print(o2, o2.x)
我希望输出相同。但是,使用CPython 2.5,2.6(类似于3.2),我得到:
({'x': 42}, 42)
({}, 42)
使用PyPy 1.5.0,我得到了预期的输出:
({'x': 42}, 42)
({'x': 42}, 42)
哪个是“正确”输出? (或者根据Python文档应该输出什么?)
Here是CPython的错误报告。
答案 0 :(得分:7)
我怀疑它与查找优化有关。来自源代码:
/* speed hack: we could use lookup_maybe, but that would resolve the
method fully for each attribute lookup for classes with
__getattr__, even when the attribute is present. So we use
_PyType_Lookup and create the method only when needed, with
call_attribute. */
getattr = _PyType_Lookup(tp, getattr_str);
if (getattr == NULL) {
/* No __getattr__ hook: use a simpler dispatcher */
tp->tp_getattro = slot_tp_getattro;
return slot_tp_getattro(self, name);
}
快速路径不会在类字典中查找。
因此,获得所需功能的最佳方法是在类中放置覆盖方法。
class AttrDict(dict):
"""A dictionary with attribute-style access. It maps attribute access to
the real dictionary. """
def __init__(self, *args, **kwargs):
dict.__init__(self, *args, **kwargs)
def __repr__(self):
return "%s(%s)" % (self.__class__.__name__, dict.__repr__(self))
def __setitem__(self, key, value):
return super(AttrDict, self).__setitem__(key, value)
def __getitem__(self, name):
return super(AttrDict, self).__getitem__(name)
def __delitem__(self, name):
return super(AttrDict, self).__delitem__(name)
__getattr__ = __getitem__
__setattr__ = __setitem__
def copy(self):
return AttrDict(self)
我找到的工作符合预期。
答案 1 :(得分:3)
这是一个已知的(可能不太好)记录的差异。 PyPy不区分函数和内置函数。在CPython函数中,当存储在类(具有__get__)时,将绑定为未绑定方法,而内置函数则不会(它们是不同的)。
然而,在PyPy中,内置函数与python函数完全相同,因此解释器无法区分它们并将它们视为python级函数。我认为这被定义为实现细节,尽管有一些关于删除这种特殊差异的python-dev的讨论。
干杯,
fijal
答案 2 :(得分:1)
请注意以下事项:
>>> dict.__getitem__ # it's a 'method'
<method '__getitem__' of 'dict' objects>
>>> dict.__setitem__ # it's a 'slot wrapper'
<slot wrapper '__setitem__' of 'dict' objects>
>>> id(dict.__dict__['__getitem__']) == id(dict.__getitem__) # no bounding here
True
>>> id(dict.__dict__['__setitem__']) == id(dict.__setitem__) # or here either
True
>>> d = {}
>>> dict.__setitem__(d, 1, 2) # can be called directly (since not bound)
>>> dict.__getitem__(d, 1) # same with this
2
现在我们可以将它们包装起来(即使没有它,__getattr__
也能正常工作):
class Foo1(dict):
def __getattr__(self, key): return self[key]
def __setattr__(self, key, value): self[key] = value
class Foo2(dict):
"""
It seems, 'slot wrappers' are not bound when present in the __dict__
of a class and retrieved from it via instance (or class either).
But 'methods' are, hence simple assignment works with __setitem__
in your original example.
"""
__setattr__ = lambda *args: dict.__setitem__(*args)
__getattr__ = lambda *args: dict.__getitem__(*args) # for uniformity, or
#__getattr__ = dict.__getitem__ # this way, i.e. directly
o1 = Foo1()
o1.x = 42
print(o1, o1.x)
o2 = Foo2()
o2.x = 42
print(o2, o2.x)
给出了:
>>>
({'x': 42}, 42)
({'x': 42}, 42)
有问题的行为背后的机制(可能,我不是专家)在Python的“干净”子集之外(如“学习Python”或“简单的Python”等详尽的书籍中所述,并且有点松散地指定在python.org)并且与实施中“按原样”记录的语言部分有关(并且经常(经常)更改)。