Question

在我的代码中，我使用eval来评估用户给出的字符串表达式。有没有办法编译或加快这个陈述？

import math
import random

result_count = 100000
expression = "math.sin(v['x']) * v['y']"

variable = dict()
variable['x'] = [random.random() for _ in xrange(result_count)]
variable['y'] = [random.random() for _ in xrange(result_count)]

# optimize anything below this line

result = [0] * result_count

print 'Evaluating %d instances of the given expression:' % result_count
print expression

v = dict()
for index in xrange(result_count):
    for name in variable.keys():
        v[name] = variable[name][index]
    result[index] = eval(expression) # <-- option ONE
    #result[index] = math.sin(v['x']) * v['y'] # <-- option TWO

对于快速比较选项，ONE在我的机器上需要2.019秒，而选项TWO仅需0.218秒。当然，Python有一种方法可以做到这一点而无需对表达式进行硬编码。

Answer 1

你也可以欺骗python：

expression = "math.sin(v['x']) * v['y']"
exp_as_func = eval('lambda: ' + expression)

然后像这样使用它：

exp_as_func()

速度测试：

In [17]: %timeit eval(expression)
10000 loops, best of 3: 25.8 us per loop

In [18]: %timeit exp_as_func()
1000000 loops, best of 3: 541 ns per loop

作为旁注，如果v不是全局的，您可以像这样创建lambda：

exp_as_func = eval('lambda v: ' + expression)

并称之为：

exp_as_func(my_v)

Answer 2

您可以使用{2}为Python 2或{3}}为Python 3预先编译表达式来避免开销：

In [1]: import math, compiler

In [2]: v = {'x': 2, 'y': 4}

In [3]: expression = "math.sin(v['x']) * v['y']"

In [4]: %timeit eval(expression)
10000 loops, best of 3: 19.5 us per loop

In [5]: compiled = compiler.compile(expression, '<string>', 'eval')

In [6]: %timeit eval(compiled)
1000000 loops, best of 3: 823 ns per loop

确保只进行一次编译（在循环之外）。正如评论中所提到的，当在用户提交的字符串上使用eval时，请确保您对所接受的内容非常小心。

Answer 3

我认为你正在优化错误的结局。如果你想对很多数字执行相同的操作，你应该考虑使用numpy：

import numpy
import time
import math
import random

result_count = 100000
expression = "sin(x) * y"

namespace = dict(
    x=numpy.array(
        [random.random() for _ in xrange(result_count)]),
    y=numpy.array(
        [random.random() for _ in xrange(result_count)]),
    sin=numpy.sin,
)
print ('Evaluating %d instances '
       'of the given expression:') % result_count
print expression

start = time.time()
result = eval(expression, namespace)
numpy_time = time.time() - start
print "With numpy:", numpy_time


assert len(result) == result_count
assert all(math.sin(a) * b == c for a, b, c in
           zip(namespace["x"], namespace["y"], result))

为了让您了解可能的增益，我使用泛型python和lambda技巧添加了一个变体：

from math import sin
from itertools import izip

start = time.time()
f = eval("lambda: " + expression)
result = [f() for x, y in izip(namespace["x"], namespace["y"])]
generic_time = time.time() - start
print "Generic python:", generic_time
print "Ratio:", (generic_time / numpy_time)

以下是我老化机器上的结果：

$ python speedup_eval.py 
Evaluating 100000 instances of the given expression:
sin(x) * y
With numpy: 0.006098985672
Generic python: 0.270224094391
Ratio: 44.3063992807

加速并不像我预期的那么高，但仍然很重要。

Answer 4

我知道这可能不是您所提问题的确切答案，但这可以帮助其他一些开发人员加快他们的eval的速度。

您可以使用Lupa library将Lua嵌入python中，就像在this benchmark中看到的那样，这是非常快的

这是一个简单的例子：

from lupa import LuaRuntime

lua = LuaRuntime()
lua_func = lua.eval('''
    function()
        return 'hi'
    end
''')
print(lua_func())

Python：加速重复执行的eval语句的方法？

4 个答案: