Terrible RegEx performance in Firefox

Date: 2013-10-30 10:53:27

Tags: javascript regex performance firefox

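I use the JavaScript parser generator JISON to create a parser for some scripts that my users write. Recently I noticed that parsing is much slower in Firefox than in any other browser my page supports (IE10, the latest Chrome and Opera).

After digging into the source code of the generated parser, I narrowed the problem down to a single line of code that executes a regular expression to tokenize the input. This line is, of course, executed very often.

I created a small test case with a random string (about 1300 characters) and a very generic regular expression. The test case measures the average time it takes to execute the regular expression 10000 times (Working example on JSFiddle):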

$(document).ready(function() {
    var str = 'asdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj ölkasjd flöaksjdf löask dfjkasdfasdfa asdfasdf asdf asdf asdfasödlfkja asldfkj asdölkfj aslödkjf aösldkfj',
        regex = new RegExp('^([0-9])+'),
        durations = [],
        resHtml = 'Durations:',
        totalDuration = 0,
        matches, start;

    // Perform "timing test" 10 times to get some average duration
    for (var i = 0; i < 10; i++) {
        // Execute regex 10000 times and see how long it takes
        start = window.performance.now();
        for (var j = 0; j < 10000; j++) {
            regex.exec(str);
        }
        durations.push(window.performance.now() - start);
    }

    // Create output string and update DIV
    for (var i = 0; i < durations.length; i++) {
        totalDuration += durations[i];
        resHtml += '<br>' + i + ': ' + (parseInt(durations[i] * 100, 10) / 100) + ' ms';
    }
    resHtml += '<br>==========';
    resHtml += '<br>Avg: ' + (parseInt((totalDuration / durations.length) * 100, 10) / 100) + ' ms';

    $('#result').html(resHtml);
});

Here are the test results on my machine:

Firefox 24: average time between 370 and 450 ms for 10000 executions of the regular expression
Chrome 30, Opera 17, IE 10: average time between 0.3 and 0.6 ms

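The difference gets even larger as the tested string grows. A 6000-character string raises the average time in Firefox to ~1.5 seconds (!), while the other browsers still need only ~0.5 ms (!) (Working example on JSFiddle with 6000 characters).

Why is there such a huge performance difference between Firefox and all the other browsers, and is there anything I can do to improve it?

Please note that I cannot really adjust the executed regular expressions themselves, because they are mostly produced by the parser generator and I do not want to change the generated parser code by hand.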

1 answer:

Answer 0 (score: 2)

It is the RegExp capturing group that gets you:

/^[0-9]+/ and/or /^(?:[0-9])+/ and/or /^([0-9]+)/ are orders of magnitude faster than /^([0-9])+/. They should be viable replacements.
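As a rough check that these patterns are interchangeable when only the matched text matters, the sketch below runs all four against a short sample input and collects the overall match (index 0 of the result). The sample string and variable names are made up for illustration, and the comparison only holds under the assumption that the calling code reads the whole match rather than capture group 1.

```javascript
// Hypothetical sample input; not taken from the question's test string.
var sample = '12345abc';

var patterns = {
    slow: /^([0-9])+/,            // capturing group repeated once per digit
    plain: /^[0-9]+/,             // no group at all
    nonCapturing: /^(?:[0-9])+/,  // group kept, capture dropped
    wholeCapture: /^([0-9]+)/     // a single capture around the whole run
};

// Collect the overall match (m[0]) for every pattern.
var matchedText = {};
Object.keys(patterns).forEach(function (name) {
    var m = patterns[name].exec(sample);
    matchedText[name] = m ? m[0] : null;
});

console.log(matchedText);
```

All four patterns report the same leading run of digits as the overall match, so swapping one for another does not change what is consumed from the input.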

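I would expect capturing groups to be slightly slower, but it baffles me that they are this much slower. However, the slow version has the potential to create lots and lots of captures, while the other versions do not, so that seems to be an important difference.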
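The difference in captures can be seen in the result arrays. In the minimal sketch below (the input string and variable names are assumptions for illustration), the repeated group is captured again on every iteration and only the last capture survives, whereas the other forms capture once or not at all. It shows the semantics only and makes no claim about how any particular engine implements them.

```javascript
// Hypothetical input for illustration.
var input = '12345abc';

var repeated = /^([0-9])+/.exec(input);        // group re-captured for each digit
var wrapped = /^([0-9]+)/.exec(input);         // group captured once
var nonCapturing = /^(?:[0-9])+/.exec(input);  // no capture slot at all

console.log(repeated[1]);         // capture from the final iteration only
console.log(wrapped[1]);          // the whole run of digits
console.log(nonCapturing.length); // only the overall match is present
```

Code that actually reads group 1 would therefore see different values depending on the variant, which is worth checking before replacing a pattern.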

Unscientific jsperf
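For a quick local comparison, something along these lines could be used. It is only a rough sketch: the helper name `timeRegex`, the iteration count, and the generated test string are assumptions, `Date.now()` is used instead of `window.performance.now()` so it also runs outside a browser, and the absolute numbers will vary by browser and machine.

```javascript
// Hypothetical helper: time `iterations` executions of `regex` against `str`.
function timeRegex(regex, str, iterations) {
    var start = Date.now();
    for (var i = 0; i < iterations; i++) {
        regex.exec(str);
    }
    return Date.now() - start; // elapsed milliseconds
}

// Assumed input: 1300 non-digit characters, similar in length to the question's string.
var testStr = new Array(1301).join('a');

var candidates = [/^([0-9])+/, /^[0-9]+/, /^(?:[0-9])+/, /^([0-9]+)/];

// One timing entry per candidate pattern.
var timings = candidates.map(function (re) {
    return { pattern: re.source, ms: timeRegex(re, testStr, 10000) };
});

console.log(timings);
```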

You may want to file a bug.