Question

我正在尝试通过执行一些旧的Google Code Jam问题来练习C ++。我发现一个相对简单的方法是反转字符串中的单词。它可以在https://code.google.com/codejam/contest/351101/dashboard#s=p1

找到

到目前为止，我有：

#include<iostream>
using namespace std;

int main(){
    int n = 0;
    cin >> n;


    string rev = "";
    string buf = "";

    string data = "";
    getline(cin, data);

    for(int _ = 0; _ < n; _++){
        getline(cin, data);

        rev = "";
        buf = "";
        for(char& c : data) {
            buf += c;
            if(c == ' '){
                rev = buf + rev;
                buf = "";
            }
        }

        cout << "Case #" << _ + 1 << ": " << buf << " " << rev << endl;
    }

    return 0;
}

这似乎跑得很快。在使用大约time ./reverse < in > /dev/null个案例的测试文件运行1.2E6时，使用3.5进行编译时大约需要g++ -O3秒。

因此，作为基准测试，我在python中创建了一个解决方案

#!/usr/bin/env python
from sys import stdin, stdout
stdout.writelines(map(lambda n: "Case #%d: %s\n" % (n + 1, ' '.join(stdin.readline().split()[::-1])), xrange(int(stdin.readline()))))

但是，当我使用pypy在time pypy reverse.py < in > /dev/null下运行时，只需1.95秒。

理论上，pypy是用C ++编写的，C ++不应该更快或更快，如果是这样，那么如何优化这些代码以更快？

Answer 1

我认为你的C ++代码在连接字符串时会做很多内存拷贝（std :: string的大多数实现都会使整个字符串在内存中保持连续。）我认为下面的代码没有副本就可以做到这一点，但我没有测试很多。至于为什么python表现得很好，我不完全确定。

#include<iostream>

int main()
{
    size_t numCases;
    std::cin >> numCases;
    std::cin.ignore();

    for( size_t currentCase = 1; currentCase <= numCases; ++currentCase )
    {
        std::cout << "Case #" << currentCase << ": ";

        std::string line;
        getline(std::cin, line);
        size_t wordEnd = line.length() - 1;
        size_t lastSpace = std::string::npos;
        for ( int pos = wordEnd - 1; pos >= 0; --pos )
        {
            if ( line[pos] == ' ' )
            {
                for ( int prt = pos + 1; prt <= wordEnd; ++prt )
                    std::cout << line[prt];
                std::cout << ' ';
                lastSpace = pos;
                wordEnd = pos - 1;
                --pos;
            }
        }
        for ( int prt = 0; prt < lastSpace; ++prt )
            std::cout << line[prt];

        std::cout << std::endl;
    }

    return 0;
}

Answer 2

一个简单的非复制/非分配标记器是可恶的std::strtok

以下在我的测试中打败了你的python程序

#include <iostream>
#include <iterator>
#include <algorithm>
#include <vector>
#include <cstring>

int main()
{
    std::cout.sync_with_stdio(false); // we don't need C in the picture

    std::string line;
    getline(std::cin, line);
    int num_cases = stoi(line);

    std::vector<char*> words;
    for(int n = 0; getline(std::cin, line) && n < num_cases; ++n)
    {   
        words.clear();
        char* p = std::strtok(&line[0], " ");
        while (p) {
            words.push_back(p);
            p = std::strtok(nullptr, " ");
        }
        std::cout << "Case #" << n + 1 << ": ";
        reverse_copy(words.begin(), words.end(),
                     std::ostream_iterator<char*>(std::cout, " "));
        std::cout << '\n'; // never std::endl!
    }
}

PS：您的C ++和python输出不完全匹配;该程序与您的C ++输出匹配

Answer 3

您可以利用算法和迭代器库来完成更简单的操作，而不是使用两个缓冲区和大量连接。我不确定会有多快（虽然我会猜得很多但是），但它也更加紧凑。

#include<iostream>
#include<algorithm>
#include<iterator>
#include<sstream>
using namespace std;

int main(){
    int n = 0;
    cin >> n;
    string data = "";
    getline(cin, data);
    for(int _ = 0; _ < n; _++){
        getline(cin, data);
        stringstream ss(data);
        reverse(istream_iterator<string>(ss), istream_iterator<string>());
        cout << "Case #" << _ + 1 << ": " << ss.str() << endl;
    }
    return 0;
}

我该如何优化这个C ++？

3 个答案: