为什么Pythons I / O比C ++慢得多?

时间:2012-06-29 09:01:29

标签: c++ python io

我已经创建了一些用于Python和C ++测试的代码,我从文件中读取两个矩阵并打印一些东西。似乎Python需要大约两倍的I / O时间:

$ ./test.sh -i Testing/2000.in -p "C++/read-write-only.out" -n 2
Executing: C++/read-write-only.out -i Testing/2000.in > TMPcurrentFileResult.out
It took 8 seconds for 2 executions
MIN: 4 seconds
MAX: 4 seconds

$ ./test.sh -i Testing/2000.in -p "python Python/read-write-only.py" -n 2
Executing: python Python/read-write-only.py -i Testing/2000.in > TMP..Results.out
It took 16 seconds for 2 executions
MIN: 8 seconds
MAX: 8 seconds

这是我用于Python的代码:

#!/usr/bin/python
# -*- coding: utf-8 -*-

from optparse import OptionParser
parser = OptionParser()
parser.add_option("-i", dest="filename", default="bigMatrix.in",
     help="input file with two matrices", metavar="FILE")
(options, args) = parser.parse_args()

def read(filename):
    lines = open(filename, 'r').read().splitlines()
    A = []
    B = []
    matrix = A
    for line in lines:
        if line != "":
            matrix.append(map(int, line.split("\t")))
        else:
            matrix = B
    return A, B

def printMatrix(matrix):
    for line in matrix:
        print "\t".join(map(str,line))

A, B = read(options.filename)
# Do something
printMatrix(B)

这是C ++ - 代码

#include <sstream>
#include <string>
#include <fstream>
#include <iostream>
#include <vector>
#include <algorithm>

using namespace std;

int getMatrixSize(string filename) {
    string line;
    ifstream infile;
    infile.open (filename.c_str());
    getline(infile, line);
    return count(line.begin(), line.end(), '\t') + 1;
}

void read(string filename, vector< vector<int> > &A, vector< vector<int> > &B){
    string line;
    FILE* matrixfile = freopen(filename.c_str(), "r", stdin);

    int i = 0, j, a;
    while (getline(cin, line) && !line.empty()) {
        istringstream iss(line);
        j = 0;
        while (iss >> a) {
            A[i][j] = a;
            j++;
        }
        i++;
    }

    i = 0;
    while (getline(cin, line)) {
        istringstream iss(line);
        j = 0;
        while (iss >> a) {
            B[i][j] = a;
            j++;
        }
        i++;
    }

    fclose (matrixfile);
}

void printMatrix(vector< vector<int> > matrix, int n) {
    for (int i=0; i < n; i++) {
        for (int j=0; j < n; j++) {
            if (j != 0) {
                cout << "\t";
            }
            cout << matrix[i][j];
        }
        cout << endl;
    }
}

int main (int argc, char* argv[]) {
    string filename;
    if (argc < 3) {
        filename = "bigMatrix.in";
    } else {
        filename = argv[2];
    }

    int n = getMatrixSize(filename);
    vector<int> inner (n);
    vector< vector<int> > A(n, inner), B(n, inner), C(n, inner);
    read (filename, A, B);
    // do something with the matrices
    printMatrix(C, n);
    return 0;
}

是否有可能像C ++一样快速地获得Python的I / O? 我怎样才能改进Python / C ++的I / O?

(我听说scanf应该比cin更快。为什么它会更快?)

这是所有代码的GIT-Repository

3 个答案:

答案 0 :(得分:1)

启动python解释器需要一段时间。在运行测试时要考虑到这一点。

答案 1 :(得分:0)

您必须以任何语言缓冲I / O.

为什么要比较Python和C ++?

Python是一种解释型语言,而C ++则是编译的。

答案 2 :(得分:0)

如果不混合使用C和C ++文件操作例程,则应关闭与stdio的同步。

http://www.cplusplus.com/reference/iostream/ios_base/sync_with_stdio/

cincout通常应该比C对应的更快(同步关闭)。

就Python的缓慢而言,为什么不检查I / O功能的实现呢?