Question

我正在尝试从命令行读取文件，将字符读入数组，计算字符个性并打印结果。代码编译时没有任何错误，但单个字母数量远远高于大文件的数量，并且有时对于小文件根本不计算。

#include <iostream>
#include <fstream>
#include <string.h> 
#include <cctype> 
#include <algorithm>

using namespace std;


int main(int argc, char **argv){ 
if(argc !=2){
     cout << "usage: " << argv[0] << " <filename>\n";
}
else{
     ifstream myfile (argv[1]); 
     if(!myfile.is_open())  //check to see if file is opened.
        cout << "Can not open your file\n";
    else{
        static int array[26];  //stores the frequency of letters.
        char t;       
    while(!myfile.eof()){     
        while(myfile.get(t)){
            if(isupper(t)){    
                int i = 0;
                do{
                    array[tolower(t)-'a']++;
                    i++;
                }
            while(i < 26);
            }
        }
    int x = 0;
    while(x < 26){
        cout << 'a' + x << ":" << array[x] << endl;
        x++;
    }
    }
    }
    }
return 0;
}

Answer 1

问题是myfile.get(t)从流中提取字符并在t中获取。现在，如果读取的字符恰好是大写的，那么您将在数组上迭代26次，增加其小写字母数。你只需要做一次。

此外，您还需要处理输入流中的非字母字符。

while(!myfile.eof()){     
    myfile.get(t);
    if(isalpha(t) { // process only alphabets.
        if(isupper(t)) // convert upper case to lower case
            t = tolower(t);
        array[t-'a']++; // increment the count
    }
}

Answer 2

这不接受命令行上的文件名（只处理其标准输入），但可能会为一般方法提供一些灵感，这可能会更简单：

#include <ctype.h>
#include <iostream>
#include <vector>
#include <fstream>
#include <iterator>
#include <algorithm>

int main() {
    std::vector<size_t> freqs(26);

    std::for_each(std::istream_iterator<char>(std::cin), 
        std::istream_iterator<char>(),
        [&](char ch) { if(isalpha(ch)) ++freqs[tolower(ch)-'a']; });

    for (int i=0; i<26; i++)
        std::cout << (char)('a'+i) << ":" << freqs[i] << "\n";

    return 0;
}

在文本文件中计算单个字母＆amp;打印结果 - 错误的输出c ++

2 个答案: