Question

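I have a very large .txt file (9 MB). The words in it are stored as follows:

Each line of the .txt file contains a string followed by a space and a number. I want to extract only the words and store them in an array of strings. I think a regular expression would be overkill here, but I can't think of another way because I'm not familiar with streams in C++.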

Answer 1

Something like the following example:

#include <fstream>
#include <string>
#include <vector>
using namespace std;

int main() {
    vector<string> strings;
    ifstream file("path_to_file");
    string line;
    // Read the file one line at a time.
    while (getline(file, line))
        // Keep only the part before the first space (the word).
        strings.push_back(line.substr(0, line.find(' ')));

    // Do whatever you want with 'strings' vector
}

Answer 2

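You should read the file line by line and, for each line, use the string's substr() method to cut the line at the position of the space; you can use the find() method to locate that separator. Take the part before the space and ignore the rest.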

You can look here for an example.

How to efficiently read only the strings from a large text file