Parsing a CSV file with plain JavaScript

Time: 2014-05-20 15:22:30

Tags: javascript arrays parsing csv

I'm trying to build a fairly simple "webapp" to make working with records easier.

The starting CSV file looks like this:

HeaderA,HeaderB,HeaderC
UserA-first,UserA-last,UserA-Active
UserB-first,UserB-last,UserB-Active
UserC-first,UserC-last,UserC-Active

I want to take this data and create the following arrays:

var ColumnA = ["HeaderA", "UserA-first", "UserB-first", "UserC-first"];
var ColumnB = ["HeaderB", "UserA-last", "UserB-last", "UserC-last"];
var ColumnC = ["HeaderC", "UserA-Active", "UserB-Active", "UserC-Active"];

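Once the data is in arrays, I'm confident I can iterate over it however I need.

The problems I'm running into are:

  1. How to parse a CSV file that has no comma after the third column
  2. How to do this with plain JavaScript (no external libraries)

This is my first question on Stack Overflow, so please be gentle about any mistakes I may have made :)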

1 answer:

Answer 0: (score: 0)

kei linked you to the post; here is your solution.

Just include this function and use it on your data...

function CSVToArray( strData, strDelimiter ){
        // Check to see if the delimiter is defined. If not,
        // then default to comma.
        strDelimiter = (strDelimiter || ",");

        // Create a regular expression to parse the CSV values.
        var objPattern = new RegExp(
            (
                // Delimiters.
                "(\\" + strDelimiter + "|\\r?\\n|\\r|^)" +

                // Quoted fields.
                "(?:\"([^\"]*(?:\"\"[^\"]*)*)\"|" +

                // Standard fields.
                "([^\"\\" + strDelimiter + "\\r\\n]*))"
            ),
            "gi"
            );


        // Create an array to hold our data. Give the array
        // a default empty first row.
        var arrData = [[]];

        // Create an array to hold our individual pattern
        // matching groups.
        var arrMatches = null;


        // Keep looping over the regular expression matches
        // until we can no longer find a match.
        while (arrMatches = objPattern.exec( strData )){

            // Get the delimiter that was found.
            var strMatchedDelimiter = arrMatches[ 1 ];

            // Check to see if the given delimiter has a length
            // (is not the start of string) and if it matches
            // field delimiter. If it does not, then we know
            // that this delimiter is a row delimiter.
            if (
                strMatchedDelimiter.length &&
                (strMatchedDelimiter != strDelimiter)
                ){

                // Since we have reached a new row of data,
                // add an empty row to our data array.
                arrData.push( [] );

            }


            // Now that we have our delimiter out of the way,
            // let's check to see which kind of value we
            // captured (quoted or unquoted).
            if (arrMatches[ 2 ] !== undefined){

                // We found a quoted value (possibly an empty
                // one, which a plain truthiness check would
                // miss). When we capture this value, unescape
                // any double quotes.
                var strMatchedValue = arrMatches[ 2 ].replace(
                    new RegExp( "\"\"", "g" ),
                    "\""
                    );

            } else {

                // We found a non-quoted value.
                var strMatchedValue = arrMatches[ 3 ];

            }


            // Now that we have our value string, let's add
            // it to the data array.
            arrData[ arrData.length - 1 ].push( strMatchedValue );
        }

        // Return the parsed data.
        return( arrData );
    }
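
On the "no comma after the third column" point: the line break ends the row, so the last field needs no trailing comma. For data like the sample in the question, where no field is quoted or contains a comma, a much shorter sketch may be enough. The name `parseSimpleCsv` is hypothetical, and this is only an illustration under those assumptions, not a replacement for the function above.

```javascript
// Hypothetical helper: parses CSV text that contains NO quoted fields
// and no delimiters or line breaks inside a field.
function parseSimpleCsv(text, delimiter) {
    delimiter = delimiter || ",";

    // A line break (Windows, Unix, or old Mac style) ends each row.
    var lines = text.split(/\r\n|\n|\r/);
    var rows = [];

    for (var i = 0; i < lines.length; i++) {
        // Skip blank lines, such as the one left by a trailing newline.
        if (lines[i] === "") {
            continue;
        }
        rows.push(lines[i].split(delimiter));
    }
    return rows;
}

var sample =
    "HeaderA,HeaderB,HeaderC\n" +
    "UserA-first,UserA-last,UserA-Active\n" +
    "UserB-first,UserB-last,UserB-Active\n" +
    "UserC-first,UserC-last,UserC-Active\n";

var rows = parseSimpleCsv(sample);
console.log(rows.length);   // 4
console.log(rows[1][2]);    // "UserA-Active"
```

If the data may ever contain quoted fields, stick with the regular-expression version above.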

If you want a different array structure you may need to adjust the function a little, but in the end it is the function you're looking for.
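
One way to get the column arrays from the question without touching the parser is to transpose its row-oriented result afterwards. The sketch below uses a hypothetical helper named `rowsToColumns` and assumes every row has the same number of fields. The `parsedRows` literal stands in for what `CSVToArray` would return for the sample file, assuming the file has no trailing line break.

```javascript
// Hypothetical helper: turns an array of rows into an array of columns.
// Assumes every row has the same number of fields.
function rowsToColumns(rows) {
    var columns = [];
    for (var r = 0; r < rows.length; r++) {
        for (var c = 0; c < rows[r].length; c++) {
            // Create the column array the first time we see this index.
            if (!columns[c]) {
                columns[c] = [];
            }
            columns[c].push(rows[r][c]);
        }
    }
    return columns;
}

// Stand-in for the parser's output on the sample file.
var parsedRows = [
    ["HeaderA", "HeaderB", "HeaderC"],
    ["UserA-first", "UserA-last", "UserA-Active"],
    ["UserB-first", "UserB-last", "UserB-Active"],
    ["UserC-first", "UserC-last", "UserC-Active"]
];

var columns = rowsToColumns(parsedRows);
var ColumnA = columns[0];
var ColumnB = columns[1];
var ColumnC = columns[2];

console.log(ColumnA); // ["HeaderA", "UserA-first", "UserB-first", "UserC-first"]
```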