下载CSV大文件并放入Google表格中

时间:2019-01-09 08:38:52

标签: google-apps-script google-sheets google-drive-api csv-import

这是我在自动化领域的一个小项目。我会定期检索电子邮件报告CSV附件,并使用Google App脚本将其直接转换为Google表格。但是其中有一个报告太大,不适合Blob限制大小(50mb),将出现执行错误。

因此,无法下载并将其存储在Google驱动器中。

我试图存储contentText并使用我在网上找到的CSVToArray函数

function CSVToArray( strData, strDelimiter ) {
  // Check to see if the delimiter is defined. If not,
  // then default to COMMA.
  strDelimiter = (strDelimiter || ",");
  // Create a regular expression to parse the CSV values.
  var objPattern = new RegExp(
    (
      // Delimiters.
      "(\\" + strDelimiter + "|\\r?\\n|\\r|^)" +

      // Quoted fields.
      "(?:\"([^\"]*(?:\"\"[^\"]*)*)\"|" +

      // Standard fields.
      "([^\"\\" + strDelimiter + "\\r\\n]*))"
    ),
    "gi"
  );

  // Create an array to hold our data. Give the array
  // a default empty first row.
  var arrData = [[]];

  // Create an array to hold our individual pattern
  // matching groups.
  var arrMatches = null;

  // Keep looping over the regular expression matches
  // until we can no longer find a match.
  while (arrMatches = objPattern.exec( strData )){
    // Get the delimiter that was found.
    var strMatchedDelimiter = arrMatches[ 1 ];
    // Check to see if the given delimiter has a length
    // (is not the start of string) and if it matches
    // field delimiter. If id does not, then we know
    // that this delimiter is a row delimiter.
    if (
      strMatchedDelimiter.length &&
      (strMatchedDelimiter != strDelimiter)
    ){

      // Since we have reached a new row of data,
      // add an empty row to our data array.
      arrData.push( [] );

    }
    // Now that we have our delimiter out of the way,
    // let's check to see which kind of value we
    // captured (quoted or unquoted).
    if (arrMatches[ 2 ]){
      // We found a quoted value. When we capture
      // this value, unescape any double quotes.
      var strMatchedValue = arrMatches[ 2 ].replace(
        new RegExp( "\"\"", "g" ),
        "\""
      );
    } else {
      // We found a non-quoted value.
      var strMatchedValue = arrMatches[ 3 ];
    }
    // Now that we have our value string, let's add
    // it to the data array.
    arrData[ arrData.length - 1 ].push( strMatchedValue );
  }
  // Return the parsed data.
  Logger.log(arrData);
  return( arrData );
};
function GetCSVFromLink(link){

  var urlData = UrlFetchApp.fetch(link);
  var stringData = urlData.getContentText(); 
  //
  //All the folder creation etc is here
  //
    var CSVArray = CSVToArray(stringData);   
    var newsheet = ss.insertSheet("NewReport");
    for ( var i =0, lenCsv=CSVArray.length; i<lenCsv;i++)
    {
     newsheet.getRange(i+1,1,1,CSVArray[i].length).setValues(new Array(CSVArray[i]));

    }

最后,我达到了“最大执行时间”。这个特定的报告有3万行,因此即使30分钟的长时间执行也无法完成。但是,这适用于其他较小的csv文件。(但是当我可以通过Drive API直接转换为Google表格时,则不想这样做

我还发现,如果将其从CSV转换为xlsm,它将更小,并且在那里转换也更容易。但是问题是我无法将CSV文件自动下载到我的云端硬盘中,而且我不知道如何使用App脚本将CSV转换为xlsm。

还有其他解决方法吗?还是你们认为还有其他方法可行?

1 个答案:

答案 0 :(得分:1)

您可能可以通过Drive API利用可恢复的上传。请参阅Tanaike's解决方案:

Resumable Upload for Web Apps using Google Apps Script