Question

我正在构建一个工具来自动创建包含表和关联的数据透视表的Excel工作簿。表结构位于一个工作表中，其数据将在稍后使用另一个工具从数据库中提取。数据透视表位于第二张表上，使用上一页中的表作为源。

我正在使用EPPlus来帮助构建工具，但遇到了指定cacheSource的问题。我正在使用以下内容创建范围和数据透视表：

 var dataRange = dataWorksheet.Cells[dataWorksheet.Dimension.Address.ToString()];

 var pivotTable = pivotWorksheet.PivotTables.Add(pivotWorksheet.Cells["B3"], dataRange, name);

这会将cacheSource设置为：

<x:cacheSource type="worksheet" xmlns:x="http://schemas.openxmlformats.org/spreadsheetml/2006/main">
<x:worksheetSource ref="A1:X2" sheet="dataWorksheet" />

或在Excel中，数据源设置为：

dataWorksheet!$A$1:$X$2

如果表大小永远不会改变，这样可以正常工作，但由于行数是动态的，我发现刷新数据时，只能从指定的初始范围读取数据。

我想要做的是以编程方式将cacheSource设置为：

<x:cacheSource type="worksheet" xmlns:x="http://schemas.openxmlformats.org/spreadsheetml/2006/main">
  <x:worksheetSource name="dataWorksheet" />
</x:cacheSource>

或在Excel中，将数据源设置为：

dataWorksheet

我相信有可能通过直接访问XML来做到这一点（任何关于此的指针都是最受欢迎的）但有没有办法使用EPPlus来做到这一点？

Answer 1

它可以做到，但它不是世界上最漂亮的东西。您可以从创建的EPPlus数据透视表对象中提取缓存def xml并对其进行编辑，但是当您调用package.save()（或GetAsByteArray()）时，这会对保存逻辑造成严重破坏，因为它会在保存时解析xml生成最终文件。正如你所说，这就是EPPlus无法处理表格作为来源的结果。

因此，您可以选择使用EPPlus正常保存文件，然后使用.net ZipArchive操作xlsx的内容，该文件是重命名的zip文件。诀窍是你不能在zip中乱序处理文件，否则Excel会在打开文件时抱怨。而且由于您无法插入条目（仅添加到结尾），您必须重新创建zip。这是ZipArchive上的一种扩展方法，允许您更新缓存源：

public static bool SetCacheSourceToTable(this ZipArchive xlsxZip, FileInfo destinationFileInfo, string tablename, int cacheSourceNumber = 1)
{
    var cacheFound = false;
    var cacheName = String.Format("pivotCacheDefinition{0}.xml", cacheSourceNumber);

    using (var copiedzip = new ZipArchive(destinationFileInfo.Open(FileMode.Create, FileAccess.ReadWrite), ZipArchiveMode.Update))
    {
        //Go though each file in the zip one by one and copy over to the new file - entries need to be in order
        xlsxZip.Entries.ToList().ForEach(entry =>
        {
            var newentry = copiedzip.CreateEntry(entry.FullName);
            var newstream = newentry.Open();
            var orgstream = entry.Open();

            //Copy all other files except the cache def we are after
            if (entry.Name != cacheName)
            {
                orgstream.CopyTo(newstream);
            }
            else
            {
                cacheFound = true;

                //Load the xml document to manipulate
                var xdoc = new XmlDocument();
                xdoc.Load(orgstream);

                //Get reference to the worksheet xml for proper namespace
                var nsm = new XmlNamespaceManager(xdoc.NameTable);
                nsm.AddNamespace("default", xdoc.DocumentElement.NamespaceURI);

                //get the source
                var worksheetSource = xdoc.SelectSingleNode("/default:pivotCacheDefinition/default:cacheSource/default:worksheetSource", nsm);

                //Clear the attributes
                var att = worksheetSource.Attributes["ref"];
                worksheetSource.Attributes.Remove(att);

                att = worksheetSource.Attributes["sheet"];
                worksheetSource.Attributes.Remove(att);

                //Create the new attribute for table
                att = xdoc.CreateAttribute("name");
                att.Value = tablename;
                worksheetSource.Attributes.Append(att);

                xdoc.Save(newstream);
            }

            orgstream.Close();
            newstream.Flush();
            newstream.Close();
        });
    }

    return cacheFound;

}

以下是如何使用它：

//Throw in some data
var datatable = new DataTable("tblData");
datatable.Columns.AddRange(new[]
{
    new DataColumn("Col1", typeof (int)), new DataColumn("Col2", typeof (int)), new DataColumn("Col3", typeof (object))
});

for (var i = 0; i < 10; i++)
{
    var row = datatable.NewRow();
    row[0] = i; row[1] = i*10; row[2] = Path.GetRandomFileName();
    datatable.Rows.Add(row);
}

const string tablename = "PivotTableSource";
using (var pck = new ExcelPackage())
{
    var workbook = pck.Workbook;

    var source = workbook.Worksheets.Add("source");
    source.Cells.LoadFromDataTable(datatable, true);
    var datacells = source.Cells["A1:C11"];

    source.Tables.Add(datacells, tablename);

    var pivotsheet = workbook.Worksheets.Add("pivot");
    pivotsheet.PivotTables.Add(pivotsheet.Cells["A1"], datacells, "PivotTable1");

    using (var orginalzip = new ZipArchive(new MemoryStream(pck.GetAsByteArray()), ZipArchiveMode.Read))
    {
        var fi = new FileInfo(@"c:\temp\Pivot_From_Table.xlsx");
        if (fi.Exists)
            fi.Delete(); 

        var result = orginalzip.SetCacheSourceToTable(fi, tablename, 1);
        Console.Write("Cache source was updated: ");
        Console.Write(result);
    }
}

将表而不是范围定义为数据透视表＆lt; cacheSource＆＃39;

1 个答案: