加载时从Excel文件中剥离所有格式

时间:2019-02-15 10:42:12

标签: c# excel winforms csv

我想在加载Excel文件时将所有格式(边界等)从Excel文件中删除,然后再将数据填充到数据表中。

运行代码时,updateExcel_Click部分用每行ConsigneeCombo框中的列更新C列,但是,如果我正在处理的文件具有格式设置,例如10行带有边框,但其中只有8行带有文本,由于格式,它会更新所有10行

编辑

除了删除边框,updateExcel_Click部分中仅将其添加到包含文本的行中怎么办?

private void updateExcel_Click(object sender, EventArgs e)
{
    for (int i = 0; i < dataGridView1.RowCount - 1; i++)
    {
        dataGridView1[2, i].Value = ConsigneeCombo.Text;
    }
}

我当前的GetData代码是:

    private DataTable GetData(string userFileName)
    {
        string dirName = Path.GetDirectoryName(userFileName);
        string fileName = Path.GetFileName(userFileName);
        string fileExtension = Path.GetExtension(userFileName);
        string connection = string.Empty;
        string query = string.Empty;
        switch (fileExtension)
        {
            case ".xls":
                connection = $@"Provider=Microsoft.Jet.OLEDB.4.0;Data Source={userFileName};" +
                             "Extended Properties=\"Excel 8.0; HDR=Yes; IMEX=1\"";
                string sheetNamexls;
                using (OleDbConnection con = new OleDbConnection(connection))
                {
                    con.Open();
                    var dtSchema = con.GetOleDbSchemaTable(OleDbSchemaGuid.Tables, new object[] { null, null, null, "TABLE" });
                    sheetNamexls = dtSchema.Rows[0].Field<string>("TABLE_NAME");
                }

                if (sheetNamexls.Length <= 0) throw new InvalidDataException("No sheet found.");

                query = $"SELECT * FROM [{sheetNamexls}]";
                break;

            case ".xlsx":
                connection = $@"Provider=Microsoft.ACE.OLEDB.12.0;Data Source={userFileName};" +
                             "Extended Properties=\"Excel 12.0; HDR=Yes; IMEX=1\"";
                string sheetName;
                using (OleDbConnection con = new OleDbConnection(connection))
                {
                    con.Open();
                    var dtSchema = con.GetOleDbSchemaTable(OleDbSchemaGuid.Tables, new object[] { null, null, null, "TABLE" });
                    sheetName = dtSchema.Rows[0].Field<string>("TABLE_NAME");

                }

                if (sheetName.Length <= 0) throw new InvalidDataException("No sheet found.");

                query = $"SELECT * FROM [{sheetName}]";
                break;
            case ".csv":
                connection = $@"Provider=Microsoft.ACE.OLEDB.12.0;Data Source={dirName};" +
                               "Extended Properties=\"text; HDR=Yes; IMEX=1; FMT=Delimited\"";
                query = $"SELECT * FROM [{fileName}]";
                break;
        }
        return FillData(connection, query);
    }

我尝试添加ClearFormats();方法,但无法使其正常工作。

完整代码:

using System;
using System.Data;
using System.Text;
using System.Windows.Forms;
using System.IO;
using System.Data.OleDb;
using System.Data.SqlClient;

namespace DrayIn
{
    public partial class DrayIn : Form
    {
        public DrayIn()
        {
            InitializeComponent();
            using (SqlConnection sqlConnection = new SqlConnection("ConnDetails"))
            {
                SqlCommand sqlCmd = new SqlCommand(@"SELECT Id
                                                    FROM ref_bizunit_scoped sh
                                                    WHERE sh.role = 'SHIPPER'
                                                    AND sh.Life_Cycle_State = 'ACT'
                                                    ORDER BY ID", sqlConnection);
                sqlConnection.Open();
                SqlDataReader sqlReader = sqlCmd.ExecuteReader();
                while (sqlReader.Read())
                {
                    ConsigneeCombo.Items.Add(sqlReader["Id"].ToString());
                }
                sqlReader.Close();
            }
            ConsigneeCombo.SelectedIndex = 0;
        }

        private DataTable FillData(string connection, string query)
        {
            DataTable dataTable = new DataTable();
            using (OleDbConnection con = new OleDbConnection(connection))
            {
                con.Open();
                OleDbDataAdapter adapter = new OleDbDataAdapter(query, con);

                adapter.Fill(dataTable);
                adapter.Dispose();
            };
            return dataTable;
        }

    private DataTable GetData(string userFileName)
    {
        string dirName = Path.GetDirectoryName(userFileName);
        string fileName = Path.GetFileName(userFileName);
        string fileExtension = Path.GetExtension(userFileName);
        string connection = string.Empty;
        string query = string.Empty;
        switch (fileExtension)
        {
            case ".xls":
                connection = $@"Provider=Microsoft.Jet.OLEDB.4.0;Data Source={userFileName};" +
                             "Extended Properties=\"Excel 8.0; HDR=Yes; IMEX=1\"";
                string sheetNamexls;
                using (OleDbConnection con = new OleDbConnection(connection))
                {
                    con.Open();
                    var dtSchema = con.GetOleDbSchemaTable(OleDbSchemaGuid.Tables, new object[] { null, null, null, "TABLE" });
                    sheetNamexls = dtSchema.Rows[0].Field<string>("TABLE_NAME");
                }

                if (sheetNamexls.Length <= 0) throw new InvalidDataException("No sheet found.");

                query = $"SELECT * FROM [{sheetNamexls}]";
                break;

            case ".xlsx":
                connection = $@"Provider=Microsoft.ACE.OLEDB.12.0;Data Source={userFileName};" +
                             "Extended Properties=\"Excel 12.0; HDR=Yes; IMEX=1\"";
                string sheetName;
                using (OleDbConnection con = new OleDbConnection(connection))
                {
                    con.Open();
                    var dtSchema = con.GetOleDbSchemaTable(OleDbSchemaGuid.Tables, new object[] { null, null, null, "TABLE" });
                    sheetName = dtSchema.Rows[0].Field<string>("TABLE_NAME");

                }

                if (sheetName.Length <= 0) throw new InvalidDataException("No sheet found.");

                query = $"SELECT * FROM [{sheetName}]";
                break;
            case ".csv":
                connection = $@"Provider=Microsoft.ACE.OLEDB.12.0;Data Source={dirName};" +
                               "Extended Properties=\"text; HDR=Yes; IMEX=1; FMT=Delimited\"";
                query = $"SELECT * FROM [{fileName}]";
                break;
        }
        return FillData(connection, query);
    }

    private void Browse_Click(object sender, EventArgs e)
    {
        fileTextBox.Visible = true;
        ConsigneeCombo.Visible = true;
        updateExcel.Visible = true;
        dataGridView1.Visible = true;
        saveExcel.Visible = true;
        consigneeLabel.Visible = true;
        fileLabel.Visible = true;
        string userFileNameUT = string.Empty;
        string fileExtensionUT = string.Empty;
        using (OpenFileDialog ofd = new OpenFileDialog())
        {
            ofd.InitialDirectory = Environment.GetFolderPath(Environment.SpecialFolder.Desktop);
            ofd.Filter = "CSV Files|*.csv|Excel '97-2003|*.xls|Excel 2007-2019|*.xlsx";
            if (ofd.ShowDialog(this) == DialogResult.OK) 
            {
                fileExtensionUT = Path.GetExtension(ofd.FileName);
                userFileNameUT = ofd.FileName;
            }
            else
            {
                fileTextBox.Visible = false;
                ConsigneeCombo.Visible = false;
                updateExcel.Visible = false;
                dataGridView1.Visible = false;
                saveExcel.Visible = false;
                consigneeLabel.Visible = false;
                fileLabel.Visible = false;
            }
        }
        string extensionMix = string.Empty;
        if (fileExtensionUT == ".csv")  extensionMix = ".csv";
        else if (fileExtensionUT == ".xls") extensionMix = ".xls";
        else if (fileExtensionUT == ".xlsx") extensionMix = ".xlsx";
        if (userFileNameUT.Length == 0) return;
        string userFileName = Path.Combine(Path.GetDirectoryName(userFileNameUT), Path.GetFileNameWithoutExtension(userFileNameUT.Replace(".", "")) + extensionMix);
        File.Copy(userFileNameUT, userFileName, true);
        this.dataGridView1.DataSource = GetData(userFileName);
        fileTextBox.Text = userFileNameUT;
        textBox4.Text = userFileName;
        textBox1.Text = Path.GetFileName(userFileNameUT);
    }

    private void updateExcel_Click(object sender, EventArgs e)
    {
        for (int i = 0; i < dataGridView1.RowCount - 1; i++)
        {
            dataGridView1[2, i].Value = ConsigneeCombo.Text;
        }
    }

    public void ToCsV(DataGridView dGV, string filename)
    {
        string stOutput = "";
        string sHeaders = "";
        for (int j = 0; j < dataGridView1.Columns.Count; j++)
            sHeaders = sHeaders.ToString() + Convert.ToString(dataGridView1.Columns[j].HeaderText) + ",";
        stOutput += sHeaders + "\r\n";
        for (int i = 0; i < dataGridView1.RowCount - 1; i++)
        {
            string stLine = "";
            for (int j = 0; j < dataGridView1.Rows[i].Cells.Count; j++)
                stLine = stLine.ToString() + Convert.ToString(dataGridView1.Rows[i].Cells[j].Value) + ",";
            stOutput += stLine + "\r\n";
        }
        Encoding utf16 = Encoding.GetEncoding(1254);
        byte[] output = utf16.GetBytes(stOutput);
        FileStream fs = new FileStream(filename, FileMode.Create);
        BinaryWriter bw = new BinaryWriter(fs);
        bw.Write(output, 0, output.Length); 
        bw.Flush();
        bw.Close();
        fs.Close();
    }

    private void saveExcel_Click_1(object sender, EventArgs e)
    {
        SaveFileDialog sfd = new SaveFileDialog();
        sfd.Title = "Save Excel Files";
        sfd.CheckPathExists = true;
        sfd.DefaultExt = "csv";
        sfd.Filter = "Excel Files|*.csv";
        string saveFileName = textBox1.Text;
        string fileExtensionTrim = Path.GetExtension(saveFileName);
        string subFinalSaveName = textBox1.Text;
        string finalSaveName = Path.GetFileNameWithoutExtension(subFinalSaveName) + ".csv";
        textBox3.Text = finalSaveName;
        sfd.FileName = finalSaveName;
        sfd.InitialDirectory = @"C:";
        if (sfd.ShowDialog() == DialogResult.OK)
        {
            ToCsV(dataGridView1, sfd.FileName);
            string userFileName = textBox4.Text;
            File.Delete(userFileName);
            fileTextBox.Visible = false;
            ConsigneeCombo.Visible = false;
            updateExcel.Visible = false;
            dataGridView1.Visible = false;
            saveExcel.Visible = false;
            consigneeLabel.Visible = false;
            fileLabel.Visible = false;
        }
        else
        {
            fileTextBox.Visible = true;
            ConsigneeCombo.Visible = true;
            updateExcel.Visible = true;
            dataGridView1.Visible = true;
            saveExcel.Visible = true;
            consigneeLabel.Visible = true;
            fileLabel.Visible = true;
        }
    }
}
}

2 个答案:

答案 0 :(得分:1)

  

运行代码时,updateExcel_Click部分将列C更新为   ConsigneeCombo框中每一行的内容,但是如果我是   处理具有格式化,例如10行带边框,但只有8行   其中的每一行都带有文本,由于格式,它会全部更新10

马特,很抱歉,但是您发布的代码与Excel无关。它无条件更新dataGridView1单元。因此,如果您只想更新部分单元格,则必须添加条件:

private void updateExcel_Click(object sender, EventArgs e)
{
    for (int i = 0; i < dataGridView1.RowCount - 1; i++)
    {
        if(_your_logic_here_)
            dataGridView1[2, i].Value = ConsigneeCombo.Text;
    }
}

但是,我确实相信这不是您想要的,因为您正在使用OleDb提供程序来获取/获取Excel数据。

请注意,OleDb提供程序公开了提供CRUD操作的方法。您可以通过OleDbCommand INSERT(创建),SELECT(读取),UPDATE(修改)和DELETE(销毁)Excel数据。

因此,如果您想UPDATE数据,请使用以下语句:

UPDATE [SheetNameOrTableName$]
    SET FieldName=NewValue
    WHERE FieldName=OldValue

您必须将其作为字符串传递给OleDbCommand.Command

string sSQL = @"UPDATE [SheetNameOrTableName$]
    SET FieldName=?
    WHERE FieldName=?";

string sSQL = @"UPDATE [SheetNameOrTableName$]
    SET FieldName=@param1
    WHERE FieldName=@param2";

但是我必须警告您:JET / ACE的OleDb提供程序无法识别命名参数!因此,您必须按正确的顺序向OleDbCommand添加参数!

最后,我建议您重新考虑您的应用程序,并将业务逻辑与数据访问分开。参见:
Creating a Data Access Layer (C#)
Creating a Business Logic Layer (C#)
Writing a Portable Data Access Layer

以上文章提供了ASP.NET页面的信息,但WinForms的逻辑必须相同!

DAL类的Excel文件的一部分可能如下所示:

public class ExcelDAL
{
    private string sExcelFile = string.Empty;
    private string sExcelSheet = string.Empty;
    private bool bUseHeaders = false;
    private bool bUseIMEX = false;

    public ExcelDAL(string _ExcelFile, string _ExcelSheet, bool _UseHeaders, bool _UseIMEX)
    {
        sExcelFile = _ExcelFile;
        sExcelSheet = _ExcelSheet;
        bUseHeaders = _UseHeaders;
    }


    private string GetConnString()
    {
        string suh = bUseHeaders ? "YES" : "NO";
        string sui = bUseIMEX ? "IMEX=1;" : "";
        return string.Format("Provider=Microsoft.ACE.OLEDB.12.0;Data Source={0};Extended Properties='Excel 12.0;HDR={1}';{2}", sExcelFile, suh, sui);
    }

    public DataTable GetSheetData()
    {
        DataTable dt = new DataTable();
        using (OleDbConnection connection = new OleDbConnection(GetConnString()))
        {
            string sql = string.Format(@"SELECT * FROM [{0}$];", sExcelSheet);
            connection.Open();
            using(OleDbCommand command = new OleDbCommand(sql, connection))
            {
                using (OleDbDataReader reader = command.ExecuteReader())
                {
                    dt.Load(reader);
                }
            }
        }
        return dt;
    }

    //other members and methods of DAL class
}

随时根据您的需求改进ExcelDAL类。

祝你好运!

答案 1 :(得分:1)

我同意@Maciej Los,您的问题似乎集中在“ Excel”中,但是在将ComboBox中的文本添加到第三行中时,代码在“ Excel”中没有任何作用DataGridView中所有行的列。这很令人困惑,我将从DataGridView的角度开始,因为这是当前代码所使用的。

根据您的评论…

  

....如果我正在处理的文件具有格式,例如10行   带有边框,但其中只有8行带有文本,它将更新所有10个边框   因为格式。

这不一定是准确的……由于“格式”,代码没有更新它们……因为有十(10)行,因此正在“更新”它们! ……发布的代码只是在网格中的所有行中循环。它不检查任何格式,也不检查行是否为“空”!

当您在空白单元格中“读取”具有单元格格式的“ Excel”文件时(如您所述)……它将在读取时被拾取,甚至在数据源中也将变成“行”,甚至尽管所有单元格可能都是空的。这是一个“ Excel”问题,我知道有一个解决方案,它将在您的代码读取“ Excel”文件之前“删除”所有这些“空”单元格,从而从头开始“消除”这些“空”行。 >

我希望我不会错过任何东西...。

要使用DatGridView进行此操作,可以创建一个小的方法,该方法给定网格中的行索引,如果行为“空”文本,则返回true。从现有的updateExcel_Click调用此方法……可能看起来像下面的样子……

 private void updateExcel_Click(object sender, EventArgs e) {
  for (int i = 0; i < dataGridView1.RowCount - 1; i++) {
    if (!RowIsEmpty(i)) {
      dataGridView1[2, i].Value = ConsigneeCombo.Text;
    }
  }
}

private bool RowIsEmpty(int rowIndex) {
  for (int i = 0; i < dataGridView1.ColumnCount; i++) {
    if (dataGridView1.Rows[rowIndex].Cells[i].Value != null &&
        dataGridView1.Rows[rowIndex].Cells[i].Value.ToString() != "") {
      return false;
    }
  }
  return true;
}

关于从Excel文件中删除“空格式”的单元格……

Fastest method to remove Empty rows and Columns From Excel Files using Interop

可能会有所帮助。我知道它使用“互操作”,但是,我相信使用OLEDB实施它并不困难。基本上,将Excel工作表中的“ usedRange”读入对象数组,该数组将删除此格式。

如果我缺少导入内容,请告诉我。希望这会有所帮助。