如何使用c#中的itextsharp将字符串(包含html标记)转换为通过短语传递的PDF格式?

时间:2015-12-28 05:39:43

标签: c# html pdf itextsharp

我的输出像enter image description here

我想输出这个enter image description here

我正在生成一个pdf文件。我传递的字符串包含html标签。但是pdf中的字符串绑定为html标记。但我需要输出作为html标签中的设计。我正在将字符串注释传递给短语ph14 ...

public string GenerateQuotePdf(int id)
{
    Stream stream = Stream.Null;
    Document pdfDoc = new Document(PageSize.A4, 30F, 30F, 20F, 0F);
    try
        {
            stream = new FileStream(filePath, FileMode.Create);
            PdfWriter pdfWriter = PdfWriter.GetInstance(pdfDoc, stream);
            pdfDoc.Open();

            string notes ="<div style=\"text-align: center\"><b><span style=\"font-size: large\">Terms and Conditions</span></b></div><div>*<span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>Prices are in AED</div><div><br /></div><div>*<span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>All Credit Card transactions are subject to a 3.25% processing fee</div><div><br /></div><div>*<span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>In the event production is required per customer request, 50% of the entire bill will be due prior to start of production, and the <span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>balance due upon delivery.</div><div><br /></div><div>*<span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>All furniture will be delivered in A+ condition. In the event that the equipment is damaged, the renter shall be liable for all <span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>repair costs to restore the equipment to its state at the beginning of the rental period.</div><div><br /></div><div>*<span class=\"Apple-tab-span\" style=\"white-space: pre\">\t</span>Equipment shall be utilized for the stated purpose and at the stated location only.</div>";

            PdfPTable table14 = new PdfPTable(1);
            table14.WidthPercentage = 99;
            table14.DefaultCell.Border = 0;
            table14.DefaultCell.VerticalAlignment = Element.ALIGN_MIDDLE;
            Phrase ph14 = new Phrase(notes, textFont6);
            PdfPCell cell14 = new PdfPCell(ph14);
            cell14.VerticalAlignment = Element.ALIGN_CENTER;
            cell14.Border = 0;
            cell14.HorizontalAlignment = Element.ALIGN_CENTER;
            cell14.PaddingLeft = 10F;
            cell14.PaddingRight = 4F;
            cell14.PaddingTop = 4F;
            cell14.PaddingBottom = 6F;
            table14.AddCell(cell14);

            PdfPTable mainTable = new PdfPTable(1);
            mainTable.WidthPercentage = 100;
            PdfPCell mainCell = new PdfPCell();
            mainCell.PaddingTop = 0F;
            mainCell.PaddingRight = 10F;
            mainCell.PaddingBottom = 25F;
            mainCell.PaddingLeft = 10F;
            mainCell.Border = 0;
            mainCell.AddElement(table14);
            mainTable.AddCell(mainCell);
            pdfDoc.Add(mainTable);
        }
        catch
        {
        }
        finally
        {
            pdfDoc.Close();
            stream.Close();
            stream.Dispose();
        }
    }

2 个答案:

答案 0 :(得分:1)

我已将您的HTML复制到名为list_dirty.html的文件:

<div style="text-align: center"><b><span style="font-size: large">Terms and Conditions</span></b></div>
<div>*<span class="Apple-tab-span" style="white-space: pre"> </span>Prices are in AED</div>
<div><br /></div>
<div>*<span class="Apple-tab-span" style="white-space: pre"> </span>All Credit Card transactions are subject to a 3.25% processing fee</div>
<div><br /></div>
<div>*<span class="Apple-tab-span" style="white-space: pre"> </span>In the event production is required per customer request, 50% of the entire bill will be due prior to start of production, and the <span class="Apple-tab-span" style="white-space: pre"> </span>balance due upon delivery.</div>
<div><br /></div>
<div>*<span class="Apple-tab-span" style="white-space: pre"> </span>All furniture will be delivered in A+ condition. In the event that the equipment is damaged, the renter shall be liable for all <span class="Apple-tab-span" style="white-space: pre"> </span>repair costs to restore the equipment to its state at the beginning of the rental period.</div>
<div><br /></div>
<div>*<span class="Apple-tab-span" style="white-space: pre"> </span>Equipment shall be utilized for the stated purpose and at the stated location only.</div>

此HTML被视为“脏”,因为您要呈现列表,但您没有使用正确的标记来执行此操作。

我已使用List_Dirty示例将此HTML解析为PDF:

public void createPdf(String file) throws IOException, DocumentException {
    // step 1
    Document document = new Document();
    // step 2
    PdfWriter writer = PdfWriter.getInstance(document, new FileOutputStream(file));
    writer.setInitialLeading(12);
    // step 3
    document.open();
    // step 4
    XMLWorkerHelper.getInstance().parseXHtml(writer, document,
            new FileInputStream(HTML));
    // step 5
    document.close();
}

结果是一个如下所示的PDF:

enter image description here

如果您尊重自己作为开发人员(我假设您这样做),您应该对此结果感到满意:列表不是真正的列表。您可以通过将其作为我在文件list_clean.html中完成的真实列表来解决此问题:

<div style="text-align: center"><b><span style="font-size: large">Terms and Conditions</span></b></div>
<ul>
<li>Prices are in AED</li>
<li>All Credit Card transactions are subject to a 3.25% processing fee</li>
<li>In the event production is required per customer request, 50% of the entire bill will be due prior to start of production, and the balance due upon delivery.</li>
<li>All furniture will be delivered in A+ condition. In the event that the equipment is damaged, the renter shall be liable for all repair costs to restore the equipment to its state at the beginning of the rental period.</li>
<li>Equipment shall be utilized for the stated purpose and at the stated location only.</li>
</ul>

我已使用List_Clean示例将此HTML解析为PDF:

public void createPdf(String file) throws IOException, DocumentException {
    // step 1
    Document document = new Document();
    // step 2
    PdfWriter writer = PdfWriter.getInstance(document, new FileOutputStream(file));
    writer.setInitialLeading(12);
    // step 3
    document.open();
    // step 4
    XMLWorkerHelper.getInstance().parseXHtml(writer, document,
            new FileInputStream(HTML)); 
    // some code that will be explained later
    // step 5
    document.close();
}

此代码与我们之前的代码有什么不同?没有!只有HTML不同,导致PDF看起来像这样:

enter image description here

这已经更好了,但是在您的代码中,您有更多的间距,并且您还将列表添加到PdfPTable。这就是我在List_Clean示例中第二次添加相同HTML的原因:

String html = Utilities.readFileToString(HTML);
String css = "ul { list-style: disc } li { padding: 10px }";
PdfPTable table = new PdfPTable(1);
table.setSpacingBefore(20);
PdfPCell cell = new PdfPCell();
for (Element e : XMLWorkerHelper.parseToElementList(html, css)) {
    cell.addElement(e);
}
table.addCell(cell);
document.add(table);

你看到了区别吗?我正在使用CSS来定义列表符号,我正在为每个列表项定义填充。我没有将HTML直接呈现给DocumentPdfWriter,而是将HTML和CSS解析为元素列表。然后我将每个元素添加到PdfPCell

结果如下:

enter image description here

这看起来很不错,不是吗?有关使用iText(Sharp)和XML Worker的更多信息,请查看official documentationXML Worker examples / XML Worker FAQ

答案 1 :(得分:0)

通常,我使用此方法手动将HTML标记转换为纯文本

public static string ConvertHtmlToPlainText(string text)
{
text = HttpUtility.HtmlDecode(text);

text = text.Replace("<br>", "\n");
text = text.Replace("<br >", "\n");
text = text.Replace("<br />", "\n");
text = text.Replace("&nbsp;&nbsp;", "\t");


text = text.Replace("&nbsp;&nbsp;", "  ");

text = ReplaceAnchorTags(text);

return text;
}