This is my code for converting HTML to PDF:
public boolean create(String htmlText, String absoluteFilePath) {
    try {
        Document document = new Document(PageSize.LETTER);
        PdfWriter pdfWriter = PdfWriter.getInstance(
                document, new FileOutputStream(absoluteFilePath));
        document.open();
        // Fixing xhtml tag
        Tidy tidy = new Tidy(); // obtain a new Tidy instance
        tidy.setXHTML(true); // set desired config options using tidy setters
        ByteArrayOutputStream output = new ByteArrayOutputStream();
        tidy.setCharEncoding(Configuration.UTF8);
        // Run JTidy over the raw HTML; the cleaned XHTML is written to "output"
        tidy.parse(new ByteArrayInputStream(htmlText.getBytes()), output);
        String preparedText = output.toString("UTF-8");
        Log.i("CHECKING", "JTidy Out: " + preparedText);
        InputStream inputStream = new ByteArrayInputStream(preparedText.getBytes());
        XMLWorkerHelper.getInstance().parseXHtml(pdfWriter, document,
                inputStream, null, Charset.forName("UTF-8"), new MyFont());
        document.close();
        return true;
    } catch (Exception e) {
        // Remove the partially written PDF so no broken file is left behind
        File file = new File(absoluteFilePath);
        if (file.exists()) {
            boolean isDeleted = file.delete();
            Log.i("CHECKING", "PDF isDeleted: " + isDeleted);
        }
        LOGGER.error("Exception: " + e.getMessage());
        e.printStackTrace();
        return false;
    }
}
It works for the following htmlText:
<p dir="ltr"><br>
wwwww<br>
--- <br>
Sent bys.</p>
<p>Original message:</p>
<blockquote>
<strong>From: </strong>
nakhmedov@s.com
<br/>
<strong>Sent: </strong>
Dec 1, 2014 5:10:19 PM
<br/>
<strong>
To:
</strong>
ssss
<br/>
<strong>Subject: </strong>
test
<br/>
<br/>
<p dir="ltr">
<br>
123<br>
--- <br>
ssssssss.</p>
</blockquote>
&#13;
It does not work for the following htmlText:
<p dir="ltr"><br>
123<br>
--- <br>
Sent by ss.</p>
<p>Original message:</p>
<blockquote>
<strong>From: </strong>
Navruzbek Akhmedov <akhmedovnavruzbek@gmail.com>
<br/>
<strong>Sent: </strong>
Dec 1, 2014 5:14:36 PM
<br/>
<strong>
To:
</strong>
Navruzbek Akhmedov <nakhmedov@sss.com>
<br/>
<strong>Subject: </strong>
test
<br/>
<br/>
<div dir="ltr">12345</div>
</blockquote>
&#13;
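Please help me understand why the two inputs behave differently. For the second htmlText I get the error "document has no pages", and the output stream is empty after this call:
tidy.parse(new ByteArrayInputStream(htmlText.getBytes("ISO-8859-1")), output);
Thanks in advance!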
Answer 0 (score: 1)
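I solved this problem recently. The problem is the text Navruzbek Akhmedov <akhmedovnavruzbek@gmail.com> in the HTML. As far as I can tell, the iText library treats <akhmedovnavruzbek@gmail.com> as if it were an HTML tag. Since it is not in the list of known HTML tags, an error is raised, and no content ends up in the document. That's all! :)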
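
One possible workaround, given the diagnosis above, is to escape angle-bracketed e-mail addresses before the text is handed to Tidy, so that they are read as plain text rather than as tags. The following is only a minimal sketch: the class name EmailBracketEscaper, the method escapeBracketedEmails, and the regular expression are illustrative assumptions, not part of the original code or of iText/JTidy, and the pattern is deliberately simple rather than a full e-mail validator.

```java
import java.util.regex.Pattern;

// Hypothetical helper (not part of iText or JTidy): escapes text such as
// "<user@example.com>" so an HTML parser does not mistake it for a tag.
final class EmailBracketEscaper {

    // Assumption: a bracketed e-mail address contains an '@' and none of the
    // characters that normally appear in real tags (spaces, quotes, '=', '/').
    private static final Pattern BRACKETED_EMAIL =
            Pattern.compile("<([^<>\\s\"'=/]+@[^<>\\s\"'=/]+)>");

    private EmailBracketEscaper() {
    }

    // Replaces "<a@b>" with "&lt;a@b&gt;" and leaves real tags untouched.
    static String escapeBracketedEmails(String html) {
        return BRACKETED_EMAIL.matcher(html).replaceAll("&lt;$1&gt;");
    }
}
```

In create(...), this could be applied as htmlText = EmailBracketEscaper.escapeBracketedEmails(htmlText); before the tidy.parse(...) call. Real tags such as <br/> or <p dir="ltr"> contain no '@' and are not matched by the pattern.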