Question

我正在将Java与Apache Commons-IO一起使用来下载PDF，但是我只想获得第一页，有什么办法可以做到？

这是获取整个文档的代码：

public void getPDF(String route) throws IOException {
    URL url = new URL(route);
    File file = new File("file.pdf");
    FileUtils.copyURLToFile(url, file);
}

Answer 1

继代码之后，您可以使用新文档仅保留给定PDF文件的第一页。

 URL url = new URL(route);
 File file = new File("file.pdf");
 FileUtils.copyURLToFile(url, file);

 PDDocument pdDoc = PDDocument.load(file);
 PDDocument document = null;

int pageNumberToRead=0;

try {   
    document = new PDDocument();   
    document.addPage((PDPage) pdDoc.getDocumentCatalog().getAllPages().get(pageNumberToRead));   
    document.save("basepath/first_page.pdf");  
    document.close();  
}catch(Exception e){}

Apache Commons IO仅下载第一个PDF页面

1 个答案: