我必须在<<
和>>
之间替换字符串。但是我无法这样做。
public void doIt( String inputFile, String outputFile) throws IOException, COSVisitorException
{
PDDocument doc = null;
try
{
doc = PDDocument.load( inputFile );
List pages = doc.getDocumentCatalog().getAllPages();
for( int i=0; i<pages.size(); i++ )
{
PDPage page = (PDPage)pages.get( i );
PDStream contents = page.getContents();
PDFStreamParser parser = new PDFStreamParser(contents.getStream());
parser.parse();
List tokens = parser.getTokens();
for( int j=0; j<tokens.size(); j++ )
{
Object next = tokens.get( j );
if( next instanceof PDFOperator )
{
PDFOperator op = (PDFOperator)next;
if( op.getOperation().equals( "Tj" ))
{
Scanner in = new Scanner(System.in);
COSString previous = (COSString)tokens.get( j-1 );
String string = previous.getString();
if(string.startsWith("<<") && string.endsWith(">>"))
{
System.out.println(string);
System.out.println("enter the word to be replaced");
String string2=in.nextLine();
string = string.replaceAll( string, string2 );
previous.reset();
previous.append( string.getBytes() );
}
}
else if( op.getOperation().equals( "TJ" ))
{
COSArray previous = (COSArray)tokens.get( j-1 );
for( int k=0; k<previous.size(); k++ )
{
Scanner in = new Scanner(System.in);
Object arrElement = previous.getObject( k );
if(arrElement instanceof COSString)
{
COSString cosString = (COSString)arrElement;
String string = cosString.getString();
if(string.startsWith("<<") && string.endsWith(">>"))
{
System.out.println(string);
System.out.println("enter the word to be replaced");
String string2=in.nextLine();
string = string.replaceAll( string, string2 );
cosString.reset();
cosString.append( string.getBytes());
}
}
}
}
}
}
PDStream updatedStream = new PDStream(doc);
OutputStream out = updatedStream.createOutputStream();
ContentStreamWriter tokenWriter = new ContentStreamWriter(out);
tokenWriter.writeTokens(tokens);
page.setContents(updatedStream);
}
doc.save( outputFile );
System.out.println("Done!! Now You can Open.");
}
finally
{
if( doc != null )
{
doc.close();
}
}
}
答案 0 :(得分:3)
请阅读chapter 6 of my book的介绍。您假设PDF是用于编辑文本的格式。 PDF不是为文字处理而设计的。
当然:也许您正在询问如何创建静态表单,如本书第6.3.5节所述,但我怀疑AcroForm技术的静态性能是否满足您的需求。纯XFA表单(动态PDF)可以解决您的问题,但解释XFA不是可以在SO的答案范围内完成的。 XFA规范长达数百页。正如邓肯琼斯的评论所示,你应该先做一些初步的工作。