您好我在Windows服务器上使用apache solr 3.1 在cmd中索引“不支持/禁用操作EI”PDFStreamEngine时,我看到异常 我有谷歌这个,但找不到任何解决方案
Apr 4, 2012 3:33:21 AM org.apache.solr.common.SolrException log
SEVERE: Exception in entity : null:org.apache.solr.handler.dataimport.DataImport
HandlerException: Unable to read content Processing Document # 3029
at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAnd
Throw(DataImportHandlerException.java:72)
at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEn
tityProcessor.java:130)
at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(Ent
ityProcessorWrapper.java:238)
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
r.java:591)
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
r.java:617)
at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.j
ava:267)
at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java
:186)
at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImpo
rter.java:353)
at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.j
ava:411)
at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.ja
va:392)
Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@1a8e75a
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199
)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
35)
at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEn
tityProcessor.java:128)
... 8 more
Caused by: java.lang.NullPointerException
at org.apache.pdfbox.pdmodel.PDPageNode.getCount(PDPageNode.java:109)
at org.apache.pdfbox.pdmodel.PDDocument.getNumberOfPages(PDDocument.java
:943)
at org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:1
07)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:88)
at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:91)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197
)
... 10 more
Apr 4, 2012 3:33:22 AM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EI
请帮助
谢谢
答案 0 :(得分:1)
这实际上是来自PDFBox的消息。这意味着PDF包含PDFBox不支持的运算符。更多细节可以在这里找到: