Suggestions for an API/framework to process large XLS files in Java

Time: 2013-01-15 08:47:59

Tags: java open-source xls

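I need to choose a technology for processing large XLS files in Java.

  1. It must process these files without throwing an out-of-memory exception.

  2. It must be optimized for memory (heap) usage.

  3. Any suggestions?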

1 answer:

Answer 0 (score: 1)

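The Apache POI framework is the best choice - link

Its event-based API keeps memory use low: it streams each sheet and hands you cells as they are read, rather than building the whole workbook in memory.

Implemented correctly, as shown here, it can handle much larger Excel sheets.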
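To illustrate why an event-based reader stays within a small heap, here is a minimal sketch that uses only the JDK's SAX parser, the same kind of parser POI's event API drives for .xlsx sheet XML. The class name `SaxRowCounter` and the tiny `<row>`/`<c>` sample document are assumptions made for illustration; they are not POI code. The handler sees each element once and keeps only two counters, so memory use does not grow with the size of the input.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

public class SaxRowCounter {

    /** Counts row and cell elements as they stream past; nothing else is retained. */
    static final class Counter extends DefaultHandler {
        int rows;
        int cells;

        @Override
        public void startElement(String uri, String localName, String qName, Attributes attrs) {
            // The default parser is not namespace-aware, so qName carries the element name.
            if ("row".equals(qName)) {
                rows++;
            } else if ("c".equals(qName)) {
                cells++;
            }
        }
    }

    /** Returns {rowCount, cellCount} for the given sheet-like XML stream. */
    public static int[] count(InputStream in)
            throws IOException, SAXException, ParserConfigurationException {
        Counter counter = new Counter();
        SAXParserFactory.newInstance().newSAXParser().parse(in, counter);
        return new int[] {counter.rows, counter.cells};
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical sample data shaped loosely like a worksheet part.
        String xml = "<sheetData>"
                + "<row r=\"1\"><c r=\"A1\"><v>1</v></c><c r=\"B1\"><v>2</v></c></row>"
                + "<row r=\"2\"><c r=\"A2\"><v>3</v></c></row>"
                + "</sheetData>";
        int[] result = count(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        System.out.println("rows=" + result[0] + ", cells=" + result[1]); // rows=2, cells=3
    }
}
```

A DOM-style reader would have to hold every row and cell object at once, which is where out-of-memory errors on large sheets typically come from.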
  

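Example of reading a large file (this uses POI's XSSF event API, which reads .xlsx files; legacy .xls files go through POI's separate HSSF event API):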

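    import java.io.File;
    import java.io.IOException;
    import java.io.InputStream;

    import javax.xml.parsers.ParserConfigurationException;
    import javax.xml.parsers.SAXParser;
    import javax.xml.parsers.SAXParserFactory;

    import org.apache.poi.openxml4j.exceptions.OpenXML4JException;
    import org.apache.poi.openxml4j.opc.OPCPackage;
    import org.apache.poi.openxml4j.opc.PackageAccess;
    import org.apache.poi.xssf.eventusermodel.ReadOnlySharedStringsTable;
    import org.apache.poi.xssf.eventusermodel.XSSFReader;
    import org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler;
    import org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler.SheetContentsHandler;
    import org.apache.poi.xssf.model.StylesTable;
    import org.xml.sax.ContentHandler;
    import org.xml.sax.InputSource;
    import org.xml.sax.SAXException;
    import org.xml.sax.XMLReader;

    public class LargeExcelReader {

        /** Streams every sheet of the file, one at a time, through processSheet. */
        public void parseExcel(File file) throws IOException {
            OPCPackage container = null;
            try {
                // The file is only read, so open the package read-only.
                container = OPCPackage.open(file.getAbsolutePath(), PackageAccess.READ);
                ReadOnlySharedStringsTable strings = new ReadOnlySharedStringsTable(container);
                XSSFReader xssfReader = new XSSFReader(container);
                StylesTable styles = xssfReader.getStylesTable();
                XSSFReader.SheetIterator iter = (XSSFReader.SheetIterator) xssfReader.getSheetsData();
                while (iter.hasNext()) {
                    // The sheet stream is closed even if parsing fails.
                    try (InputStream stream = iter.next()) {
                        processSheet(styles, strings, stream);
                    }
                }
            } catch (OpenXML4JException | SAXException e) {
                // Report the failure to the caller instead of only printing it.
                throw new IOException("Could not read " + file, e);
            } finally {
                if (container != null) {
                    container.revert(); // close without saving anything back
                }
            }
        }

        protected void processSheet(StylesTable styles, ReadOnlySharedStringsTable strings,
                InputStream sheetInputStream) throws IOException, SAXException {
            InputSource sheetSource = new InputSource(sheetInputStream);
            SAXParserFactory saxFactory = SAXParserFactory.newInstance();
            // Harmless on older POI releases; newer ones match elements by namespace.
            saxFactory.setNamespaceAware(true);
            try {
                SAXParser saxParser = saxFactory.newSAXParser();
                XMLReader sheetParser = saxParser.getXMLReader();
                // Callback signatures follow the POI release used in this answer (2013);
                // later releases changed endRow and cell, so adjust them to your version.
                ContentHandler handler = new XSSFSheetXMLHandler(styles, strings, new SheetContentsHandler() {
                    @Override
                    public void startRow(int rowNum) {
                        // called before the first cell of each row
                    }

                    @Override
                    public void endRow() {
                        // called after the last cell of each row
                    }

                    @Override
                    public void cell(String cellReference, String formattedValue) {
                        // handle one cell here, e.g. cellReference "B12"
                    }

                    @Override
                    public void headerFooter(String text, boolean isHeader, String tagName) {
                    }
                },
                false // false = report cached results rather than formula text
                );
                sheetParser.setContentHandler(handler);
                sheetParser.parse(sheetSource);
            } catch (ParserConfigurationException e) {
                throw new RuntimeException("SAX parser appears to be broken - " + e.getMessage(), e);
            }
        }
    }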
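The event API only calls `cell` for cells that are actually present in the file, so a callback usually has to work out the column from `cellReference` (for example `C7`) in order to notice gaps. The following is a minimal JDK-only sketch of that conversion; `CellRefs` and `columnIndex` are hypothetical names used for illustration, and it assumes plain references without `$` markers. POI's own `org.apache.poi.ss.util.CellReference` class can be used for the same purpose.

```java
public class CellRefs {

    /**
     * Returns the zero-based column index of a cell reference,
     * e.g. "A1" gives 0 and "AA10" gives 26.
     */
    public static int columnIndex(String cellReference) {
        int col = 0;
        int i = 0;
        // Column letters form a base-26 number with digits A=1 .. Z=26.
        while (i < cellReference.length()) {
            char ch = Character.toUpperCase(cellReference.charAt(i));
            if (ch < 'A' || ch > 'Z') {
                break; // reached the row digits
            }
            col = col * 26 + (ch - 'A' + 1);
            i++;
        }
        if (i == 0) {
            throw new IllegalArgumentException("No column letters in: " + cellReference);
        }
        return col - 1;
    }

    public static void main(String[] args) {
        System.out.println(columnIndex("A1"));   // 0
        System.out.println(columnIndex("Z3"));   // 25
        System.out.println(columnIndex("AA10")); // 26
    }
}
```

Inside `cell(...)`, comparing this index with the previous cell's index shows how many empty cells were skipped in between.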