制表符分隔字符串到数组

时间:2012-08-10 15:22:17

标签: java arrays string csv

我想要实现的是从Excel表格(保存在Txt,制表符分隔)中逐行读取数据,每个单独的列是我想要存储在数组中的不同数据。

我尝试了不同的方式..我甚至从互联网上下载了CSVReader类,但它没有用。 至少这次它的阅读真正的人物而不是西伯利亚。

我现在的版本是使用bufferedReader和String Tokenizer。 但它没有正确阅读。

以下是代码:

     import java.io.BufferedReader;
     import java.io.BufferedWriter;
     import java.io.File;
     import java.io.FileNotFoundException;
     import java.io.FileReader;
     import java.io.FileWriter;
     import java.io.IOException;
     import java.util.StringTokenizer;

     import com.csvreader.CsvReader;

     import au.com.bytecode.opencsv.CSVReader;


     public class excelToText{
     public static void main(String[] args) throws IOException {

    try
    {

        //csv file containing data
          BufferedReader CSVFile = new BufferedReader(new          FileReader("C:/Users/nhajjar/workspace/MB/src/yoyo.txt"));


          // Read first line.
          String dataRow; 
          int lineNumber = 0; 


          while ((dataRow = CSVFile.readLine() )!= null)
          {
                String [] dataArray;
                lineNumber++; 



                String delimiter = "\t";
                /* given string will be split by the argument delimiter provided. */
                dataArray = dataRow.split(delimiter);
                /* print substrings */

// for(int i = 0; i< dataArray.length; i ++) // System.out.println(dataArray [i]);

                String PrinterName = dataArray[0];
                String Model = dataArray[1];
                String IP = dataArray[2];
                String Location = dataArray[3];
                String Department = dataArray[4];
                String PrimServer = dataArray[5];
                String SecServer = dataArray[6];
                String ShareName = dataArray[7];
                String GroupNamePrefix = dataArray[8];
                String GroupNameSuffix = dataArray[9];
                String GroupNameFinal = dataArray[10];
                String WSPPrefix = dataArray[11];
                String WSPFull = dataArray[12];
                String PrimWSP = dataArray[13];
                String SecWSP = dataArray[14];

              System.out.println("PrinterName is : " + PrinterName);
                System.out.println("Model is : " + Model);
                System.out.println("IP is : " + IP);
                System.out.println("Location is : " + Location);
                System.out.println("Department is : " + Department);
                System.out.println("PrimServer is : " + PrimServer);
                System.out.println("SecServer is : " + SecServer);
                System.out.println("ShareName is : " + ShareName);
                System.out.println("GroupNamePrefix is : " + GroupNamePrefix);
                System.out.println("GroupNameSuffix is : " + GroupNameSuffix);
                System.out.println("GroupNameFinal is : " + GroupNameFinal);
                System.out.println("WSPPrefix is : " + WSPPrefix);
                System.out.println("WSPFull is : " + WSPFull);
                System.out.println("PrimWSP is : " + PrimWSP);
                System.out.println("PrimWSP is : " + PrimWSP);
                System.out.println("SecWSP is : " + SecWSP);



                /*//writing file for ones that were not sent out. 
                  File file = new File("write.txt");
                  BufferedWriter output = new BufferedWriter(new FileWriter(file));
                  */

            }




            CSVFile.close();

        }   catch (FileNotFoundException e) {
            e.printStackTrace();
        }   catch (IOException e) {
            e.printStackTrace();
        }   catch(Exception e)          {
          System.out.println("Exception while reading/writing csv file: " + e);                   
        }// end Exceptions 

  }// end try block 
     }

输入是:

    Printer name    Model   IP  Location    Department  Primary Server  Secondary Server    Share Name
    Boundary Sprinter Techs Lexmark E360dn      Boundary    Sprinter    s173m928site    s173mho1site    928-Sprinter-techs-L-E360dn
    Boundary Sprinter Xerox 7232    Xerox WorkCentre 7232       Boundary    Sprinter    s173m928site    s173mho1site    928-Sprinter-WC7232
    Boundry Parts   HP LaserJet P2055dn     Boundary    Parts   s173m928site    s173mho1site    928-Parts-LJ-P2055dn
    Boundry Sales   HP Color LaserJet CP4005        Boundary    Sales   s173m928site    s173mho1site    928-Sales-Main-CP4005
    Boundry Techs East  HP LaserJet P3015       Boundary    Techs East  s173m928site    s173mho1site    928-Techs-east-LJ-P3015
    Boundry Techs West  Lexmark E352dn      Boundary    Techs West  s173m928site    s173mho1site    928-Techs-west-L-E352dn
    Concord     Lexmark E360dn      Concord         s173mho1site    
    Dundas Parts    Xerox WorkCentre 7232       Dundas  Parts   s173m910site    s173mho1site    910-Parts-WC-7232
    Dundas Preowned Xerox WorkCentre 7425       Dundas  Preowned    s173m910site    s173mho1site    910-PreOwned-WC-7425
    Dundas Sales 2nd Floor  HP Color LaserJet CP4025        Dundas  Sales   s173m910site    s173mho1site    910-Sales-2nd-CP4025
    Dundas Sales Main Floor HP Color LaserJet CP4025        Dundas  Sales   s173m910site    s173mho1site    910-Sales-Main-CP4025


output im getting is : 


PrinterName is : Printer name 
Model is : Model
IP is : IP
Location is : Location
Department is : Department
PrimServer is : Primary Server 
SecServer is : Secondary Server
ShareName is : Share Name
GroupNamePrefix is : Group Name Prefix
GroupNameSuffix is : Group Name Suffix
GroupNameFinal is : Group Name Final
WSPPrefix is : WSP Prefix
WSPFull is : WSP Full
PrimWSP is : Primary WSP
PrimWSP is : Primary WSP
SecWSP is : Secondary WSP


PrinterName is : Boundary Sprinter Techs
Model is : Lexmark E360dn
IP is : 53.254.177.138
Location is : Boundary
Department is : Sprinter
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sprinter-techs-L-E360dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sprinter-techs-L-E360dn
GroupNameFinal is : D173_PRINTER-928-Sprinter-techs-L-E360dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sprinter-techs-L-E360dn.wsp;D173_PRINTER-928-Sprinter-techs-L-E360dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn


PrinterName is : Boundary Sprinter Xerox 7232
Model is : Xerox WorkCentre 7232
IP is : 53.254.177.136
Location is : Boundary
Department is : Sprinter
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sprinter-WC7232
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sprinter-WC7232
GroupNameFinal is : D173_PRINTER-928-Sprinter-WC7232
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sprinter-WC7232.wsp;D173_PRINTER-928-Sprinter-WC7232
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-WC7232
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-WC7232
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sprinter-WC7232


PrinterName is : Boundry Parts
Model is : HP LaserJet P2055dn
IP is : 53.254.193.222
Location is : Boundary
Department is : Parts
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Parts-LJ-P2055dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Parts-LJ-P2055dn
GroupNameFinal is : D173_PRINTER-928-Parts-LJ-P2055dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Parts-LJ-P2055dn.wsp;D173_PRINTER-928-Parts-LJ-P2055dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Parts-LJ-P2055dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Parts-LJ-P2055dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Parts-LJ-P2055dn


PrinterName is : Boundry Sales
Model is : HP Color LaserJet CP4005
IP is : 53.254.193.117
Location is : Boundary
Department is : Sales
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sales-Main-CP4005
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sales-Main-CP4005
GroupNameFinal is : D173_PRINTER-928-Sales-Main-CP4005
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sales-Main-CP4005.wsp;D173_PRINTER-928-Sales-Main-CP4005
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sales-Main-CP4005
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sales-Main-CP4005
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sales-Main-CP4005


PrinterName is : Boundry Techs East
Model is : HP LaserJet P3015
IP is : 53.254.193.220
Location is : Boundary
Department is : Techs East
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Techs-east-LJ-P3015
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Techs-east-LJ-P3015
GroupNameFinal is : D173_PRINTER-928-Techs-east-LJ-P3015
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Techs-east-LJ-P3015.wsp;D173_PRINTER-928-Techs-east-LJ-P3015
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-east-LJ-P3015
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-east-LJ-P3015
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Techs-east-LJ-P3015


PrinterName is : Boundry Techs West
Model is : Lexmark E352dn
IP is : 53.254.193.221
Location is : Boundary
Department is : Techs West
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Techs-west-L-E352dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Techs-west-L-E352dn
GroupNameFinal is : D173_PRINTER-928-Techs-west-L-E352dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Techs-west-L-E352dn.wsp;D173_PRINTER-928-Techs-west-L-E352dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-west-L-E352dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-west-L-E352dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Techs-west-L-E352dn
Exception while reading/writing csv file: java.lang.ArrayIndexOutOfBoundsException: 7

注意:我关闭了输出。这些'Null块'更多......

4 个答案:

答案 0 :(得分:2)

看起来你过于复杂了。 CSV文件具有逗号分隔值。因此,您的文件数据应如下所示:

第1行:细胞1,细胞2,细胞3,细胞4等......

第2行:第2行的单元格1,第2行的第2单元格等......

您使用制表符分隔文件,因此需要在选项卡上进行拆分

//Read first line 
String dataRow;
int lineNumber = 0;
while((dataRow = CSVFile.readline()) != null)
{
    String [] dataArray;
    lineNumber++;
    String delimiter = "\t";
    dataArray = dataRow.split(delimiter); //Now dataArray contains all the tab delimited cells that were on line one of the .txt file
    //Start assigning the information to your variables since it is stored in dataArray
    String PrinterName = dataArray[0];//This would assign the first cell (row 1, column 1 in excel) that was read from the text file. From looking at your input this should be "Printer name"
    String Model = dataArray[1];
    String IP = dataArray[2];
    //etc...Assign the rest
    //Print your output
    //Do anything else you need to do
 }//end the while loop

以下是一些消息来源:

http://www.java-examples.com/java-string-split-example

注意:此示例更接近您正在使用的“\ t”而不是“,”: http://www.daniweb.com/software-development/java/threads/17262/reading-in-a-.csv-file-and-loading-the-data-into-an-array

希望这会有所帮助

答案 1 :(得分:0)

标记只按空格和加号划分。这是因为StringTokenizer的第二个参数不允许正则表达式。尝试使用"\t"字段似乎真正分开。

顺便说一下,每行真的不需要100个字符串数组。

答案 2 :(得分:0)

我建议使用String.split()函数,因为这会大大简化您的代码。

答案 3 :(得分:0)

我不建议您自己使用.split或StringTokenizer类拆分数据。他们不考虑现场报价等...

使用像(如上所述)OpenCSV或StrTokenizer(commons-lang的一部分)这样的库来为您进行解析。

使用制表符分隔的解析器的StrTokenizer示例:

StrTokenizer tokenizer = StrTokenizer.getTSVInstance();    
while ( (line = ...) != null) {
    tokenizer.reset(line);
    String tokArray[] = tokenizer.getTokenArray();
}