Question

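I feel like I'm missing something simple here (as usual).

I'm trying to read a PGM image using Java. Matlab does it just fine: printing the image pixels in Matlab (for example, for a small 32x32 image) gives me something like this: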

1 0 11 49 94 118 118 106 95 88 85 96 124 143 142 133

However, my Java reader outputs:

1 0 11 49 94 118 118 106 95 88 85 96 124 65533 65533 65533

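It seems that pixel values above 127 are replaced with 65533, but it also gets some seemingly random values wrong, and it even assigns the value -1 to almost the entire bottom row.

Here is the code I'm using: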

String filePath = "imagepath.pgm";
FileInputStream fileInputStream = new FileInputStream(filePath);
BufferedReader bufferedReader = new BufferedReader(new InputStreamReader(fileInputStream));

// read the header information ...

int [][] data2D = new int [picWidth] [picHeight];

for (int row = 0; row < picHeight; row++) {
  for (int col = 0; col < picWidth; col++) {
    data2D[row][col] = bufferedReader.read();
    System.out.print(data2D[row][col] + " ");
  }
  System.out.println();
}

fileInputStream.close();

Any ideas are much appreciated.

Edit: Here are the unsigned PGM values:

     1     0    11    49    94   118   118   106    95    88    85    96   124   143   142   133
    30    26    29    57    96   122   125   114   102    94    91   101   127   146   145   136
    96    85    70    75   101   128   136   126   111   106   106   112   131   149   153   147
   163   147   114    93    99   120   132   123   110   113   124   129   137   154   166   168
   215   195   149   105    88    99   114   111   106   123   148   158   160   174   191   197
   245   224   173   115    81    82   100   109   117   144   179   194   194   205   222   230
   235   217   170   115    78    78   113   117   100    83    80   212   214   226   244   253
   178   167   135    93    68    78   123   129   106    77    69   202   204   222   244   255
   114   110    92    64    54    81   107   105    83    59    56   182   184   201   222   231
    79    80    71    52    55    97    67    55    41    33    42   184   179   181   185   183
    62    66    65    52    63   115    29    16    12    17    30   209   197   174   150   132
    40    47    52    44    55   109   171   196   188   186   208   229   218   179   136   107
    31    38    44    37    43    89   145   167   158   159   191   223   219   179   133   105
    48    52    56    51    57    91   128   133   117   120   157   196   200   168   128   105
    64    67    70    73    87   114   127   107    79    81   118   159   173   154   123   104
    63    67    73    83   107   132   129    91    54    54    88   130   153   146   123   106

The header looks like this:

P5
# MatLab PGMWRITE file, saved 27-Jun-2002
16 16
255

Edit #2

Here is the full output of the proof-of-concept code:

Skipping unknow token: ""
Skipping unknow token: "1^vvj_XU`|���"
Skipping unknow token: ""
Skipping unknow token: "9`z}rf^[e���`UFKe��~ojjp������r]cx�{nq|������ÕiXcroj{��������sQRdmu��������٪sNNqudSP�����]DN{�jME�����rn\@6QkiS;8�����OPG47aC7)!*�����>BA4?s"
Skipping unknow token: ""
Skipping unknow token: ""
Skipping unknow token: "�Ů��(/4,7m�ļ���ڳ�k"
Skipping unknow token: "&,%+Y������۳�i04839[��ux��Ȩ�i@CFIWrkOQv���{h?CISk��[66X���{j"
Exception in thread "main" java.util.NoSuchElementException
    at java.util.Scanner.throwFor(Scanner.java:838)
    at java.util.Scanner.next(Scanner.java:1347)
    at Test.main(Test.java:49)

Line 49, the one referenced in the exception, is:

System.out.println(String.format("Skipping unknow token: \"%s\"", scan.next()));

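The problem, I'm sure, is that these image files contain ASCII text/numbers together with binary image data. But if Java has no trouble reading PNG, why is there no support for PGM?

Edit #3

OK, I found an implementation that works... unfortunately, it is deprecated: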

  String filePath = "imagepath.pgm";
  FileInputStream fileInputStream = new FileInputStream(filePath);
  DataInputStream dis = new DataInputStream(fileInputStream);
  StreamTokenizer streamTokenizer = new StreamTokenizer(dis);

  // read header text using StreamTokenizer.nextToken()

  data2D = new int [picWidth] [picHeight];
  for (int row = 0; row < picHeight; row++) {
    for (int col = 0; col < picWidth; col++) {
      data2D[row][col] = dis.readUnsignedByte();
      System.out.print(data2D[row][col] + " ");
    }
    System.out.println();
  }

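According to the Java documentation, the StreamTokenizer(InputStream) constructor is deprecated because the DataInputStream.readLine() method does not properly convert raw bytes to characters. However, it seems to work on the header in this particular case, and it obviously works for the binary image data that follows.

Unfortunately, it is still deprecated, and mixing in a BufferedReader as the documentation suggests only results in EOFExceptions after reading the header and then trying to use the DataInputStream to read the raw bytes. Still looking for a solution...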

Answer 1

The problem with your code is that you are using the wrong class to read raw data from the file. As the BufferedReader documentation says:

public int read() throws IOException

Reads a single character.

Returns: The character read, as an integer in the range 0 to 65535 (0x00-0xffff), or -1 if the end of the stream has been reached

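So each call to BufferedReader's read() method actually consumes one or more bytes from the input stream (depending on the character encoding), which is not what you want. The 65533 values are U+FFFD, the Unicode replacement character, which the decoder emits when it meets bytes that are not valid in that encoding. This also explains why you get so many -1 values: the stream ended much earlier than you expected.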
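To make the difference concrete, here is a minimal sketch that reads the same bytes once through a Reader and once through a plain InputStream. It assumes the bytes are decoded as UTF-8 (the asker's platform default is not stated, so that is a guess), and the class and method names are only illustrative.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class ReaderVsStreamDemo {

    // Reads raw.length values through a character-decoding Reader,
    // which is what BufferedReader.read() does under the hood.
    static int[] readAsChars(byte[] raw) throws IOException {
        Reader reader = new InputStreamReader(
                new ByteArrayInputStream(raw), StandardCharsets.UTF_8); // assumed charset
        int[] values = new int[raw.length];
        for (int i = 0; i < values.length; i++) {
            values[i] = reader.read(); // a decoded char, or -1 at end of stream
        }
        return values;
    }

    // Reads raw.length values as unsigned bytes, with no decoding at all.
    static int[] readAsBytes(byte[] raw) throws IOException {
        InputStream in = new ByteArrayInputStream(raw);
        int[] values = new int[raw.length];
        for (int i = 0; i < values.length; i++) {
            values[i] = in.read(); // 0..255, or -1 at end of stream
        }
        return values;
    }

    public static void main(String[] args) throws IOException {
        // End of the first pixel row from the question: 124 143 142 133
        byte[] invalid = {124, (byte) 143, (byte) 142, (byte) 133};
        System.out.println(Arrays.toString(readAsChars(invalid)));
        System.out.println(Arrays.toString(readAsBytes(invalid)));

        // 195 149 happens to be a valid two-byte UTF-8 sequence, so the
        // Reader returns one char for two bytes and runs out of input early.
        byte[] twoByteChar = {(byte) 195, (byte) 149, 105};
        System.out.println(Arrays.toString(readAsChars(twoByteChar)));
        System.out.println(Arrays.toString(readAsBytes(twoByteChar)));
    }
}
```

Under that UTF-8 assumption, the first case reproduces the 65533 values from the question, and the second shows how characters can be "used up" faster than bytes, which is where the trailing -1 values come from.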

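Since the PGM header contains ASCII decimal values, it is easy to parse using the Scanner class.

Here is some almost untested code that shows how to read a PGM image, assuming that:

it contains a single comment after the magic number (i.e. it has no lines starting with # other than the second one)
the PGM header is exactly 4 lines long.

Here is the code: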

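import java.io.DataInputStream;
import java.io.FileInputStream;
import java.util.Scanner;

String filePath = "image.pgm";
FileInputStream fileInputStream = new FileInputStream(filePath);
Scanner scan = new Scanner(fileInputStream);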
// Discard the magic number
scan.nextLine();
// Discard the comment line
scan.nextLine();
// Read pic width, height and max value
int picWidth = scan.nextInt();
int picHeight = scan.nextInt();
int maxvalue = scan.nextInt();

fileInputStream.close();

// Now parse the file as binary data
fileInputStream = new FileInputStream(filePath);
DataInputStream dis = new DataInputStream(fileInputStream);

// look for 4 lines (i.e.: the header) and discard them
int numnewlines = 4;
while (numnewlines > 0) {
    char c;
    do {
        c = (char)(dis.readUnsignedByte());
    } while (c != '\n');
    numnewlines--;
}

// read the image data
int[][] data2D = new int[picHeight][picWidth];
for (int row = 0; row < picHeight; row++) {
    for (int col = 0; col < picWidth; col++) {
        data2D[row][col] = dis.readUnsignedByte();
        System.out.print(data2D[row][col] + " ");
    }
    System.out.println();
}

// release the file handle once the pixels have been read
dis.close();

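Still to be implemented: support for comment lines, dividing each element's value by maxvalue, error checking for malformed files, and exception handling. I tested it on PGM files with UNIX line endings, but it should work with Windows line endings too.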
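As a rough illustration of how some of those to-do items could be tackled, here is a minimal sketch that parses the header and the pixel data from a single stream, skips # comments wherever they start a header token, and offers values scaled by maxvalue. The class PgmSketch and its methods are hypothetical names made up for this example. It assumes a binary P5 file with maxvalue at most 255 and exactly one whitespace byte between maxvalue and the pixel data, so a header ending in \r\n would not be handled.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public class PgmSketch {
    final int width;
    final int height;
    final int maxValue;
    final int[][] pixels; // indexed as [row][col]

    private PgmSketch(int width, int height, int maxValue, int[][] pixels) {
        this.width = width;
        this.height = height;
        this.maxValue = maxValue;
        this.pixels = pixels;
    }

    // Returns the next whitespace-delimited header token, skipping
    // '#' comments. Consumes the single whitespace byte after the token.
    static String nextToken(InputStream in) throws IOException {
        StringBuilder token = new StringBuilder();
        while (true) {
            int b = in.read();
            if (b == -1) {
                if (token.length() > 0) return token.toString();
                throw new EOFException("Unexpected end of PGM header");
            }
            if (b == '#' && token.length() == 0) {
                // Comment: discard everything up to the end of the line.
                while (b != '\n' && b != -1) b = in.read();
            } else if (Character.isWhitespace(b)) {
                if (token.length() > 0) return token.toString();
            } else {
                token.append((char) b);
            }
        }
    }

    static PgmSketch read(InputStream in) throws IOException {
        String magic = nextToken(in);
        if (!"P5".equals(magic)) {
            throw new IOException("Not a binary PGM (P5): " + magic);
        }
        int width = Integer.parseInt(nextToken(in));
        int height = Integer.parseInt(nextToken(in));
        int maxValue = Integer.parseInt(nextToken(in));
        if (width <= 0 || height <= 0 || maxValue <= 0 || maxValue > 255) {
            throw new IOException("Unsupported PGM header in this sketch");
        }
        // Same stream, no Reader in between: bytes stay bytes.
        DataInputStream data = new DataInputStream(in);
        int[][] pixels = new int[height][width];
        for (int row = 0; row < height; row++) {
            for (int col = 0; col < width; col++) {
                pixels[row][col] = data.readUnsignedByte(); // EOFException if truncated
            }
        }
        return new PgmSketch(width, height, maxValue, pixels);
    }

    // Pixel values scaled to the range 0.0 .. 1.0.
    double[][] normalized() {
        double[][] scaled = new double[height][width];
        for (int row = 0; row < height; row++) {
            for (int col = 0; col < width; col++) {
                scaled[row][col] = pixels[row][col] / (double) maxValue;
            }
        }
        return scaled;
    }

    public static void main(String[] args) throws IOException {
        // Tiny in-memory 2x2 image so the example runs without a file.
        ByteArrayOutputStream file = new ByteArrayOutputStream();
        file.write("P5\n# a comment line\n2 2\n255\n".getBytes(StandardCharsets.US_ASCII));
        file.write(new byte[] {1, (byte) 143, 0, (byte) 255});
        PgmSketch image = read(new ByteArrayInputStream(file.toByteArray()));
        for (int[] row : image.pixels) {
            for (int value : row) System.out.print(value + " ");
            System.out.println();
        }
    }
}
```

For a real file, something like PgmSketch.read(new BufferedInputStream(new FileInputStream("image.pgm"))) should work the same way. Because the header and the pixels come from one stream, there is no need to open the file twice or to count newlines.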

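Let me stress that this is not a robust or complete implementation of a PGM parser. This code is meant only as a proof of concept that may be enough for your needs.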

If you really need a robust PGM parser, you can use the tools provided by Netpbm.
