我是hadoop编程的新手,在尝试减少侧连接时遇到错误错误:java.lang.NumberFormatException:对于输入字符串:" 100&#34 ;. 我已检查输入文件是否正确。
我的减速机代码是`
public void reduce (IntWritable key , Iterable<Text> value , Context context) throws IOException , InterruptedException{
// value : TRAN\t product name \t Amt and CUST \t custometr name
String data = new String();
int amount=0;
String name =null;
StringBuilder s = new StringBuilder();
for (Text val : value){
String[] line = val.toString().split("\t");
if (line[0].equals(new String("TRANS").trim())){
//data += line[1]+"\t";
s.append(line[1]+"");
amount+=Integer.parseInt(line[2]);
}
else if(line[0].equals(new String("CUST").trim())){
name = line[1];
}
data= s.toString()+ Integer.toString(amount);
context.write(new Text(name), new Text(data));
我的两个映射器是:
package reduceSideJoin.reducejoin;
public class transMapper extends Mapper<LongWritable ,Text , IntWritable , Text > {
public void map(LongWritable key , Text value , Context context) throws IOException , InterruptedException{
String[] line = value.toString().split(",");
String data = "TRANS"+ "\t" + line[1]+"\t " + line[3];
context.write(new IntWritable (Integer.parseInt(line[0])), new Text(data) );
}
}
和
package reduceSideJoin.reducejoin;
public class userMapper extends Mapper<LongWritable, Text, IntWritable, Text> {
public void map (LongWritable key, Text value , Context context) throws IOException , InterruptedException{
String [] line = value.toString().split(",");
String data = "CUST"+"\t"+line[1];
context.write(new IntWritable (Integer.parseInt(line[0])), new Text(data));
}
}
我的文件是
交易数据
0001,crax,2,300
0002,munch,1,10
0003,lays,1,20
0004,ship,1,200
0005,barOne,3,400
0002,crax,2,300
0001,kurkure,3,100
0003,milk,1,20
0004,butter,2,300
0005,meat,1,1220
0002,color,1,230
0003,dailrymilk,1,20
和客户数据
0001,Sunil Kumar , Mumbai,India
0002,Vikas mandal, Haryana, India
0003,Karan, JFK,France
0004,manish,banglore,India
0005,devesh,meerut,India
我得错误16/05/30 00:23:00 INFO mapreduce.Job:地图100%减少0% 16/05/30 00:23:04 INFO mapreduce.Job:任务ID:attempt_1464547777880_0001_r_000000_0,状态:未通过 错误:java.lang.NumberFormatException:对于输入字符串:&#34; 100&#34; at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
有人可以帮我解决此问题吗
由于
答案 0 :(得分:1)
NumberFormatException是因为:
下面的行中有一个额外的空格(在最后一个“\ t”之后):
String data = "TRANS"+ "\t" + line[1]+"\t " + line[3];
您应该替换为:
String data = "TRANS"+ "\t" + line[1]+"\t" + line[3];
之后,您仍然会在行中收到NullPointerException:
context.write(new Text(name), new Text(data));
那是因为你只在“else”块中设置变量name
。
您可以通过以这种方式设置name
来解决此问题:
if (line[0].equals(new String("TRANS").trim())){
name = line[1]; // <==== here
s.append(line[1]+"");
amount+=Integer.parseInt(line[2]);
}