While writing a WordCount program, I ran into a strange problem: the output folder could be created on HDFS, but it contained no result files from the run. As shown in the screenshot, the output3 folder holds 0 files: http://s1/mw690/002aBHAXzy6S1S7wrW870&690

This had me completely baffled. After fiddling with it for half a day, I noticed by chance that I had miswritten my Map-side and Reduce-side methods as Map() and Reduce(); renaming them to map() and reduce() fixed it. In hindsight this makes sense: the capitalized names do not override Mapper.map() and Reducer.reduce(), so Hadoop presumably falls back to the default implementations, whose output types no longer match the ones declared on the job, and the failed tasks leave the output directory empty. My program is as follows:
import java.io.IOException;
import java.util.Iterator;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class TxtCounter {

    public static void main(String[] args)
            throws IOException, ClassNotFoundException, InterruptedException {
        Configuration conf = new Configuration();
        String inputPathString = "hdfs://master:9000/inputFile";
        String outputPathString = "hdfs://master:9000/output2";

        Job job = new Job(conf, TxtCounter.class.getSimpleName());
        job.setMapperClass(TxtMapper.class);
        job.setCombinerClass(TxtReducer.class);
        job.setReducerClass(TxtReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(inputPathString));
        FileOutputFormat.setOutputPath(job, new Path(outputPathString));

        // waitForCompletion returns false when the job fails, which is worth
        // checking: a failed job is exactly what leaves the output folder empty.
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }

    static class TxtMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
        // Must be map(), lowercase. @Override makes the compiler reject a
        // typo such as Map() that would otherwise fail silently.
        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Split each input line on spaces and emit (word, 1) per token.
            String[] strs = value.toString().split(" ");
            for (String str : strs) {
                context.write(new Text(str), new IntWritable(1));
            }
        }
    }

    static class TxtReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            // Sum the 1s emitted for each word.
            int sum = 0;
            Iterator<IntWritable> it = values.iterator();
            while (it.hasNext()) {
                IntWritable value = it.next();
                sum += value.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }
}
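The takeaway is that Java's @Override annotation (which I have added to the fixed code above) would have caught this immediately: a method marked @Override that does not actually override anything in the superclass is a compile error, rather than a method Hadoop silently never calls. A minimal sketch of the idea; DemoMapper is a made-up name for illustration:

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class DemoMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    // Uncommenting this miscapitalized method makes compilation fail with
    // "method does not override or implement a method from a supertype".
    // Without @Override it would compile and simply never be called:
    //
    // @Override
    // protected void Map(LongWritable key, Text value, Context context) { }

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // Trivial body: emit the whole line with a count of 1.
        context.write(value, new IntWritable(1));
    }
}

Most IDEs also insert @Override automatically when generating override stubs, which avoids this kind of mistake entirely.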