将Mapreduce1项目迁移到YARN指南

时间:2015-06-30 11:03:07

标签: java hadoop mapreduce yarn

我有一个基于MapReduce 1的大型java项目,我想将它迁移到YARN。这里的任何人都知道如何做到这一点的好指南?

感谢。

1 个答案:

答案 0 :(得分:1)

I worked on up gradation of production Hadoop cluster along with the application. The process we followed is:

  1. We build a small cluster from desired distribution of hadoop which support YARN.
  2. We tested entire code like Map/Reduce program, Pig program hive script, sqoop script etc in new cluster and did the required changes in code. Generally there is not too much code side change. Just need to validated supported function, methods etc.
  3. Verified the output from old output, If your output match with old output then your code is good to go.If there is mismatch then you need to fix your code.
  4. Upgraded the hadoop cluster.
  5. Deployed the code on upgraded cluster.

Note: You cannot replace only MapReduce version 1 with YARN. You also need to upgrade pig, hive, sqoop etc. I suggests pic up the HDP stack which you need and then upgrade your cluster. All supported tools, utility and software should be upgraded with supported version of Hadoop.