Hadoop Basics: Sorting in MapReduce


Author: 尹正杰

I. Types of sorting in MapReduce

1>. Partial sort

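　　A partial sort orders the data within each individual partition only. For example, suppose the first partition holds 1, 3, 5 and the second holds 2, 4. The values of the two partitions do not form a continuous sequence when placed side by side, yet the data inside each partition is sorted. Here is a rough sketch I drew: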

[Figure: sketch of a partial sort]
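The partial sort described above can be imitated in a few lines of plain Python. This is only a minimal sketch and does not use the Hadoop API: the function name `partial_sort` and the partitioning rule `(key - 1) % num_partitions` are assumptions, chosen so that the result matches the 1, 3, 5 / 2, 4 example.

```python
def partial_sort(keys, num_partitions):
    """Assign each key to a partition, then sort every partition on its own."""
    partitions = [[] for _ in range(num_partitions)]
    for key in keys:
        # Hypothetical partitioning rule standing in for a hash partitioner.
        partitions[(key - 1) % num_partitions].append(key)
    # Each partition is sorted independently; no ordering exists across partitions.
    return [sorted(p) for p in partitions]


result = partial_sort([5, 2, 1, 4, 3], 2)
print(result)  # [[1, 3, 5], [2, 4]]

# Concatenating the partitions does not give a globally sorted sequence.
flattened = [k for p in result for k in p]
print(flattened == sorted(flattened))  # False
```

Each list is ordered internally, but reading the partitions one after another gives 1, 3, 5, 2, 4, which is exactly the "no continuity between partitions" effect.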

2>. Total sort

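　　A total sort orders the data across all partitions. For example, the first partition holds 1, 2, 3 and the second holds 4, 5. The two partitions are clearly ordered relative to each other, and the range of values that each partition covers is easy to see. Here is a rough sketch I drew: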

[Figure: sketch of a total sort]
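One way to picture a total sort is range partitioning: every key in partition 0 is smaller than every key in partition 1, and each partition is then sorted. The following plain-Python sketch illustrates that idea under stated assumptions; the function name `total_sort` and the split point `3` are hypothetical, picked to reproduce the 1, 2, 3 / 4, 5 example, and no Hadoop classes are involved.

```python
from bisect import bisect_left


def total_sort(keys, split_points):
    """Range-partition keys by sorted split points, then sort each partition."""
    partitions = [[] for _ in range(len(split_points) + 1)]
    for key in keys:
        # Keys <= split_points[i] land in partition i; larger keys go further right.
        partitions[bisect_left(split_points, key)].append(key)
    return [sorted(p) for p in partitions]


result = total_sort([5, 2, 1, 4, 3], [3])
print(result)  # [[1, 2, 3], [4, 5]]

# Concatenating the partitions in order yields a globally sorted sequence.
flattened = [k for p in result for k in p]
print(flattened == sorted(flattened))  # True
```

The only difference from a partial sort is how keys are assigned to partitions: by value range instead of by an order-agnostic rule.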

3>. Secondary sort

　　A secondary sort means that, on top of sorting by key, the values belonging to each key are sorted as well.

[Figure: sketch of a secondary sort]
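The effect of a secondary sort can be sketched in plain Python by sorting on the pair (key, value) and then grouping by key. This is an illustration only, not Hadoop code; the function name `secondary_sort` and the sample pairs are assumptions.

```python
from itertools import groupby


def secondary_sort(pairs):
    """Sort (key, value) pairs by key, then by value, and group the values per key."""
    # Sorting on the composite (key, value) orders values within each key.
    ordered = sorted(pairs, key=lambda kv: (kv[0], kv[1]))
    # Grouping only by key keeps the per-key value order produced above.
    return [(key, [value for _, value in group])
            for key, group in groupby(ordered, key=lambda kv: kv[0])]


result = secondary_sort([("b", 2), ("a", 3), ("b", 1), ("a", 1)])
print(result)  # [('a', [1, 3]), ('b', [1, 2])]
```

The keys come out in order, and the values under each key arrive already sorted, so no extra sorting is needed when they are consumed.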

II. Approaches to implementing a total sort

26 May, 2017: Release 3.0.0-alpha3 available
This is a security release in the 3.0.0 release line. It consists of alpha2 plus security fixes, along with necessary build-related fixes. Users on 3.0.0-alpha1 and 3.0.0-alpha2 are encouraged to upgrade to 3.0.0-alpha3.

Please note that alpha releases come with no guarantees of quality or API stability, and are not intended for production use.

Users are encouraged to read the overview of major changes coming in 3.0.0. The alpha3 release notes and changelog detail the changes since 3.0.0-alpha2.

Contents of the wordCount.txt file