Custom Partitioner in Hadoop

I found few very very good links for writing own custom partitioner class for hadoop , Just wanted to share with you all.

 

A very good course being taught for Cloud computing and Map Reduce

http://www.cs.bgu.ac.il/~dsp112/The_Map-Reduce_Pattern

A mailing list discussion for writing Custom partitioner with Job context use by Configurable interface

http://lucene.472066.n3.nabble.com/Custom-partitioner-for-hadoop-td1335146.html

A very good blog post in how to use and write Partitioner  in new Hadoop API

http://cornercases.wordpress.com/2011/05/06/an-example-configurable-partitioner/

 

Besides this if you want to see implementations , there are by default following present in hadoop

All are under Package org.apache.hadoop.mapreduce.lib.partition

BinaryPartitioner

Partition keys using a configurable part of the bytes array

HashPartitioner

Partition keys by their Object.hashCode().

KeyFieldBasedPartitioner

Defines a way to partition keys based on certain key fields

TotalOrderPartitioner

Partitioner effecting a total order by reading split points from an externally generated source.

If you know some good link , please do share here in comments.

 

Thanks for reading.

No comments:

Post a Comment

Please share your views and comments below.

Thank You.