I found a few very good links on writing your own custom partitioner class for Hadoop and wanted to share them with you all.
A very good course on cloud computing and MapReduce:
http://www.cs.bgu.ac.il/~dsp112/The_Map-Reduce_Pattern
A mailing list discussion on writing a custom partitioner that gets at the job configuration through the Configurable interface:
http://lucene.472066.n3.nabble.com/Custom-partitioner-for-hadoop-td1335146.html
A very good blog post on how to write and use a Partitioner in the new Hadoop API:
http://cornercases.wordpress.com/2011/05/06/an-example-configurable-partitioner/
Besides these, if you want to look at existing implementations, Hadoop ships with the following by default.
All are in the package org.apache.hadoop.mapreduce.lib.partition:
BinaryPartitioner
Partition keys using a configurable part of the byte array.
HashPartitioner
Partition keys by their Object.hashCode().
KeyFieldBasedPartitioner
Defines a way to partition keys based on certain key fields
TotalOrderPartitioner
Partitioner effecting a total order by reading split points from an externally generated source.
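To make two of these concrete, here is a minimal sketch in plain Java, with no Hadoop dependency, of the arithmetic behind hash partitioning and of the split-point lookup idea behind total-order partitioning. The class name PartitionSketch and its method names are made up for illustration. A real partitioner would extend org.apache.hadoop.mapreduce.Partitioner and override getPartition(key, value, numPartitions). The split points here are hard-coded, whereas TotalOrderPartitioner reads them from an external file.

```java
import java.util.Arrays;

// Hypothetical helper class, for illustration only; not part of Hadoop.
class PartitionSketch {

    // Hash partitioning: mask off the sign bit so the value is never
    // negative, then take the remainder by the number of partitions.
    static int hashPartition(Object key, int numPartitions) {
        return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    // Range partitioning: n sorted split points define n + 1 partitions.
    // A key equal to a split point goes to the partition on its right.
    static int rangePartition(String key, String[] sortedSplitPoints) {
        int pos = Arrays.binarySearch(sortedSplitPoints, key) + 1;
        // binarySearch returns -(insertionPoint) - 1 when the key is absent,
        // so after adding 1 a negative value is -(insertionPoint).
        return pos < 0 ? -pos : pos;
    }

    public static void main(String[] args) {
        String[] splits = {"h", "p"}; // assumed split points: 3 partitions
        for (String key : new String[] {"apple", "kiwi", "zebra"}) {
            System.out.println(key
                + " hash=" + hashPartition(key, 4)
                + " range=" + rangePartition(key, splits));
        }
    }
}
```

Hash partitioning spreads keys evenly but scatters neighbouring keys across reducers. Range partitioning keeps each reducer's keys in one contiguous range, so the reducer outputs, read in partition order, form one globally sorted result.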
If you know of any other good links, please share them in the comments.
Thanks for reading.