Sqoop2 Architecture and current status

Sqoop is a tool which is used to transfer data to/from RDBMS systems from/to Hadoop HDFS.

Sqoop is undergoing is a major architectural change with discussions going on for Sqoop2 feature proposals.

A very good overview about what any why changes are needed in Sqoop is documented in the proposal at Apache wiki below.

https://blogs.apache.org/sqoop/entry/apache_sqoop_highlights_of_sqoop

You can also go through the Sqoop2 presentation

Current Status

The design discussions are being tracked at Sqoop Jira 365

Summary and goals for Sqoop2 architecture has been documented at Apache Sqoop wiki

https://cwiki.apache.org/SQOOP/sqoop-2.html

Weekly meetings are being organized to discuss the progress of the work

https://cwiki.apache.org/SQOOP/sqoop2-weekly-meeting-minutes.html

The list of JIRAs against the task division is documented below

https://cwiki.apache.org/SQOOP/sqoop-2-jiras.html

If you want to build and setup sqoop2 install then you can also read following

http://jugnu-life.blogspot.in/2012/06/sqoop2-build-and-installation.html

No comments:

Post a Comment

Please share your views and comments below.

Thank You.