Teradata and Hadoop have been talking via Sqoop since long. In which Sqoop was used to import export data in and out of Hadoop and Teradata.
Teradata has new tool called SQL-H
SQL Hadoop which allows to connect directly to Hadoop from Teradata and perform operations.
The two links below explain all about them.
But before going to main link I will suggest you to have a quick read at Teradata architecture. ( only 2 mins read)
What is the function of
PE – Parsing Engine , parses the query entered by user
AMP –Do actiual disk operations
BYNET – Handles communication between AMP and PE
Now comes one additional component EAH
EAH is the External Access Handler , which talks to HCatalog on Hadoop side to get information related to table
You can run queries like
SELECT Price
, CAST(Make AS VARCHAR(20))
, CAST(Model AS VARCHAR(20))
FROM LOAD_FROM_HCATALOG(
USING
SERVER('sdll4364.labs.teradata.com')
PORT('9083')
USERNAME ('hive')
DBNAME('default')
TABLENAME('CarPriceData')
COLUMNS('*')
TEMPLETON_PORT('1880')
) as DT;
Where server is details about hadoop cluster
And we are saying in Teradata , use Hcatalog and talk to Hadoop execute this query bring back the results.
There is very good presentation which explains more
More reading at
Thanks for reading
No comments:
Post a Comment
Please share your views and comments below.
Thank You.