Here, we have two tables:
- Tab1 having columns id, name and age
- Tab2 having columns id, name and email
Using the below command to load data in pig,
tab1 = load ‘/mnt/home/edureka_425640/pig_join_1.txt’ using PigStorage(‘,’) as (id:int,name:chararray,age:int)
Dump tab1;
data:image/s3,"s3://crabby-images/f465b/f465b64a379853e0dcda596df7c11c868d5d24e1" alt="image image"
tab2 = load ‘/mnt/home/edureka_425640/pig_join_2.txt’ using PigStorage(‘,’) as (id:int,name:chararray,emal:chararrray)
Dump tab2;
data:image/s3,"s3://crabby-images/cd6a4/cd6a4eeed5043efe7f5e84f3bca2a7d12f4e9406" alt="image image"
Now, joining two tables on two columns
data:image/s3,"s3://crabby-images/7b954/7b9541dca9bbc7bec876a913c47904bf73b3c554" alt="image image"
The below is the output:
data:image/s3,"s3://crabby-images/0056e/0056ea29a8eec922f2029c9d24cf8339e2382f0d" alt="image image"
Hope this helps you.