mapreduce - Updating HIVE table with new records and upsert records based on key -



mapreduce - Updating HIVE table with new records and upsert records based on key -

i next below approch -

there table work.data_perm avialble , new , info has updated in such form if assuming 100 rows in work.data_perm , new file have 10 records out of 9 has updated , 1 new record-

steps followed -

create new table work.data_pty , insert record of table work.data_perm . append table file been recieved (10 rows ) referred above. run below query desired info -

select t1.pty_key, t1.prcs_dt work.data_pty t1 inner bring together (select pty_key, max(prcs_dt) max_date work.data_pty grouping pty_key) t2 on t2.pty_key = t1.pty_key , t2.max_date = t1.prcs_dt

we want utilize java map cut down doing activity ? please suggest approch. responses.

you can utilize java map cut down doing activity. after writing mapper , reducer code in java, can plug them hive query.

you can see how write custom mapper , reducer hive queries here

also see this

mapreduce hive

Comments

Popular posts from this blog

php - Android app custom user registration and login with cookie using facebook sdk -

django - Access session in user model .save() -

php - .htaccess Multiple Rewrite Rules / Prioritizing -