hadoop - Apache Pig RANK operator not working with multiple reducers




I am trying to use Pig's RANK operator to assign an integer to each given string. It works when I set the PARALLEL clause to 1, but not with a higher value (like 200). I need multiple reducers to speed up processing, because by default Pig uses only 1 reducer here, which takes a long time.

My query is as follows:

rank = RANK tupl1 BY col1 ASC PARALLEL 200;

Actually, according to the Pig documentation (https://pig.apache.org/docs/r0.11.1/perf.html#parallel):

You can include the PARALLEL clause with any operator that starts a reduce phase: COGROUP, CROSS, DISTINCT, GROUP, JOIN (inner), JOIN (outer), and ORDER BY.

I think that is why I get an error: RANK is not in that list, so it seems it is not possible to set the PARALLEL parameter for RANK.
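For comparison, here is a minimal sketch of PARALLEL on an operator that the documentation does list (ORDER BY). The load path and schema are hypothetical, and `tupl1` and `col1` are only assumed to look like this:

```pig
-- Hypothetical input: the path and the single-column schema are assumptions.
tupl1 = LOAD 'input.txt' AS (col1:chararray);

-- ORDER BY starts a reduce phase, so the documentation says PARALLEL is accepted here.
sorted = ORDER tupl1 BY col1 ASC PARALLEL 200;
```

This only shows where the documentation says the clause is allowed. It does not assign the integer rank that I need.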

hadoop hive apache-pig
