hadoop - Point Mahout to HDFS instead of local disk -
I am trying to run logistic regression in Mahout on a file that's in HDFS. The name of the file is ppeng.txt, and the command line is next.
mahout org.apache.mahout.classifier.sgd.TrainLogistic --passes 5 --rate 1 --lambda 0.5 --input ppeng.txt --features 21 --output test_mahout --target nbr_of_txns --categories 2 --predictors lifetime_rev_usd_amt ntpv_12_mth_sent_usd_amt --types n n
The file is in HDFS, but the command line errors out with a file-not-found exception unless I re-create the file on the local machine.
My HADOOP_LOCAL variable is set to null as well. The error follows. Has anyone had experience fixing this problem? If so, please help.
Exception in thread "main" java.io.FileNotFoundException: ppeng.txt (No such file or directory)
    at java.io.FileInputStream.open(Native Method)
    at java.io.FileInputStream.<init>(FileInputStream.java:120)
    at org.apache.mahout.classifier.sgd.TrainLogistic.open(TrainLogistic.java:316)
    at org.apache.mahout.classifier.sgd.TrainLogistic.mainToOutput(TrainLogistic.java:75)
    at org.apache.mahout.classifier.sgd.TrainLogistic.main(TrainLogistic.java:64)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
    at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
    at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:188)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:197)
It is accessing the local file system: the trace goes through java.io.FileInputStream.open.
Can you run
hadoop fs -ls
from where you are running the Mahout job? If so, you may need to unset MAHOUT_LOCAL and try again. Mahout uses the Hadoop setup on the machine it is launched from to determine where the HDFS and MapReduce job trackers are.
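The check and the unset suggested above can be sketched as two small bash functions. This is only a sketch: the function names use_cluster_mode and check_hdfs are made up for illustration, and it assumes the Mahout launcher decides on local mode from MAHOUT_LOCAL being set in the calling shell. Only hadoop fs -ls and MAHOUT_LOCAL come from this thread.

```shell
#!/usr/bin/env bash
# Hypothetical helper names; only `hadoop fs -ls` and MAHOUT_LOCAL come from the thread.

# Clear MAHOUT_LOCAL in the current shell so the Mahout launcher
# is not forced into local mode by this variable.
use_cluster_mode() {
  unset MAHOUT_LOCAL
}

# List a path in HDFS (with no argument, the HDFS home directory),
# provided a hadoop client is on PATH. Returns non-zero otherwise.
check_hdfs() {
  if ! command -v hadoop >/dev/null 2>&1; then
    echo "hadoop client not found on PATH" >&2
    return 1
  fi
  hadoop fs -ls "$@"
}
```

A possible use is to source the file, run use_cluster_mode, confirm the input is visible with check_hdfs ppeng.txt, and then rerun the mahout command. Whether that is enough for this particular job is not confirmed in the thread, since the trace shows the input being opened through java.io.FileInputStream.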
Here's a post on switching between client, local pseudo-cluster, and full cluster membership on a single machine. It also sets the right flag for Mahout. It uses bash functions.
http://occamsmachete.com/ml/2012/07/02/how-to-switch-hadoop-confugurations/
hadoop hdfs mahout