hadoop - Point Mahout to HDFS instead of local disk -



hadoop - Point Mahout to HDFS instead of local disk -

i trying run logistic regression on mahout file that's in hdfs - name of file ppeng.txt next command line.

mahout org.apache.mahout.classifier.sgd.trainlogistic --passes 5 --rate 1 --lambda 0.5 --input ppeng.txt --features 21 --output test_mahout --target nbr_of_txns --categories 2 --predictors lifetime_rev_usd_amt ntpv_12_mth_sent_usd_amt --types n n

this file in hdfs, but, line errors out file not found exception unless re-create file local machine.

my hadoop_local variable set null well. error follows - has have experience fixing problem - if so, please help.

exception in thread "main" java.io.filenotfoundexception: ppeng.txt (no such file or directory)** @ java.io.fileinputstream.open(native method) @ java.io.fileinputstream.<init>(fileinputstream.java:120) @ org.apache.mahout.classifier.sgd.trainlogistic.open(trainlogistic.java:316) @ org.apache.mahout.classifier.sgd.trainlogistic.maintooutput(trainlogistic.java:75) @ org.apache.mahout.classifier.sgd.trainlogistic.main(trainlogistic.java:64) @ sun.reflect.nativemethodaccessorimpl.invoke0(native method) @ sun.reflect.nativemethodaccessorimpl.invoke(nativemethodaccessorimpl.java:39) @ sun.reflect.delegatingmethodaccessorimpl.invoke(delegatingmethodaccessorimpl.java:25) @ java.lang.reflect.method.invoke(method.java:597) @ org.apache.hadoop.util.programdriver$programdescription.invoke(programdriver.java:68) @ org.apache.hadoop.util.programdriver.driver(programdriver.java:139) @ org.apache.mahout.driver.mahoutdriver.main(mahoutdriver.java:188) @ sun.reflect.nativemethodaccessorimpl.invoke0(native method) @ sun.reflect.nativemethodaccessorimpl.invoke(nativemethodaccessorimpl.java:39) @ sun.reflect.delegatingmethodaccessorimpl.invoke(delegatingmethodaccessorimpl.java:25) @ java.lang.reflect.method.invoke(method.java:597) @ org.apache.hadoop.util.runjar.main(runjar.java:197)

it accessing local file scheme java.io.fileinputstream.open.

can run

hadoop fs -ls

from running mahout job? if need unset mahout_local , seek again. mahout uses hadoop setup on machine launching determine hdfs , mapreduce job trackers are.

here's post switching between client, local pseudo cluster, , total cluster membership on single machine. set right flag mahout. bash functions.

http://occamsmachete.com/ml/2012/07/02/how-to-switch-hadoop-confugurations/

hadoop hdfs mahout

Comments

Popular posts from this blog

php - Android app custom user registration and login with cookie using facebook sdk -

django - Access session in user model .save() -

php - .htaccess Multiple Rewrite Rules / Prioritizing -