I am written a hadoop program, I know that I can directly pass arguments to hadoop use args[], I mean currently is like this
ToolRunner.run(new Configuration(), new RunDear(), args);
but if there are many arguments, can I make a configuration file like below and pass to hadoop? where should this file located, in local file system or hdfs?
sample_size 200
input_genotype_file /data/genotypes.txt
input_phenotype_file /data/phenotypes.txt
output_directory /outout
mtry 200
ntree 3000
distance 0 (e.g. 0=euclidean, 1=mehalanobis
variable_important 0 (e.g. 0=information gain, 1=permutation)
etc….