Using Mahout commands in DataStax Enterprise (deprecated)

Run Mahout commands on the dse command line.

Hadoop is deprecated for use with DataStax Enterprise, including DSE Hadoop and BYOH (Bring Your Own Hadoop). Mahout is also deprecated and will be removed when Hadoop is removed.

You can run Mahout commands on the dse command line. For example, to list the available commands on Mac OS X:

$ cd install_location
$ bin/dse mahout

The list of commands appears.
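If the list is long, it can be filtered by keyword. The following is a minimal sketch, not part of DSE: the function name mahout_find and the DSE_BIN variable are hypothetical, and the sketch assumes you run it from install_location so that bin/dse resolves. Because it is an assumption whether the list is written to standard output or standard error, both streams are searched.

```shell
# Hypothetical helper: list only the Mahout commands whose line matches a keyword.
# DSE_BIN is an assumed variable; it defaults to bin/dse relative to install_location.
mahout_find() {
  # Merge stderr into stdout so the list is searched wherever it is printed.
  "${DSE_BIN:-bin/dse}" mahout 2>&1 | grep -i -- "$1"
}
```

For example, mahout_find vector would show only the lines of the list that mention "vector", such as the arff.vector command.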

Mahout command line help

To get help on a command, pass the command name as the first argument, followed by the --help option:

$ cd install_location
$ bin/dse mahout arff.vector --help

The output is the help text for the arff.vector command.
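To read the help for several commands in one pass, the same invocation can be wrapped in a small shell function. This is a sketch under stated assumptions rather than a DSE feature: the function name mahout_help and the DSE_BIN variable are hypothetical, and bin/dse is assumed to resolve from the current directory (install_location).

```shell
# Hypothetical helper: print the help text for each Mahout command given as an argument.
# DSE_BIN is an assumed variable; it defaults to bin/dse relative to install_location.
mahout_help() {
  for cmd in "$@"; do
    # A separator line makes it easier to see where each command's help begins.
    echo "=== $cmd ==="
    "${DSE_BIN:-bin/dse}" mahout "$cmd" --help
  done
}
```

For example, mahout_help arff.vector prints a separator line followed by the same help text as the command shown above. Any additional names you pass must be commands that appear in the list printed by bin/dse mahout.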

Add Mahout classes to the class path and execute a Hadoop command

Use Hadoop shell commands to work with Mahout. The following syntax first adds the Mahout classes to the class path and then executes the Hadoop command:

$ cd install_location
$ bin/dse mahout hadoop fs -text mahout_file | more
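When you only need a quick look at the start of a file, a non-interactive variant of the command above can be convenient in scripts. The following is a minimal sketch: the function name mahout_head, the DSE_BIN variable, and the default of 20 lines are all hypothetical choices, and mahout_file remains a placeholder for your own file path.

```shell
# Hypothetical helper: print the first N lines of a Mahout file as text.
# Usage: mahout_head <file> [lines]   (lines defaults to 20, an arbitrary choice)
mahout_head() {
  file="$1"
  lines="${2:-20}"
  # Same command as above, with head in place of the interactive pager.
  "${DSE_BIN:-bin/dse}" mahout hadoop fs -text "$file" | head -n "$lines"
}
```

For example, mahout_head mahout_file 5 prints at most five lines. Because head closes the pipe as soon as it has enough lines, the Hadoop process may log a write error on large files; the lines that were printed are unaffected.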

The Apache web site offers an in-depth Mahout tutorial.