Over a million developers have joined DZone.

An Impatient Start With the Apache Ignite Machine Learning Grid

DZone's Guide to

An Impatient Start With the Apache Ignite Machine Learning Grid

In this short post, we are going to learn how to download the new Apache Ignite 2.0 release, build the example, and run it.

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

Recently, Apache Ignite 2.0 introduced a beta version of the in-memory Machine Learning grid, which is a distributed Machine Learning library built on top of the Apache IMDG. This beta release of ML library can perform local and distributed vectors, decompositions, and matrix algebra operations. The data structure can be stored in a Java heap, off-heap, or in distributed Ignite caches. At this moment, the Apache Ignite ML grid doesn't support any prediction or recommendation analysis.

In this short post, we are going to download the new Apache Ignite 2.0 release, build the example, and run it.

1. Download and Unpack the Apache Ignite 2.0 Distribution

Download the Apache Ignite 2.0 binary release version from this link. Unpack the distribution somewhere in your workstation (e.g. /home/ignite/2.0) and set the IGNITE_HOME path to the directory.

2. Start the Apache Ignite Remote Node

Run the following command in the terminal window:

ignite.sh examples/config/example-ignite.xml

Note that remote nodes for examples should always be started with the special configuration file which enables P2P class loading: examples/config/example-ignite.xml. Also, note that Apache Ignite version 2.0 needs Java version 1.8 or higher.

3. Build the Machine Learning Examples

Go to the /examples folder of the Apache Ignite distribution. If you've already installed and configured Maven, run the following command from the examples folder:

mvn clean install -Pml

The above command will active the Machine Learning (ml) profile and build the project.

4. Run It

Let's run the simple local on-heap version of the vector example. Execute the following command in your terminal windows:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.vector.VectorExample

You should get the following logs in your console:

Figure 1.

All the examples are autonomous and don't need any special configuration. Examples named with Cache such as CacheMatrixExample or CacheVectorExample need a remote Ignite node with P2P class loading. Let's run CacheMatrixExample with the following command:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.matrix.CacheMatrixExample

You should get the following output:

Figure 2.

Additionally, Apache Ignite ML grid provides a simple utility class that allows pretty printing of matrices and vectors. You can run TracerExample as follows:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.tracer.TracerExample

This above command will launch a web browser and provide some HTML output as follows:
Figure 3.This is enough for now. Learn even more from the High-performance in-memory computing with Apache Ignite book.

Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.

machine learning ,big data ,tutorial ,apache ignite ,grids ,in-memory data grid

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}