Over a million developers have joined DZone.

An Impatient Start With the Apache Ignite Machine Learning Grid

DZone's Guide to

An Impatient Start With the Apache Ignite Machine Learning Grid

In this short post, we are going to learn how to download the new Apache Ignite 2.0 release, build the example, and run it.

· Big Data Zone
Free Resource

Free O'Reilly eBook: Learn how to architect always-on apps that scale. Brought to you by Mesosphere DC/OS–the premier platform for containers and big data.

Recently, Apache Ignite 2.0 introduced a beta version of the in-memory Machine Learning grid, which is a distributed Machine Learning library built on top of the Apache IMDG. This beta release of ML library can perform local and distributed vectors, decompositions, and matrix algebra operations. The data structure can be stored in a Java heap, off-heap, or in distributed Ignite caches. At this moment, the Apache Ignite ML grid doesn't support any prediction or recommendation analysis.

In this short post, we are going to download the new Apache Ignite 2.0 release, build the example, and run it.

1. Download and Unpack the Apache Ignite 2.0 Distribution

Download the Apache Ignite 2.0 binary release version from this link. Unpack the distribution somewhere in your workstation (e.g. /home/ignite/2.0) and set the IGNITE_HOME path to the directory.

2. Start the Apache Ignite Remote Node

Run the following command in the terminal window:

ignite.sh examples/config/example-ignite.xml

Note that remote nodes for examples should always be started with the special configuration file which enables P2P class loading: examples/config/example-ignite.xml. Also, note that Apache Ignite version 2.0 needs Java version 1.8 or higher.

3. Build the Machine Learning Examples

Go to the /examples folder of the Apache Ignite distribution. If you've already installed and configured Maven, run the following command from the examples folder:

mvn clean install -Pml

The above command will active the Machine Learning (ml) profile and build the project.

4. Run It

Let's run the simple local on-heap version of the vector example. Execute the following command in your terminal windows:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.vector.VectorExample

You should get the following logs in your console:

Figure 1.

All the examples are autonomous and don't need any special configuration. Examples named with Cache such as CacheMatrixExample or CacheVectorExample need a remote Ignite node with P2P class loading. Let's run CacheMatrixExample with the following command:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.matrix.CacheMatrixExample

You should get the following output:

Figure 2.

Additionally, Apache Ignite ML grid provides a simple utility class that allows pretty printing of matrices and vectors. You can run TracerExample as follows:

mvn exec:java -Dexec.mainClass=org.apache.ignite.examples.ml.math.tracer.TracerExample

This above command will launch a web browser and provide some HTML output as follows:
Figure 3.This is enough for now. Learn even more from the High-performance in-memory computing with Apache Ignite book.

Easily deploy & scale your data pipelines in clicks. Run Spark, Kafka, Cassandra + more on shared infrastructure and blow away your data silos. Learn how with Mesosphere DC/OS.

machine learning ,big data ,tutorial ,apache ignite ,grids ,in-memory data grid

Published at DZone with permission of Shamim Bhuiyan. See the original article here.

Opinions expressed by DZone contributors are their own.


Dev Resources & Solutions Straight to Your Inbox

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.


{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}