Apache Flink helps build big data in a efficient and scalable way. Learn how to use it to read data from a file, transform it to uppercase, and write it to another file.
This tutorial will provide some information on how to set up an Elasticsearch cluster and will add some operational tips and best practices to help you get started.
Partitioning strategies depend on the shape of the data and what type of processing the apps do. As you scale, you may need to adapt your strategies to handle new data volumes and shapes.
Orange is an extremely easy-to-use, lightweight, drag-and-drop tool for building machine learning models and analyzing data. More importantly, it is open source!
It's imperative to have a system that can analyze large amounts of logs and present a singular view. Centralized logging provided by the Elastic Stack is a step in the right direction.
Kubernetes handles containers in several computers, removing the complexity of handling distributed processing. But what's the best way to perform Kubernetes monitoring?