Over a million developers have joined DZone.

How to Back Up Cassandra and MongoDB [Video]

DZone's Guide to

How to Back Up Cassandra and MongoDB [Video]

Of course, there are also applications that are not installed on any VMs but are instead running in containers — but how do you backup a VM that does not exist?

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

We recently had the opportunity to work with Curtis Preston, “Mr. Backup” at Storage Switzerland, an analyst firm focused on the storage, virtualization, and cloud marketplaces. Curtis Preston is an expert in backup and recovery systems, a space he has been working in since 1993. He interviewed Director of Products, Shalabh Goyal in this Chalk Talk Video below.

According to Curtis Preston, software was designed with a traditional data center in mind. It was made to backup a series of servers to a series of tape or disk drives via a network of media servers. The classic backup software product that was originally designed for physical servers has adapted over time to the idea that some — if not most — of the servers might be virtual. It has also added functionality for really large servers to be able to backup directly to the backup device without going through a media server. And it has also adapted its support for the device itself, with the most significant advancement being deduplication. One might make the argument that there are newer backup products that were designed to backup virtual machines directly to deduplicated disk, and these products should not be considered traditional backup products.

However, when we think about modern cloud applications such as ultra-scalable databases that run on multiple nodes, we are faced with the reality that even modern backup software packages that were designed for virtualization still weren’t designed with much of the cloud in mind. All traditional and most modern backup software packages are still based around the idea of backing up a physical machine or virtual machine to some type of device, be it disk or tape.

The problem with this design is that there are many applications in the cloud that are either not running in a VM or certainly not running in a single VM. There are applications whose data is distributed across dozens to hundreds of nodes, and no single node has access to all the data. How do you backup something like that when you only know how to talk to a VM? Of course, there are also applications that are not installed on any VMs but are instead running in containers — but how do you backup a VM that does not exist? To learn, make sure you watch the video.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

big data ,data centers ,cassandra ,mongodb

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}