We are entering an exciting phase in our journey. If you followed our announcements in the last couple of months — we launched RecoverX, our first product, onboarded early adopters, and got recognized by Gartner for the newly defined Cloud Data Backup category in the 2016 hype cycle for storage technologies. So what’s new and where are we headed? I am proud to announce our partnership with AWS today. In this post, I will set the context for the announcement, use cases that we are focused on, and provide a glimpse into what’s ahead.
There are several foundational changes occurring in the IT industry. One of the major transitions for companies across the board is the move towards the everything as a service (‘X’aaS) model. Cloud services are growing at 30+% yearly rate, and rightly so. It is easy to witness this transition, where cloud-based service providers such as Amazon, Salesforce, Microsoft, and Google are reporting massive revenue growth for their cloud platforms. Even within enterprises that have majority of their IT infrastructure on-premise, central IT teams are morphing into as-a-service providers (aka, the private cloud) to internal businesses. But why are enterprises making this transition? It is because ‘as-a-service’ model allows for faster time to value and the ability to respond to market changes quickly. The rise of as-a-service model has also resulted in enterprises using multiple cloud service providers depending on how well each provider meets specific business or project requirements. In some cases, production applications are deployed on internal private cloud but engineering teams have access to public cloud resources for new feature development and testing, or vice versa. So, without a doubt, multi-cloud is a reality. However, as enterprises look to adopt and scale their use of cloud platforms, they cannot compromise on data protection, masking, availability, and data portability. While public cloud provides data availability (aka, availability zones), the lack of rich data protection services is a gap for next-generation applications and databases, and thus hinders enterprises’ adoption of a “cloud-first” strategy.
At Datos IO, we have been focused on pioneering data protection at scale with the goal of making it application-centric for DevOps and application developers, work across multiple databases (multi-platform) from day one, and making it cloud-centric for ease of use. So, why are we announcing the AWS partnership and what is special about it? It is simply because it is “customer-led!” One of our early adopters runs its IoT platform on AWS and uses the Cassandra database. They needed a point-in-time recovery solution that is cloud-native and scales with their growing application needs. This drove us to work with the AWS team and get certified and listed on the Amazon Partner Network (APN).
Now, the exciting part — with today’s announcement, Datos IO RecoverX is the ONLY cloud-native backup and recovery software for next-generation databases such as Apache Cassandra and MongoDB. Check out our profile on the Amazon Partner Network.
In addition to providing cloud-native data protection to protect data natively where the application is deployed, I believe that there are other equally important requirements:
- Protection of cloud databases at cloud scale: RecoverX is a multi-platform data protection solution that can protect multiple distributed databases (NoSQL such as Apache Cassandra, MongoDB, and more forthcoming). Rather than a rip-and-replace solution, RecoverX is a new tool for DevOps and IT admins to protect 3rd platform applications deployed on distributed databases. It is deployed as a three-node, clustered solution and is architected to ensure no inline processing and no bottleneck in the data path. These two capabilities allow RecoverX to perform at cloud-scale with low RPO/RTO.
- Application consistency: All backup and recovery solutions are not the same. Most existing cloud backup solutions perform VM-level backups by hooking into the standard virtualization platform APIs such as VADP APIs from VMware. Backups taken at VM-level are not application consistent (except in few scenarios when VSS is used). Without application consistency, databases need to run repairs (time consuming) and often times the backup may be corrupted. RecoverX provides true application-level consistency for distributed databases that results in massive RTO savings and reduced application downtime.
- Multi-cloud capability: RecoverX is a software-only solution that may be deployed on any compute resource and use cloud-native object stores such as AWS S3 or Google Cloud Storage to store backup data. RecoverX has built-in operational resiliency to handle wide range of network and storage performance that are witnessed in different cloud networks. It also integrates natively with identity and access management (IAM) authentication technologies supported by different cloud providers. Most importantly, enterprises can recover their data across the clouds as shown in the diagram – for example, backups stored on AWS S3 storage can be restored on Google Cloud Platform and vice versa. This allows enterprises maximum flexibility in deploying applications by minimizing dependence on a specific cloud infrastructure.
- Single pane of glass: We believe that federated management of multiple RecoverX systems deployed across multiple clouds will provide a single pane of glass to IT admins and devops teams. Our RESTful API based architecture already allows integration of multiple RecoverX systems with existing enterprise operational monitoring tools. In addition, we are building the global management capability in our graphical user interface to simplify operations of RecoverX in Multi-cloud deployments.
- Native backup in IaaS and PaaS environments: This is the next big frontier for a cloud data backup solution — making data protection invisible! The enterprises just need to provide connectivity to their database clusters and the requested service level (e.g. daily backups or hourly backups), the rest of the operational tasks are managed by the cloud backup service. The IT admins and DevOps teams get access to data protection service via their cloud service interface that provides them a global view of data protection status across all deployments and allows them to restore across database clusters and clouds.
Where Are We Headed?
At Datos IO, we are working to bring our vision to reality. We have taken the first few steps in this direction by building a cloud-scale architecture for application-consistent backups of 3rd platform applications deployed on distributed databases. By making Datos IO RecoverX natively available in public cloud environments, we are now taking a step toward bringing the ease of use and adoption for cloud-first deployments.