DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

Zones

Culture and Methodologies Agile Career Development Methodologies Team Management
Data Engineering AI/ML Big Data Data Databases IoT
Software Design and Architecture Cloud Architecture Containers Integration Microservices Performance Security
Coding Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Culture and Methodologies
Agile Career Development Methodologies Team Management
Data Engineering
AI/ML Big Data Data Databases IoT
Software Design and Architecture
Cloud Architecture Containers Integration Microservices Performance Security
Coding
Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance
Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks

Curious about the future of data-driven systems? Join our Data Engineering roundtable and learn how to build scalable data platforms.

Data Engineering: The industry has come a long way from organizing unstructured data to adopting today's modern data pipelines. See how.

Threat Detection: Learn core practices for managing security risks and vulnerabilities in your organization — don't regret those threats!

Managing API integrations: Assess your use case and needs — plus learn patterns for the design, build, and maintenance of your integrations.

Related

  • The State of Observability 2024: Navigating Complexity With AI-Driven Insights
  • Making Waves: Dynatrace Perform 2024 Ushers in New Era of Observability
  • Cost Optimization Strategies for Managing Large-Scale Open-Source Databases
  • Unifying SecOps and Observability for Enhanced Cloud Security in Azure

Trending

  • How Jenkins Can Stay Relevant in the Next Decade
  • Optimizing Vector Search Performance With Elasticsearch
  • Front-End Debugging, Part 1: Not Just Console Log
  • Lifecycle Microservices With GenAI Tools
  1. DZone
  2. Software Design and Architecture
  3. Cloud Architecture
  4. Cloud Data: Observability Is the Forgotten Data

Cloud Data: Observability Is the Forgotten Data

In this continuation of the cloud data series, discuss the forgotten data that is often overlooked when planning for cloud-native architectural solutions.

By 
Eric D.  Schabell user avatar
Eric D. Schabell
DZone Core CORE ·
Aug. 23, 22 · Opinion
Likes (2)
Comment
Save
Tweet
Share
5.7K Views

Join the DZone community and get the full member experience.

Join For Free

This article is a continuation of a series of posts to better understand how the pitfalls around the collection, maintenance, and storage of your cloud data can mean the difference between failure and success within your cloud strategy.  The concepts in this series stem from brainstorming with my good friend Roel Hodzelmans and are additionally inspired by reactions from the audience to a talk given previously in Dublin, Ireland.

The initial post provided an introduction to cloud and data, and what that means in a cloud-native architecture beyond just storage. In this second article, we discuss the forgotten data that is often overlooked when planning for cloud-native architectural solutions.

Observability Is the Forgotten Data

When you look at observability, you might be thinking about data generated from logs, traces, metrics, and even events across your landscape. What you probably do not realize is that many of your applications and platforms have standard installation settings that generate large amounts of observability data by default. If you are not accounting for all that data being generated when you are heading into the cloud, you are going to have a hard time meeting your budget constraints for deploying and running your production solutions.

Martin Mao stated earlier this year that the growth of observability data is out of control. He talks about how organizations don't mind paying for that data if it led to better outcomes, such as happier customers, higher availability, faster remediation, or more revenue.

"Paying more for logging/metrics/tracing doesn't equate to a positive user experience. Consider how much data can be generated and shipped. $$$. You still need good people to turn data into action. It's remarkable how common this situation is, where an organization is paying more for their observability data (typically metrics, logs, traces, and sometimes events), then they do for their production infrastructure." - Martin Mao

Let's take a look at a simple experiment presented in an article on the hidden cost of data observability, where a simple "Hello, World!" application was deployed on a four-node Kubernetes cluster on GKE (see the article for details of the setup). Scripts were used to simulate load on the application and 30 days of observability data were collected in the following categories:
  • Tracing - One trace per second over 30 days totaled 2.5M traces for a total data size of 161GB.
  • End user metrics - Each back-end call generated a user interaction, so over 30 days, that's 2.5M EUM traces for a total data size of 1GB.
  • Logs - Mileage may vary depending on the configuration of your logging, but here, it was a 30-day total data size of 3.4GB.
  • Metrics - Collected using Prometheus configured for a 10-second sample rate across the cluster for a 30-day total data size of 285GB

Granted, this might not be a perfect example for your research, but it is simple and gives easy-to-follow results of just over 450GB of data for a single, simple application.

If you take into account that the average retention period for audits and compliance is at 13 months, you have to ask yourself how much data you are having to collect, transport, and store effectively across your cloud architecture(s). In modern cloud-native architectures, you can be deploying multiple times a day, where a container is sometimes only around for a few minutes or hours. The default of storing the observability data generated there may not need to be 13 months. Maybe trying setting retention periods for each data type can help with your generated data volume.

Also, consider the various environments that are set up and torn down weekly, or bi-weekly, such as test or lab environments. These certainly don't need extensive observability data retention, if any at all.

As Martin noted, paying for more data is one thing, but people are the core of any successful use case:

"Paying more for logging/metrics/tracing doesn't equate to a positive user experience. Consider how much data can be generated and shipped. $$$. You still need good people to turn data into action."

Who Owns These Decisions?

While realizing that there is a lot of unexpected cloud data coming out of your architecture, there remains an issue of who owns these decisions in your organization. The observability data explosion can cause a lot of issues and costs, but the question to answer is:

Do you dare to flip the switch on a new data collection in your architecture?

The following article in this series will take a look at what the industry is going to be doing in the near future to ensure there is a financial owner for their organization.

Observability Cloud Data (computing)

Published at DZone with permission of Eric D. Schabell, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

Related

  • The State of Observability 2024: Navigating Complexity With AI-Driven Insights
  • Making Waves: Dynatrace Perform 2024 Ushers in New Era of Observability
  • Cost Optimization Strategies for Managing Large-Scale Open-Source Databases
  • Unifying SecOps and Observability for Enhanced Cloud Security in Azure

Partner Resources


Comments

The likes didn't load as expected. Please refresh the page and try again.

ABOUT US

  • About DZone
  • Support and feedback
  • Community research
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 100
  • Nashville, TN 37211
  • support@dzone.com

Let's be friends: