DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

Related

  • An Introduction To Open Table Formats
  • Starburst Unveils Fully Managed 'Icehouse' for Near Real-Time Analytics on the Open Data Lakehouse
  • Snowflake Micro-Partitioning: Technical Insights, Examples, and Advanced Developer Guide
  • Unlocking Data Insights and Architecture: Data Warehouses, Lakes, and Lakehouses

Trending

  • Microservices: Externalized Configuration
  • Optimizing Databricks Spark Pipelines Using Declarative Patterns
  • YOLOv5 PyTorch Tutorial
  • A Comprehensive Guide to Prompt Engineering
  1. DZone
  2. Data Engineering
  3. Big Data
  4. The Streaming Plane

The Streaming Plane

This concept addresses the need for real-time data integration and processing, enabling organizations to harness the power of both planes simultaneously.

By 
hubert dulay user avatar
hubert dulay
·
Dec. 09, 23 · Analysis
Likes (2)
Comment
Save
Tweet
Share
4.1K Views

Join the DZone community and get the full member experience.

Join For Free

Zhamack Dehghani nicely described the architectural data planes. In the dynamic landscape of data management, the concept of the "data divide" has emerged as a pivotal idea that highlights the crucial distinction between two essential components: the operational data plane and the analytical data plane. This concept is particularly relevant in today's data-driven world, where organizations strive to extract maximum value from their data assets. Understanding the data divide between these two planes is fundamental for devising effective strategies to manage, process, and derive insights from data.

Introduction to the Streaming Plane: Bridging the Operational and Analytical Data Planes

This bridge has traditionally been a one-way highway from the operational to the analytical plane. The path in the opposite direction is an arduous, awkward, and costly one that includes solutions named: Reverse ETL (rETL) and Data Activation. These solutions try to extract already cleansed and mastered data residing in the analytical plane from the data systems that aren’t optimized for large extraction.

In the ever-evolving landscape of data management, a prominent dimension acts as a multi-directional bridge between the operational and analytical data planes: the streaming plane. This concept addresses the need for real-time data integration and processing, enabling organizations to harness the power of both planes simultaneously. The streaming plane plays a pivotal role in achieving a holistic and dynamic data ecosystem, allowing businesses to respond to events in real time while also gaining valuable insights from historical data.

Streaming and real-time systems are positioning themselves by enabling access to both real-time and historical data without the limitations that the other data planes have.

Originally this plane just did ETL where the destinations were data lakes, data warehouses, and lake houses. As the systems in the streaming plane start to hold tables and materialized views, clients from external applications and internal dashboards can access data from each other.

Systems that live in the streaming plane tend to have these characteristics:

  • They all must source from a streaming platform like Kafka, Redpanda, Pulsar, Kinesis, etc.

  • They can build materialized views.

  • They can source historical data from an object store, data warehouse, data lake, or lake house.

  • They can transform data.

  • They can serve data either synchronously or asynchronously.

Streaming Cloud

These streaming plane systems like to focus on a subset of these characteristics which creates a cloud of streaming systems spanning from operational to analytical.

These streaming and real-time systems can create tables or materialized views of data from either side of the streaming plane. The tabular structures can range from row-based, column-based, or embedded stores depending on the use case.

Consumers of data in the streaming plane have 2 ways of consuming data — synchronously from the API Gateway and asynchronously from topics in a streaming platform. In fact, we are also seeing more transactional systems migrate into the fold with databases with columnar formats bringing more analytics to the edge. We will need to wait to see how these newer systems integrate into streaming and real-time analytics.

The streaming plane is assuming the role of Data-as-a-Service (DaaS) or a real-time streaming data mesh where the data catalog is the system holding the tabular structures that expose them synchronously or asynchronously.

The streaming plane is where streaming and batching converge to bring real-time analytics with historical context to all data planes. It’s built on the foundation of streaming platforms, stream processors, and connectors that allow for multi-region, multi-cloud, multi-domain, and hybrid architectures.

Data Products

The tabular structures in the streaming plane expose both synchronous and asynchronous endpoints and provide enough metadata to show where data originates and how they are processed. More importantly, metadata that describes consumability, to name a few:

  • Throughput: Can this data product handle another consumer?

  • SLAs: What guarantees does this data product provide?

  • Rate limits

Try

Most likely you have some of the components that build a streaming plane. You only need a system that exposes the tabular structures and can serve them at scale.

Catch

Make sure your security is set up so that this authorization and authentication can be used for all of the data product endpoints.

Finally

Data Mesh implemented as a streaming plane is achievable. You can read more in this book.

Streaming Data Mesh

Data integration Data lake Data management Data warehouse Cloud

Published at DZone with permission of hubert dulay. See the original article here.

Opinions expressed by DZone contributors are their own.

Related

  • An Introduction To Open Table Formats
  • Starburst Unveils Fully Managed 'Icehouse' for Near Real-Time Analytics on the Open Data Lakehouse
  • Snowflake Micro-Partitioning: Technical Insights, Examples, and Advanced Developer Guide
  • Unlocking Data Insights and Architecture: Data Warehouses, Lakes, and Lakehouses

Partner Resources

×

Comments

The likes didn't load as expected. Please refresh the page and try again.

  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook