DZone
The Latest Big Data Topics

Automating Threat Detection Using Python, Kafka, and Real-Time Log Processing
Durable stream, stable schema, entity-keyed partitions, and a DLQ for failures; detections written against normalized fields stay portable as sources evolve.
April 21, 2026
by Krishnaveni Musku
· 2,032 Views
From APIs to Event-Driven Systems: Modern Java Backend Design
Modern Java backend design is evolving from traditional APIs to event-driven architectures, enabling more scalable, resilient, and real-time distributed systems.
April 20, 2026
by Ramya vani Rayala
· 4,335 Views · 8 Likes
Metadata Driven Data Engineering: Declarative Pipeline Orchestration in Lakeflow
Define what you want with decorators; Lakeflow figures out how to run it, eliminating boilerplate and reducing operational overhead at scale.
April 20, 2026
by Seshendranath Balla Venkata
· 1,878 Views · 1 Like
Training a Neural Network Model With Java and TensorFlow
Learn how to train a neural network model with TensorFlow in Java and use a pre-trained model in a Spring Boot application.
April 17, 2026
by George Pod
· 3,128 Views · 1 Like
You Are Using Claude Wrong (And So Is Everyone You Know)
Claude isn't ChatGPT with a different logo; it's built on different principles that reward a different way of working, and that difference compounds.
April 14, 2026
by Faisal Feroz
· 4,239 Views · 5 Likes
Boost Your Spark Jobs: How Photon Accelerates Apache Spark Performance
Photon is Databricks’ native C++ engine that bypasses JVM bottlenecks by processing data in vectorized, SIMD-accelerated batches instead of row by row.
April 13, 2026
by Seshendranath Balla Venkata
· 2,854 Views · 1 Like
Apache Spark 3 to Apache Spark 4 Migration: What Breaks, What Improves, What's Mandatory
A comprehensive guide to migrating from Apache Spark 3.x to Spark 4.0, covering breaking changes, new features, and mandatory updates for a smooth transition.
April 10, 2026
by Rambabu Bandam
· 4,318 Views · 1 Like
Schema Evolution in Delta Lake: Designing Pipelines That Never Break
Delta Lake prevents pipeline failures from schema drift using schema enforcement and schema evolution, allowing Spark pipelines to adapt safely to new columns.
April 10, 2026
by Seshendranath Balla Venkata
· 2,754 Views · 1 Like
Why Queues Don’t Fix Scaling Problems
Queues absorb spikes but not sustained overload. Without backpressure, limits, and monitoring, backlogs grow until systems fail.
April 8, 2026
by David Iyanu Jonathan
· 3,442 Views · 2 Likes
Spark on AmpereOne® M Arm Processors Reference Architecture
Deploy and tune Apache Spark on AmpereOne M, with setup steps, cluster configs, and benchmarks showing gains vs Ampere Altra in performance and efficiency.
April 6, 2026
by RamaKrishna Nishtala
· 3,299 Views · 2 Likes
Hadoop on AmpereOne Reference Architecture
Hadoop on AmpereOne M shows improved throughput, scaling, and efficiency, with setup, tuning, and benchmark insights for optimizing big data workloads.
April 3, 2026
by RamaKrishna Nishtala
· 5,434 Views
End-to-End Streaming Optimization: Kafka to Delta With Exactly-Once Guarantees
Kafka feeds the stream, Spark tracks progress via checkpoints, and Delta's transaction log ensures every event lands exactly once, even across failures and restarts.
April 1, 2026
by Seshendranath Balla Venkata
· 2,571 Views · 2 Likes
Delta Change Data Feed Deep Dive: Building Incremental Pipelines Without Complexity
Delta CDF in Databricks enables pipelines to process only changed rows with commit metadata, simplifying incremental ETL without full scans.
April 1, 2026
by Seshendranath Balla Venkata
· 2,871 Views · 1 Like
Queues Don't Absorb Load — They Delay Bankruptcy
Queues hide overload. Without back-pressure, limits, and scaling, lag just grows until failure. Bound queues, alert on lag, fail fast, and plan capacity.
March 30, 2026
by David Iyanu Jonathan
· 1,633 Views · 2 Likes
Scaling Kafka Consumers: Proxy vs. Client Library for High-Throughput Architectures
Scaling Apache Kafka consumption requires new patterns; proxy layers and client libraries offer practical solutions for high-throughput workloads.
March 30, 2026
by Kai Wähner (DZone Core)
· 1,387 Views · 3 Likes
Stop Leap-Second AI Drift in IoT Streams With PySpark
Leap seconds can corrupt timestamps and trigger AI drift in fintech IoT systems. Learn about drift types and how PySpark streaming fixes them in real time.
March 27, 2026
by Ram Ghadiyaram (DZone Core)
· 2,117 Views · 1 Like
From Stream to Strategy: How TOON Enhances Real-Time Kafka Processing for AI
The TOON data format specifically targets the propagation of structured, validated, and semantically consistent data, thereby reducing ambiguity in real time.
March 27, 2026
by Gautam Goswami (DZone Core)
· 1,966 Views
The Phantom Write Problem: Why Your Idempotency Implementation Is Silently Losing Data
A practical explanation of why idempotent APIs still produce phantom writes in production, and a race-free, transactional pattern to prevent them.
March 24, 2026
by Saumya Tyagi
· 3,123 Views · 2 Likes
From DLT to Lakeflow Declarative Pipelines: A Practical Migration Playbook
Migrating from DLT to Lakeflow is mostly an API refactor, swapping DLT for pipelines, separating streaming and materialized tables, and updating CDC logic.
March 19, 2026
by Seshendranath Balla Venkata
· 3,967 Views · 1 Like
How Piezoelectric Energy Harvesting Is Solving the Battery Waste Crisis in Industrial IoT
Industrial piezoelectric sensors decouple IIoT reliability from battery dependence that compromises data resolution and responsiveness.
March 18, 2026
by Emily Newton
· 3,646 Views