DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

The Latest Data Topics

article thumbnail
Integrating Retrieval-Augmented Generation (RAG) With Agentic AI: Harnessing Elasticsearch Vector Databases for Enterprise AI Systems
A practical overview of using retrieval-augmented generation and agentic AI with Elasticsearch to build reliable, enterprise-ready LLM systems.
January 14, 2026
by Devdas Gupta
· 2,486 Views · 2 Likes
article thumbnail
Your Next Customer Is a Bot
E-commerce brands lose billions annually to cart abandonment, with over 70% of checkouts left unfinished. A new technology, Agentic AI, is here to solve this.
January 13, 2026
by Akash Lomas
· 1,395 Views · 1 Like
article thumbnail
Optimizing Financial Data Pipelines: Accelerating OneStream-to-Snowflake Exports by 85%
A case study on optimizing financial data pipelines by refactoring legacy SQL batch inserts into a "Stage and Copy" architecture.
January 13, 2026
by Sridhar Mannava
· 916 Views · 3 Likes
article thumbnail
MCP Servers Are Everywhere, but Most Are Collecting Dust: Key Lessons We Learned to Avoid That
Teams rushing to build MCP servers are discovering that enthusiasm doesn’t translate into usefulness. This article unpacks why and how to make your MCP server valuable.
January 13, 2026
by Thomas Johnson DZone Core CORE
· 2,551 Views · 4 Likes
article thumbnail
Apache Spark 4.0: What’s New for Data Engineers and ML Developers
Spark 4.0 brings Spark Connect, enhanced SQL (PIPE, VARIANT), richer Python APIs, and advanced streaming — modernizing Spark for faster, more flexible 2025 workloads.
January 12, 2026
by harshraj bhoite
· 2,032 Views
article thumbnail
Serverless Spark Isn't Always the Answer: A Case Study
Processing 500M+ records with 100 concurrent users under a 5-minute SLA demands smart architecture. We evaluate seven compute models and why hybrid approaches often win.
January 12, 2026
by Janani Annur Thiruvengadam DZone Core CORE
· 1,506 Views · 1 Like
article thumbnail
Mastering Fluent Bit: Developer Guide to Telemetry Pipeline Routing (Part 12)
This intro to mastering Fluent Bit covers telemetry pipeline routing mechanisms, tag-based, conditional, and label-based, with hands-on examples for developers.
January 9, 2026
by Eric D. Schabell DZone Core CORE
· 1,520 Views · 2 Likes
article thumbnail
Essential Techniques for Production Vector Search Systems Part 2 - Binary Quantization
Proven techniques for production vector search including when to use each one, how to combine them effectively, trade offs to understand before deployment.
January 9, 2026
by Pavan Vemuri DZone Core CORE
· 1,907 Views · 3 Likes
article thumbnail
Multi-Region Apache Kafka using Synchronous Replication for Disaster Recovery With Zero Data Loss (RPO=0)
Kafka isn’t one-size-fits-all. Choose between self-managed, serverless, or BYOC deployments. New RPO=0 options now enable zero data loss for real-time applications.
January 9, 2026
by Kai Wähner DZone Core CORE
· 1,582 Views · 2 Likes
article thumbnail
Essential Techniques for Production Vector Search Systems Part 1 - Hybrid Search
Proven techniques for production vector search including when to use each one, how to combine them effectively, and trade offs to understand before deployment.
January 8, 2026
by Pavan Vemuri DZone Core CORE
· 1,936 Views · 2 Likes
article thumbnail
Secure Log Tokenization Using Aho–Corasick and Spring
This article shows how to use the Aho–Corasick algorithm and deterministic tokenization in Spring Boot to intercept logs in real time, remove sensitive values.
January 8, 2026
by Balakumaran Sugumar
· 1,941 Views · 3 Likes
article thumbnail
Solving the Cold Start Problem in Edge AI: A Guide to Data-Saving Learning
Update edge AI models efficiently using Mix Up and contribution sampling to overcome domain shift with minimal data, ensuring continuous evolution without forgetting.
January 6, 2026
by Dippu Kumar Singh
· 3,559 Views
article thumbnail
Metadata, Not Data Volume, Is the Real Bottleneck in Modern Data Lakes
In Apache Iceberg data lakes, growing snapshots and manifests often make metadata resolution — not data scanning — the primary performance bottleneck.
January 6, 2026
by Vivek Venkatesan
· 3,294 Views
article thumbnail
Why Data Engineers Need to Think Like Product Managers
Data engineers who think like product managers build more valuable, trusted, and user-centric data systems; they focus on outcomes, ownership, and UX, not just pipelines.
January 6, 2026
by harshraj bhoite
· 2,222 Views
article thumbnail
Understanding Parquet Scans: How Readers Skip Work and Stay Fast
Parquet accelerates scans by skipping data through metadata driven pushdowns. This article explains how the main mechanisms work in practice.
January 6, 2026
by Hitarth Trivedi
· 2,399 Views · 1 Like
article thumbnail
A Practical Guide to Semantic Caching With Redis LangCache
Learn how to use Redis LangCache to semantically cache LLM prompts and responses, reducing inference costs and improving performance.
January 6, 2026
by Subhashini Raman
· 2,794 Views · 2 Likes
article thumbnail
Unlocking Hidden Value in Dirty Data: A Practical NLP Pattern for Legacy Records
Legacy systems are full of free-text fields where valuable business data goes to die. NLP pipelines turn messy maintenance logs into structured, actionable insights.
January 5, 2026
by Dippu Kumar Singh
· 1,293 Views · 5 Likes
article thumbnail
Securing Verifiable Credentials With DPoP: A Spring Boot Implementation
DPoP binds access tokens to a client's key so even if intercepted, they can't be misused. It's mandatory for EUDI/HAIP 1.0 and supported since Spring Boot 3.5.
January 5, 2026
by Kyriakos Mandalas DZone Core CORE
· 3,853 Views · 4 Likes
article thumbnail
Tired of Reverse-Engineering Code? A Data-First Pattern for Legacy Modernization
Legacy modernization fails because teams try to decipher millions lines of code. Here is a pattern to slim down systems by reverse-engineering the data.
January 2, 2026
by Dippu Kumar Singh
· 2,539 Views · 2 Likes
article thumbnail
LLMs in Data Engineering: How Generative AI is Changing ETL and Analytics
LLMs reshape data engineering by automating ETL tasks, enabling natural language analytics, and empowering faster, smarter decision-making without replacing engineers.
January 1, 2026
by harshraj bhoite
· 2,685 Views · 1 Like
  • Previous
  • ...
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • ...
  • Next
  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook
×