DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

The Latest Big Data Topics

article thumbnail
AI Agents for Data Warehousing
AI agents are revolutionizing data warehousing by enhancing efficiency, accuracy, and automation across various aspects of data management today.
March 4, 2025
by Ajay Tanikonda
· 4,502 Views · 1 Like
article thumbnail
Materialized Views in Data Stream Processing With RisingWave
Materialized views enhance data streaming by improving incremental computation, enabling efficient retrieval and calculation of aggregated or pre-processed data.
March 3, 2025
by Gautam Goswami DZone Core CORE
· 3,602 Views · 2 Likes
article thumbnail
Modern Data Processing Libraries: Beyond Pandas
In this article, we explore the alternatives to pandas for data processing and data analysis. We'll compare and contrast based on performance.
March 3, 2025
by Vidyasagar (Sarath Chandra) Machupalli FBCS DZone Core CORE
· 6,476 Views · 6 Likes
article thumbnail
Doris Lakehouse Integration: A New Approach to Data Analysis
Doris Lakehouse Integration bridges data lakes and warehouses and enables seamless access, faster queries, unified management, and greater data value.
February 28, 2025
by Darren Xu
· 6,249 Views · 3 Likes
article thumbnail
Exploring IoT's Top WebRTC Use Cases
WebRTC can handle both high-quality media streaming and efficient data sharing, making it a versatile tool for device developers.
February 28, 2025
by Carsten Rhod Gregersen
· 3,796 Views · 1 Like
article thumbnail
Modern ETL Architecture: dbt on Snowflake With Airflow
Build a scalable ETL pipeline with dbt, Snowflake, and Airflow, and address data engineering challenges with modular architecture, CI/CD, and best practices.
February 27, 2025
by Digvijay Waghela
· 6,183 Views · 2 Likes
article thumbnail
Top Methods to Improve ETL Performance Using SSIS
Improve ETL performance in SSIS with parallel extraction, optimized transformations, and proper configuration of concurrency, batch sizes, and data types.
February 27, 2025
by DZone Editorial
· 5,617 Views · 1 Like
article thumbnail
Cloud-Driven Analytics Solution Strategy in Healthcare
Detailed insights into compute resource management, cluster optimization, storage efficiency, and cost governance in cloud-based environments.
February 27, 2025
by Abrar Ahmed Syed
· 4,800 Views · 5 Likes
article thumbnail
How to Scale Elasticsearch to Solve Your Scalability Issues
Scaling Elasticsearch requires balancing sharding, query performance, and memory tuning for optimal efficiency in high-traffic, real-time applications.
February 26, 2025
by Vivek Kumar
· 7,802 Views · 3 Likes
article thumbnail
Spark Job Optimization
Spark jobs can be optimized to maximize resource utilization in a cluster, improving performance and reducing costs for large-scale data processing.
February 25, 2025
by Chandra Shekar r Chekuri
· 3,283 Views · 1 Like
article thumbnail
The Future of Data Lakehouses: Apache Iceberg Explained
This blog post is the first in a three-part series exploring Apache Iceberg and its role in modern data architectures and the emergence of data lakehouses.
February 25, 2025
by Fawaz Ghali, PhD DZone Core CORE
· 4,079 Views · 5 Likes
article thumbnail
The Hidden Cost of Dirty Data in AI Development
Dirty data weakens AI, increases costs, introduces bias, and causes compliance risks. Strong data governance ensures reliable AI outcomes.
February 25, 2025
by Ilya Dudkin DZone Core CORE
· 4,006 Views · 3 Likes
article thumbnail
Deduplication of Videos Using Fingerprints, CLIP Embeddings
Video deduplication optimizes storage by removing duplicates using techniques like segmentation, embeddings, and clustering to manage massive datasets efficiently.
February 21, 2025
by Praneeth Reddy Vatti
· 6,844 Views · 5 Likes
article thumbnail
Scaling Image Deduplication: Finding Needles in a Haystack
Learn to efficiently deduplicate 100M+ images using distributed architectures, embeddings, FAISS for ANN search, and clustering to ensure accurate results.
February 20, 2025
by Praneeth Reddy Vatti
· 5,751 Views · 3 Likes
article thumbnail
Data Pattern Automation With AI and Machine Learning
Pattern recognition and AI improve data workflows, automate insights, and drive efficiency in business processes across industries.
February 19, 2025
by Sandip Gami
· 4,097 Views · 2 Likes
article thumbnail
ETL Generation Using GenAI
Learn about how GenAI automates ETL pipelines, generates code, adapts to schema changes, and improves data processes with speed, efficiency, and precision.
February 14, 2025
by Ramesh Daddala
· 6,410 Views · 2 Likes
article thumbnail
Loading XML into MongoDB
Learn how to export XML data to MongoDB using SmartXML ETL tools, simplifying the process and ensuring efficient data handling and storage.
February 12, 2025
by Luca Sanders
· 6,745 Views · 1 Like
article thumbnail
The Right ETL Architecture for Multi-Source Data Integration
Dedicated ETL pipelines are easy to set up but hard to scale, while common pipelines offer efficiency at the cost of complexity. Know which one to choose.
February 12, 2025
by Murat Balkan DZone Core CORE
· 6,061 Views · 1 Like
article thumbnail
SQL as the Backbone of Big Data and AI Powerhouses
SQL powers Big Data and AI with tools like BigQuery, remaining a cornerstone of data-driven innovation through its simplicity and adaptability.
February 11, 2025
by Medha Gupta
· 3,508 Views · 1 Like
article thumbnail
Relational DB Migration to S3 Data Lake Via AWS DMS, Part I
This article discusses the challenges faced during relational database migration to AWS using DMS, including source data, logging, and network bandwidth issues.
February 7, 2025
by Vijay Bhosale
· 5,292 Views · 3 Likes
  • Previous
  • ...
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • ...
  • Next
  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook
×