DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

The Latest Big Data Topics

article thumbnail
Integrating Data Engineering Into Artificial Intelligence
Importance of tight coupling between data engineering and artificial intelligence for data engineers and data scientists and best practices.
September 26, 2024
by Akshay Agarwal
· 3,716 Views · 3 Likes
article thumbnail
Using Flink, Kafka, and NiFi for Real-Time Airport Arrivals and Departures
Learn to build a streaming application using the best of NiFi, Kafka, and Flink for event-driven apps. OpenSky networks rest feeds provide all the data.
September 26, 2024
by Timothy Spann
· 4,503 Views · 5 Likes
article thumbnail
Automating Data Pipelines With Snowflake: Leveraging DBT and Airflow Orchestration Frameworks for ETL/ELT Processes
This article explores the automation of data pipelines using Snowflake, dbt, and Airflow, detailing best practices for efficient data processing and orchestration.
September 25, 2024
by Harshavardhan Yedla
· 5,442 Views · 2 Likes
article thumbnail
Feature Engineering Transforming Predictive Models
Delve into the transformative power of feature engineering in applied machine learning, and learn how carefully crafted features can elevate your models.
September 25, 2024
by Sundeep Goud Katta
· 4,055 Views · 3 Likes
article thumbnail
What Is a Data Pipeline?
In this post, we examine data pipelines, explore their benefits, compare them to other data processes, and discuss various implementation methods.
Updated September 24, 2024
by Garrett Alley
· 52,861 Views · 18 Likes
article thumbnail
Navigating the Regulatory Maze: Simplifying Data Compliance
Explore how IT professionals can address evolving regulatory compliance challenges in data management, reducing organizational risks and costs.
September 20, 2024
by Tom Smith DZone Core CORE
· 4,395 Views · 1 Like
article thumbnail
Kafka Message Testing
Utilizing tools like RecordCaptor, as well as adhering to isolation principles and clear separation of test stages, ensures high accuracy and efficiency.
September 18, 2024
by Anton Belyaev
· 4,625 Views · 8 Likes
article thumbnail
Setting Up Secure Data Lakes for Starlight Financial: A Guide to AWS Implementation
This guide delves into securing financial data lakes with AWS services, focusing on best practices for data protection and compliance.
September 13, 2024
by Harsh Daiya DZone Core CORE
· 5,645 Views · 1 Like
article thumbnail
Optimizing Data Management for AI Success: Industry Insights and Best Practices
Explore key strategies for effective data management in AI projects, including real-time access, federated queries, and data literacy for developers and engineers.
September 11, 2024
by Tom Smith DZone Core CORE
· 5,588 Views · 1 Like
article thumbnail
The Significance of Complex Event Processing (CEP) With RisingWave for Delivering Accurate Business Decisions
Learn more about CEP, how it addresses a key challenge in real-time processing by detecting patterns in data streams, and compare FlinkCEP and RisingWave.
September 11, 2024
by Gautam Goswami DZone Core CORE
· 5,261 Views · 1 Like
article thumbnail
Real-Time GenAI With RAG Using Apache Kafka and Flink to Prevent Hallucinations
Learn about context-specific real-time Generative AI (GenAI) with Retrieval Augmentation Generation (RAG) using Kafka and Flink to prevent hallucinations.
September 11, 2024
by Kai Wähner DZone Core CORE
· 4,668 Views · 2 Likes
article thumbnail
Exploring Real-Time Data Ingestion Into Snowflake Using CockroachDB, Redpanda, and Kafka Connect
Explore Kafka Connect as a solution to stream changefeeds into Snowflake for greater control over how messages are delivered to Snowflake.
September 10, 2024
by Artem Ervits DZone Core CORE
· 5,863 Views · 1 Like
article thumbnail
Accelerate Your Journey to a Modern Data Platform Using Coalesce
This article will identify key challenges organizations face today in managing data platforms, and explore how advanced ETL tools can address these challenges.
September 10, 2024
by Asia Banu Shaik
· 5,795 Views · 5 Likes
article thumbnail
Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC
This article compares the performance and cost efficiency of three storage formats Parquet, Avro, and ORC on Google Cloud Platform.
September 9, 2024
by Rahul Sarabu
· 8,343 Views · 4 Likes
article thumbnail
Principles of Modern Data Infrastructure
Explore principles of modern data infrastructure such as scalability, high availability, speed, security, maintainability, efficiency, and developer experience.
September 6, 2024
by Joe Zhou
· 5,515 Views · 1 Like
article thumbnail
Keeping Two Multi-Master Databases Aligned With a Vector Clock
In this article, learn about an experience in keeping two different databases aligned with two different technologies by using an application-level solution.
September 5, 2024
by Claudio Guidi DZone Core CORE
· 6,897 Views · 4 Likes
article thumbnail
Setting Up a Data Warehouse for Starlight: A Comprehensive Guide
Learn architectural considerations, essential tools, and technologies, and see sample code snippets to illustrate key steps of a data warehouse setup.
September 5, 2024
by Harsh Daiya DZone Core CORE
· 4,224 Views · 1 Like
article thumbnail
How to Conduct Effective Data Security Audits for Big Data Systems
Learn key strategies for conducting thorough data security audits in big data systems to safeguard sensitive information.
September 4, 2024
by Devin Partida
· 5,547 Views · 1 Like
article thumbnail
MLOps: How to Build a Toolkit to Boost AI Project Performance
AI projects could end up among the 90% that fail due to common implementation pitfalls. Here, learn how to change the game with the right MLOps tools.
September 3, 2024
by Alexander Simonov
· 7,131 Views · 4 Likes
article thumbnail
Open Standards for Data Lineage: OpenLineage for Batch and Streaming
Explores trends and efforts to provide an open standard with OpenLineage, and how data governance solutions help fulfill enterprise-wide data governance needs.
September 3, 2024
by Kai Wähner DZone Core CORE
· 6,462 Views · 1 Like
  • Previous
  • ...
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • ...
  • Next
  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook
×