Big Data Resources

A Guide to Building Data Intelligence Systems: Strategic Practices to Building Robust, Ethical, and AI-Driven Data Structures

The foundation of data intelligence systems centers around transparency, governance, and the ethical and responsible exploitation of cutting-edge technologies, particularly GenAI.

November 8, 2024

by Frederic Jacquet

CORE

· 2,209 Views · 2 Likes

The Data (Pipeline) Movement: A Guide to Real-Time Data Streaming and Future Proofing Through AI Automation and Vector Databases

Dive into the essential strategies for leveraging real-time data streaming, AI automation, and vector databases to drive actionable insights.

November 7, 2024

by Tuhin Chattopadhyay

CORE

· 2,336 Views · 3 Likes

Building Scalable AI-Driven Microservices With Kubernetes and Kafka

AI microservices, Kubernetes, and Kafka enable scalable, resilient intelligent applications through modular architecture and efficient resource management.

November 6, 2024

by Dileep Kumar Pandiya

· 4,488 Views · 3 Likes

Leveraging Apache Flink Dashboard for Real-Time Data Processing in AWS Apache Flink Managed Service

Find out how to utilize the Apache Flink Dashboard for monitoring, optimizing, and managing real-time data processing applications within AWS-managed services.

November 6, 2024

by Sneha Murganoor

· 25,628 Views · 7 Likes

Real-Time Data Streaming on Cloud Platforms: Leveraging Cloud Features for Real-Time Insights

Explore the key aspects of real-time data streaming and analytics on cloud platforms, including architectures, integration strategies, and future trends.

November 6, 2024

by Sreenath Devineni

CORE

· 13,001 Views · 3 Likes

Optimizing Your Data Pipeline: Choosing the Right Approach for Efficient Data Handling and Transformation Through ETL and ELT

ETL and ELT are vital for data integration and accessibility. Learn how to select the right approach based on your infrastructure, data volume, data complexity, and more.

November 5, 2024

by Ebere Oyekwe

· 13,971 Views · 3 Likes

The Modern Era of Data Orchestration: From Data Fragmentation to Collaboration

By embracing composability, organizations can position themselves to simplify governance and benefit from the greatest advances happening in our industry.

November 4, 2024

by Joel Lubinitsky

· 13,569 Views · 6 Likes

Digitalization of Airport and Airlines With IoT and Data Streaming Using Kafka and Flink

Take an in-depth look at IoT and data streaming using Kafka and Flink at airports such as Schiphol AMS and airlines like Lufthansa and Cathay.

November 4, 2024

by Kai Wähner

CORE

· 9,878 Views · 2 Likes

Optimizing Vector Search Performance With Elasticsearch

Optimize vector search in Elasticsearch through dimensionality reduction, efficient indexing, and automated parameter tuning for faster, more accurate results.

November 4, 2024

by Venkata Gummadi

· 17,624 Views · 5 Likes

Data Governance Essentials: Glossaries, Catalogs, and Lineage (Part 5)

Discover how business glossaries, data catalogs, and data lineage work together to enhance data quality, compliance, transparency, and operational efficiency.

November 1, 2024

by Sukanya Konatam

· 12,821 Views · 2 Likes

How to Identify Bottlenecks and Increase Copy Activity Throughput in Azure Data Factory

Learn how to optimize the throughput of a copy activity in Azure Data Factory by identifying bottlenecks, scaling integration runtimes, and more.

October 31, 2024

by Aravind Nuthalapati

· 38,433 Views · 3 Likes

Inside the World of Data Centers

There is a constant shift towards cloud-native applications. Take a look at the inner workings of the data centers that host this cloud infrastructure.

October 29, 2024

by Arpit Jain

· 5,061 Views · 1 Like

How to Design Event Streams, Part 1

This article series addresses commonly asked questions, best practices, practical examples, and info on how to get started with event-driven architectures.

October 28, 2024

by Adam Bellemare

· 6,157 Views · 2 Likes

The Power of Market Disruption: How to Detect Fraud With Graph Data

Market disruptors pave the way for innovation and break barriers once considered bulletproof. PuppyGraph uses market disruption and graph data to detect fraud.

October 28, 2024

by John Vester

CORE

· 58,217 Views · 3 Likes

Reactive Kafka With Spring Boot

Learn how to build generic, easily configurable, testable reactive consumers, producers, and DLT with Kotlin, Spring Boot, WebFlux, and Testcontainers.

October 25, 2024

by Ion Pascari

· 12,430 Views · 11 Likes

High-Speed Real-Time Streaming Data Processing

The article discusses the need for streaming data processing and evaluates the available options. It explains why a one-size-fits-all approach is not appropriate.

October 24, 2024

by Ashish Karalkar

· 9,676 Views · 3 Likes

Minimizing Latency in Kafka Streaming Applications That Use External API or Database Calls

Tired of latency slowing down your Kafka consumers? Learn how async operations, batching, and reactive frameworks like Spring WebFlux can help.

October 23, 2024

by Abhishek Goswami

· 3,147 Views · 3 Likes

Leveraging Event-Driven Data Mesh Architecture With AWS for Modern Data Challenges

Explore event-driven data mesh architecture and how, when combined with AWS, it becomes a robust solution for addressing complex data management challenges.

October 23, 2024

by Sunil Sharma

· 6,933 Views · 11 Likes

Building Predictive Analytics for Loan Approvals

Here, explore various techniques for loan approvals, using models like Logistic Regression and BERT, and applying SHAP and LIME for model interpretation.

October 23, 2024

by Akmal Chaudhri

CORE

· 4,364 Views · 1 Like

Automate Private Azure Databricks Unity Catalog Creation

This article provides detailed steps to automate the creation of a private Databricks Unity Catalog in an Azure subscription.

October 21, 2024

by Soumya Barman

· 3,315 Views · 1 Like

The Latest Big Data Topics