Data Engineering Resources

A Guide to Deploying AI for Real-Time Content Moderation

A step-by-step guide to deploying an AI-powered real-time content moderation system, covering data collection, processing, models, and human review.

January 20, 2025

by Rahul JAIN

· 2,020 Views

Building a Reactive Event-Driven App With Dead Letter Queue

Learn how to build fault-tolerant, reactive event-driven applications using Spring WebFlux, Apache Kafka, and a Dead Letter Queue to guard against data loss.

January 20, 2025

by Sulakshana Singh

· 3,189 Views · 4 Likes

Optimizing Prometheus Queries With PromQL

Count worker nodes and track resource changes in Prometheus using PromQL. Explore queries, best practices, and dynamic thresholds for Kubernetes monitoring.

January 20, 2025

by Ganesh Bhat

· 7,347 Views · 4 Likes

Troubleshooting Connection Issues When Connecting to MySQL Server

Resolve MySQL connection issues with step-by-step fixes for errors like access denied, host not allowed, and authentication protocol mismatches.

January 20, 2025

by Shareef Sha

· 2,861 Views · 3 Likes

Chain-of-Thought Prompting: A Comprehensive Analysis of Reasoning Techniques in Large Language Models

Chain-of-thought (CoT) prompting enables LLMs to improve their reasoning capabilities. This paper explores various CoT techniques and their practical limitations.

January 20, 2025

by Pier-Jean MALANDRINO

CORE

· 3,099 Views · 3 Likes

Creating Artificial Doubt Significantly Improves AI Math Accuracy

LLMs are better at math with a "verified reasoning trajectory" — an opportunity to review their steps and determine if the math they're doing makes sense.

January 17, 2025

by mike labs

· 2,694 Views

Dark Data: Recovering the Lost Opportunities

Dark data is the vast amount of unstructured information that organizations collect but often leave unused. It includes emails, customer interactions, sensor data, and more.

January 17, 2025

by Vijay Singh Khatri

CORE

· 2,935 Views

Business Logic Database Agent

A natural language agent to create database business logic using declarative rules: an admin web app, API, and database in 1 minute. Download to customize in your IDE.

January 17, 2025

by Val Huber

CORE

· 3,009 Views · 4 Likes

Talk to Your Project: An LLM Experiment You Can Join and Build On

Find out the results of my experiment integrating an LLM into a console application and the insights I gained from the experience.

January 17, 2025

by Filipp Shcherbanich

CORE

· 2,646 Views · 2 Likes

Schema Changes Are a Blind Spot

Schema changes and migrations can quickly spiral into chaos, leading to significant challenges. Let's see how to overcome these challenges.

January 17, 2025

by Adam Furmanek

CORE

· 2,935 Views · 4 Likes

ArangoDB: Achieving Success With a Multivalue Database

ArangoDB's multimodel capabilities simplify handling key-value, document, and graph data in one database. Jakarta NoSQL enables seamless integration.

January 16, 2025

by Otavio Santana

CORE

· 3,091 Views · 1 Like

Understanding Leaderless Replication for Distributed Data

Learn about leaderless replication: its trade-offs, direct writes vs. coordination-based approaches, failure handling, and commercial databases in distributed systems.

January 16, 2025

by Stelios Manioudakis, PhD

CORE

· 8,311 Views · 4 Likes

Feature Flags in .NET 8 and Azure

This article describes the importance of feature flags and how they change the way we develop applications while reducing the risk of regressions.

January 16, 2025

by Naga Santhosh Reddy Vootukuri

CORE

· 4,494 Views · 3 Likes

You Need to Validate Your Databases

Without proper management, the risk of misconfigured databases increases — just as Heroku experienced. Learn how to steer clear of similar mistakes.

January 16, 2025

by Adam Furmanek

CORE

· 4,574 Views · 6 Likes

In this tutorial, we will take an in-depth look at Google Analytics Hub, a tool for securely sharing and accessing data that simplifies collaboration and analysis.

January 16, 2025

by Karteek Kotamsetty

· 11,756 Views · 3 Likes

Efficient Long-Term Trend Analysis in Presto Using Datelists

To make long-term trend analysis easier, we can leverage datelists, where each metric value is stored sequentially in an array, one entry per date.

January 15, 2025

by Ajay Krishnan Prabhakaran

· 3,525 Views

Kafka vs NATS: A Comparison for Message Processing

Kafka and NATS are both popular tools for message processing. This article compares the two.

January 15, 2025

by Josson Paul Kalapparambath

· 4,209 Views · 2 Likes

Consistency Conundrum: The Challenge of Keeping Data Aligned

Maintaining consistency is crucial to ensure a unified view of the data, which is essential for the correct functioning of distributed applications.

January 15, 2025

by Ammar Husain

· 2,183 Views

Bye Tokens, Hello Patches

Meta's BLT architecture is a better way to scale LLMs, one that may lead us to replace tokenization with a patch-based approach.

January 15, 2025

by mike labs

· 3,451 Views · 2 Likes

Data-First IDP: Driving AI Innovation in Developer Platforms

A Data-First IDP integrates governance, traceability, and quality into workflows, transforming how data is managed and enabling scalable, AI-ready ecosystems.

January 15, 2025

by Paul Gale

· 5,818 Views · 2 Likes

The Latest Data Engineering Topics