Big Data Resources

Understanding Data Compaction in 3 Minutes

Think of your disks as a warehouse: The compaction mechanism is like a team of storekeepers who help put away the incoming data.

June 6, 2023

by Shirley H.

· 3,007 Views · 1 Like

Decoding ChatGPT: The Concerns We All Should Be Aware Of

OpenAI's ChatGPT has quickly gained popularity and reached 100 million monthly users. However, users must avoid over-reliance on ChatGPT and maintain human judgment.

June 6, 2023

by Vaibhav Ghildiyal

· 3,142 Views · 1 Like

The AI Revolution: Embracing the Future of Technology and Automation

The development and use of intelligent computer systems capable of doing activities that normally require human intelligence is referred to as AI technology.

June 6, 2023

by Rajinder Kumar

· 3,514 Views · 2 Likes

CPU vs. GPU Intensive Applications

Learn about CPU vs. GPU intensive applications and why each is best suited for particular tasks. We explain parallelization, Amdahl’s Law, and how these concepts affect whether to use a CPU, GPU, or both for your application.

June 5, 2023

by Kevin Vu

· 3,634 Views · 2 Likes

Constructing Real-Time Analytics: Fundamental Components and Architectural Framework — Part 2

The fundamental components of real-time analytics are event streaming and a specialized database that can manage large amounts of data in less than a second.

June 2, 2023

by David Wang

· 4,096 Views · 2 Likes

Automating Data Quality Check in Data Pipelines

Are you looking for ways to automate data quality checks in your data pipelines? Here are some helpful tools that can streamline the process for you.

June 2, 2023

by Hardik Shah

· 4,189 Views · 1 Like

10 Effective Strategies To Safeguard Your IoT Network From Threats

The security of IoT networks is a challenge that needs to be addressed at a global scale. This blog will uncover some risks associated with IoT network security.

June 1, 2023

by Kamal R

· 3,348 Views · 1 Like

The State of Data Streaming for Manufacturing

The state of data streaming in the manufacturing industry explores case studies from BMW, Mercedes, Michelin, and Siemens powered by Apache Kafka.

June 1, 2023

by Kai Wähner

CORE

· 2,383 Views · 2 Likes

Knowing and Valuing Apache Kafka’s ISR (In-Sync Replicas)

To get more clarity about ISR in Apache Kafka, we should first carefully examine the replication process in the Kafka broker.

June 1, 2023

by Gautam Goswami

CORE

· 4,065 Views · 1 Like

Replacing Apache Hive, Elasticsearch, and PostgreSQL With Apache Doris

How does a data service company build its data warehouse? Simplicity is the best policy. See how we increased data writing efficiency by 75%.

May 31, 2023

by Twan Wang

· 3,010 Views · 1 Like

Rule-Based Prompts: How To Streamline Error Handling and Boost Team Efficiency With ChatGPT

The art of creating rule-based prompts for ChatGPT is accelerating the development of apps and services for any industry, and in this article, I share how to do it.

May 31, 2023

by Denis Avramenko

· 4,634 Views · 2 Likes

Data Exploration Using Serverless SQL Pool In Azure Synapse

In this detailed article, you will learn how to carry out data ingestion and analysis using Azure Synapse Serverless SQL Pool along with sample datasets.

May 31, 2023

by Mohan Krishna Mangamuri

· 2,903 Views · 1 Like

Machine Learning Driving Innovation in the Digital Age

As machine learning (ML) is revolutionizing an enterprise's digital transformation, the path to successful ML implementation comes with its own set of challenges.

May 30, 2023

by Frederic Jacquet

CORE

· 3,421 Views · 1 Like

Seven Steps To Deploy Kedro Pipelines on Amazon EMR

In this post, the author explains how to launch an Amazon EMR cluster and how to deploy a Kedro project to run a Spark job.

May 30, 2023

by Jo Stichbury

CORE

· 3,657 Views · 2 Likes

Competing Consumers With Spring Boot and Hazelcast

Follow a simplified demo implementation of the competing Consumers pattern using Java, Spring Boot, and Hazelcast's distributed queue.

May 29, 2023

by Kyriakos Mandalas

CORE

· 8,367 Views · 12 Likes

How To Use Pandas and Matplotlib To Perform EDA In Python

In this article, we will explore how to use two popular Python libraries, Pandas and Matplotlib, to perform EDA.

May 29, 2023

by Stylianos Kampakis

· 6,461 Views · 5 Likes

What to Pay Attention to as Automation Upends the Developer Experience

The developer experience is changing faster than ever. Here's a primer on where things stand right now and what developers can expect.

May 26, 2023

by Shomron Jacob

· 5,520 Views · 2 Likes

What ChatGPT Needs Is Context

Like the calculator, the spreadsheet, and the internet, AI-driven LLM tools are likely to change HOW we do our work, but not the fact that humans will still be the ones doing the work in the first place.

May 26, 2023

by Leon Adato

· 5,819 Views · 3 Likes

Optimal Use of Snowflake Warehouse and Tables

The third article of this series presents a deep-dive into how efficiently we can use Snowflake tables and warehouses and how data sharing occurs across accounts.

May 26, 2023

by Kedhar Natekar

· 3,994 Views · 1 Like

5 Key Concepts for MQTT Broker in Sparkplug Specification

In the Sparkplug specification, an MQTT broker is an indispensable component to incorporate the five conceptual capabilities.

May 26, 2023

by Tao Dekun

· 4,673 Views · 2 Likes

The Latest Big Data Topics