Top 10 June '21 Big Data Articles to Read Now
See the 10 most popular articles from the Big Data zone with topics covering Kafka, useful queries in Elasticsearch, PySpark joins, Hadoop shell commands, and much more!
Join the DZone community and get the full member experience.Join For Free
Big Data is now adapted by a lot of businesses. Its popularity and use are expanding globally. How awesome would it be to find top trending Big Data articles in one place so that you can always stay up to date with the latest trends in technology? We dug into Google analytics to find the top 10 most popular Big Data articles in June. Let's get started!
10. Kafka Administration and Monitoring UI Tools
Kafka is used for streaming data and much more! This article covers Kafka basics and Kafta Administration, Kafka Manager, and Monitoring tools.
9. Spring Boot and Kafka Configuration Tuning
This article covers setup for configuration tuning in an isolated environment and determines the Spring Boot, Kafka configuration, and best practices for moderate uses. Follow the step-by-step guide to tune your application.
8. Splitting CSV Files in Python
A large number of data stored in CSV files? Why not split them to track the data better! Refer to this article and split large CSV files using Python.
7. What Is Kafka? Everything You Need to Know
Kafka is often used in real-time streaming data architectures to provide real-time analytics. Learn its specific use cases and why it's exploding in popularity.
6. 23 Useful Elasticsearch Example Queries
Elasticsearch is used to store document-oriented and semi-structured data. Get to know the queries for Elasticsearch to make the best of Elastosearch.
5. PySpark Join Explained
PySpark provides multiple ways to combine data frames i.e. join, merge, union, SQL interface, etc. Similar to SQL join, this article explores more about PySpark Join and how to use it effectively to make the best of it.
4. Top 10 Hadoop Shell Commands to Manage HDFS
Are you aware of Hadoop? Here are the top 10 basic Hadoop HDFS operations managed through shell commands which are useful to manage files on HDFS clusters.
3. Learn R: How to Extract Rows and Columns From Data Frame
Learn command set in the R programming language, which could be used to extract rows and columns from a given data frame.
2. What Is Data Profiling?
Do you ever read a lot of raw data and summarize information based on it? Learn the meaning of Data Profiling. Know more on how to profile data and the challenges that occur in data profiling.
1. Setting Up and Running Apache Kafka on Windows OS
Need Apache Kafka on your windows machine? Here is a step-by-step guide to installing and running Apache ZooKeeper and Apache Kafka on a Windows OS.
Are we missing any of the latest topics in our Big Data Zone? If you have anything to share with our community, simply create a free account and click on the “Post” button in the top right of the navigation bar once logged in. For a better chance at having your article published, check out our Article Submission Guidelines, which provide information on what we can and cannot accept, as well as additional resources to help get you started.
Opinions expressed by DZone contributors are their own.