Over a million developers have joined DZone.

Database Concepts for a java Dev: Database Normalization

Build fast, scale big with MongoDB Atlas, a hosted service for the leading NoSQL database. Try it now! Brought to you in partnership with MongoDB.

In this part, I will be briefing about different types of Database Normalizations using a sample data model.

What is Database Normalization?

Normalization is the process of efficiently organizing data in the database.

Primary Goal of Normalization?

Eliminating redundant data & ensuring meaningful data dependencies.

Types of Normalization

The following are the three most common normal forms in the database normalization process

  1. First Normal Form (1NF)
  2. Second Normal Form (2NF)
  3. Third Normal Form (3NF)

Sample Data Model for Demonstration

The following data model will be used to demonstrate all the three normal forms

First Normal Form (1NF)

First Normal Form (1NF) sets the very basic rules for an organized database:

  • Create separate set of tables for each group of related data and identify each row with a unique columns [primary key] or set of columns [composite key]
  • Eliminate duplicate columns from the table

The following data model depicts the tables after 1NF rules are applied -

Second Normal Form (2NF)

Second Normal Form (2NF) further addresses the concept of removing duplicate data:

  • Meet all the requirements of the first normal form
  • Remove subsets of data that apply to multiple rows of a table and place them in separate tables
  • Create relationships between these new tables and their predecessors through the use of foreign keys

So basically the objective of the Second Normal Form is to take that is only partly dependent on the primary key and enter that data into another table.

The following data model depicts the tables after 2NF rules are applied.

Data from EMPLOYEE_TABLE is split into 2 tables – EMPLOYEE_TABLE and EMPLOYEE_HR_TABLE.

Similarly data from CUSTOMER_TABLE is moved to CUSTOMER_TABLE and CUSTOMER_ORDER table

Third Normal Form (3NF)

Third normal form (3NF) goes one large step further:

  • Meet all the requirements of the second normal form.
  • Remove columns that are not dependent upon the primary key.

The following data model depicts the tables after 3NF rules are applied.

Further state and country details are moved to their own tables because they are not dependent on the primary key.

Advantages of Normalizing the Database

There are several advantages of normalization -

  1. Data can be stored as small atomic pieces
  2. Saves space
  3. Increases speed
  4. Reduces data anomalies
  5. Easy maintenance

Other parts of this series include:

Part 1 – ACID Properties

Part 2 – Keys

Part 4 – Database Transactions [coming soon]

Part 5 – Indexes [coming soon]

Now it's easier than ever to get started with MongoDB, the database that allows startups and enterprises alike to rapidly build planet-scale apps. Introducing MongoDB Atlas, the official hosted service for the database on AWS. Try it now! Brought to you in partnership with MongoDB.


Published at DZone with permission of Jagadeesh Motamarri, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}