Over a million developers have joined DZone.

Bulk Loading UK Crimes Data

DZone's Guide to

Bulk Loading UK Crimes Data

· Big Data Zone
Free Resource

Learn how you can maximize big data in the cloud with Apache Hadoop. Download this eBook now. Brought to you in partnership with Hortonworks.

Yesterday, the UK Police has launched a hub for accessing their data. Read the announcement here: http://www.data.gov.uk/blog/the-launch-of-datapoliceuk

You can find it here: http://data.police.uk/

The data extracts are here: http://data.police.uk/data/

I wanted to have a look at this and see what can be done with it, but there are dozens of csv files to load in to get the full view, so this is a quick post on what I’ve been doing this evening. :-)

Having found an interesting post on loading multiple CSV files in T-SQL , I’ve updated it to load the files in. First though, we need to create a table to take the data.

Create the Crime_Info table

An initial table to receive the data is shown below:

CREATE TABLE [dbo].[Crime_Info]( 
  [Crime ID] [varchar](100) NULL, 
  [Month] [varchar](100) NULL, 
  [Reported by] [varchar](100) NULL, 
  [Falls within] [varchar](100) NULL, 
  [Longitude] [float] NULL, 
  [Latitude] [float] NULL, 
  [Location] [varchar](100) NULL, 
  [LSOA code] [varchar](100) NULL, 
  [LSOA name] [varchar](100) NULL, 
  [Crime type] [varchar](200) NULL, 
  [Last outcome category] [varchar](max) NULL, 
  [Context] [varchar](max) NULL 

Note that I Page compressed the table as I wanted to reduce the space it took up. It went down from 3GB to around 1.5GB.

Load the Crime files

I’ve modified the bulk load script mentioned above, and you can download it here:


Once loaded, I had 15,668,549 rows of data, sitting in a 1.2GB database.

Then, by playing with PowerPivot you can create ‘fun’ visualisations like this.


Have fun, and as they say on Crimewatch: “Don’t have nightmares, do sleep well.”

Hortonworks DataFlow is an integrated platform that makes data ingestion fast, easy, and secure. Download the white paper now.  Brought to you in partnership with Hortonworks


Published at DZone with permission of Nick Haslam, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}