Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Access OData From Microsoft Power BI

DZone's Guide to

Access OData From Microsoft Power BI

You can leverage OData and start creating valuable visualizations with public data in Microsoft's Power BI in less than 10 minutes.

· Big Data Zone
Free Resource

See how the beta release of Kubernetes on DC/OS 1.10 delivers the most robust platform for building & operating data-intensive, containerized apps. Register now for tech preview.

OData, or “SQL for the Web,” is REST-based and enables anyone to query data over the web. We leveraged SF OpenData’s OData feed and visualized it in PowerBI.

We gave you a guide to OData and information on getting started. But how can you actually use it? Is it as easy as we say it is? We explore these questions with a fun example from San Francisco’s OData feed, “Film Locations in San Francisco.” They are just one of many examples of local and federal governments leveraging open data for transparency and business value. With this tutorial, you can leverage OData and start creating valuable visualizations in under 10 minutes.

Before You Begin

Before you start, you will need a tool that can utilize an OData feed. Some common tools are Excel (2010 and after), Microsoft PowerQuery, and Microsoft PowerBI. Today, I will be using PowerBI. If you would like to follow along, grab a free trial of the tool here.

Note: If you would like to create an OData API for any cloud app or database yourself, learn how in our Hybrid Connectivity page.

Adding Your OData Feed

In the PowerBI interface, you will need to begin by adding your OData Feed. In the home ribbon, click on the box that says Get Data.
Get Data

From here, you will choose OData Feed and will be prompted to add the feed in this box.
OData Feed

Go ahead and add fill in the OData endpoint for SF Open Data, https://data.sfgov.org/OData.svc/yitu-d5am, and hit OK.

Loading and Editing the Data

Once you have your OData endpoint set, you will see a preview screen where you can view the rows and columns you are adding. Hitting the Edit button allows you to parse your data and choose only what you want or need. In my case, I deleted the “fun_facts” column because it was not an easy column to visualize (due to it being non-formatted sentences).

If you are dealing with a larger data set, you will want to be sure to only load the data that you need. Depending on backend architecture (more on this later), the load time can vary dramatically between downloading a whole data set versus a filtered list. Once you are done editing your data, hit the Load button and apply your query.
Editing your data

Once your data has loaded, you should see your data loaded into the data view of PowerBI and it should look like this:
Data view

Visualizing Your Data

Now is the fun part! Let’s visualize the data we just brought in through the OData feed! Due to most of this dataset being string variables, I used treemaps and pie charts. I also leveraged the location data and visualized the filming locations within San Francisco using the map visualization.

Count of Locations by Director

Count of Locations by Director

In this visualization, we can see the top film directors based on the number of locations they have filmed in San Francisco. Notice some of the big names! Woody Allen, the Wachowskis, Clint Eastwood, and Alfred Hitchcock to name a few. When I saw this list, I was curious on why Andrew Haigh had so many film locations as opposed to more popular directors. It turns out that this list also includes episodes for TV shows, and the top positions were shows based in San Francisco.

Count of Locations by Title

Count by Title

In this visualization, we leveraged PowerBI’s pie chart visualization to see which titles had the most filming locations in the city. We can see how this chart relates to the count of locations by director in that Andrew Haigh’s show “Looking” is the most popular on this list. Are any of these movies or shows top on your list to watch soon? Let us know in the comments. I am always looking for suggestions!

Film Locations

In this visualization, I used the “Maps” to visualize all of the data in the location column on Bing Maps. One issue I ran into with this that is worth noting is that the formatting is not formatted like a normal address with numbers and zip codes. This caused some of the locations to be seen in multiple places in the U.S., so I decided to only show the locations that were specifically found in San Francisco.

Film Locations

Experiment With More Public Feeds

If you want more public data feeds to experiment with, the list below is a great place to start! OData.org has a great list as well in their live services section.

  1. Whitehouse Visitor Records
  2. Scottsdale Arizona Municipal Data
  3. Worldbank Open Data
  4. Nerddinner OData Feed
  5. San Francisco Film Location Data
  6. Cambridge Weather

New Mesosphere DC/OS 1.10: Production-proven reliability, security & scalability for fast-data, modern apps. Register now for a live demo.

Topics:
power bi ,odata ,big data ,tutorial ,data visualization ,data analytics

Published at DZone with permission of Austin Hellman, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}