Over a million developers have joined DZone.

Introducing Federated Analytics

· Big Data Zone

Hortonworks DataFlow is an integrated platform that makes data ingestion fast, easy, and secure. Download the white paper now.  Brought to you in partnership with Hortonworks

federated analytics

Federated analytics is a term I coined up to identify a specific capability offered by a data analytics platform. Federated analytics is the capability of joining various, distributed data sources and performing analytics as if they were a single data source.

If you consider a case where you have http access logs, a customer details spreadsheet and a live stream coming from an API gateway or an ESB. One possibility would be to combine the data in these three sources and understand in real time which of your customers are accessing your services through which services and from what location. If you consider combinations alone (based on the fields available in the data source), the numbers are daunting even with three data sources. What if there were 10s or 100s. With federated analytics, the capabilities that comes to understanding your data and even figuring out hidden trends becomes much easier and accessible, for an organization of any size.

Learn how you can modernize your data warehouse with Apache Hadoop. View an on-demand webinar now. Brought to you in partnership with Hortonworks.

Topics:

Published at DZone with permission of Tharindu Mathew, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

SEE AN EXAMPLE
Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.
Subscribe

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}