
Introducing Federated Analytics



Federated analytics is a term I coined to identify a specific capability offered by a data analytics platform: joining multiple distributed data sources and performing analytics on them as if they were a single data source.
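To make the idea of "as if they were a single data source" concrete, here is a minimal sketch using Python's built-in `sqlite3` module. Two independent in-memory databases stand in for two distributed sources, and one SQL query joins across them. The table names, columns, and sample rows are all hypothetical, and a real analytics platform would federate far more varied sources than this.

```python
import sqlite3


def build_federated_connection():
    """Create two separate in-memory databases reachable through one connection."""
    # The main database stands in for the first data source (hypothetical access log).
    conn = sqlite3.connect(":memory:")
    # ATTACH adds a second, independent database under the schema name "crm".
    conn.execute("ATTACH DATABASE ':memory:' AS crm")

    conn.execute("CREATE TABLE main.access_log (customer_id INTEGER, path TEXT)")
    conn.execute("CREATE TABLE crm.customers (id INTEGER, name TEXT)")

    # Illustrative sample data only.
    conn.executemany(
        "INSERT INTO main.access_log VALUES (?, ?)",
        [(1, "/orders"), (2, "/billing"), (1, "/billing")],
    )
    conn.executemany(
        "INSERT INTO crm.customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    return conn


def hits_per_customer(conn):
    """Run one query that joins tables living in two different databases."""
    query = """
        SELECT c.name, COUNT(*) AS hits
        FROM main.access_log AS l
        JOIN crm.customers AS c ON c.id = l.customer_id
        GROUP BY c.name
        ORDER BY c.name
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(hits_per_customer(build_federated_connection()))
```

The query itself does not care that the two tables live in different databases, which is the property federated analytics aims for at a much larger scale.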

Consider a case where you have HTTP access logs, a customer details spreadsheet, and a live stream coming from an API gateway or an ESB. One possibility would be to combine the data in these three sources and understand in real time which of your customers are accessing your services, through which services, and from what location. If you consider the possible combinations alone (based on the fields available in each data source), the numbers are daunting even with three data sources. What if there were tens or hundreds? With federated analytics, understanding your data and even uncovering hidden trends becomes much easier and more accessible for an organization of any size.
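The three-source scenario above can be sketched in plain Python. This is only an illustration under loud assumptions: the log lines, the spreadsheet columns, the event fields, and the choice of the client IP address as the join key are all invented for the example, and a real deployment would likely join on something more reliable than an IP address.

```python
import csv
import io
from collections import Counter

# Hypothetical HTTP access log (common log format, first token is the client IP).
ACCESS_LOG = """\
203.0.113.7 - - [10/Oct/2016:13:55:36 +0000] "GET /orders HTTP/1.1" 200 512
198.51.100.4 - - [10/Oct/2016:13:55:40 +0000] "GET /billing HTTP/1.1" 200 128
"""

# Hypothetical customer details spreadsheet exported as CSV.
CUSTOMERS_CSV = """\
customer_id,name,ip
c1,Acme,203.0.113.7
c2,Globex,198.51.100.4
"""


def gateway_events():
    """Stand-in for a live stream from an API gateway or ESB (fields are assumed)."""
    yield {"ip": "203.0.113.7", "service": "orders-api", "location": "Colombo"}
    yield {"ip": "198.51.100.4", "service": "billing-api", "location": "London"}
    yield {"ip": "203.0.113.7", "service": "billing-api", "location": "Colombo"}


def load_customers(csv_text):
    """Index customer names by IP address, the assumed join key."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["ip"]: row["name"] for row in reader}


def requests_by_ip(log_text):
    """Count access-log lines per client IP."""
    return Counter(line.split()[0] for line in log_text.splitlines() if line.strip())


def enrich(events, customers_by_ip, log_counts):
    """Join each streamed event with the spreadsheet and the access log."""
    for event in events:
        ip = event["ip"]
        yield {
            "customer": customers_by_ip.get(ip, "unknown"),
            "service": event["service"],
            "location": event["location"],
            "logged_requests": log_counts.get(ip, 0),
        }


if __name__ == "__main__":
    customers = load_customers(CUSTOMERS_CSV)
    counts = requests_by_ip(ACCESS_LOG)
    for row in enrich(gateway_events(), customers, counts):
        print(row)
```

Each enriched record answers the question posed above: which customer, through which service, and from what location. The hand-written join logic is exactly what grows unmanageable as sources multiply, and what a federated analytics capability is meant to take off your hands.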


Published at DZone with permission of

