Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

On-Demand, Service-Based Big Data Integration for Collaboration [Slides]

DZone's Guide to

On-Demand, Service-Based Big Data Integration for Collaboration [Slides]

This research paper and slideshow tackle the challenges Big Data integration faces today and architects a possible solution for the problems.

Free Resource

See how the beta release of Kubernetes on DC/OS 1.10 delivers the most robust platform for building & operating data-intensive, containerized apps. Register now for tech preview.

Today, I presented my paper "Óbidos" at the VLDB DMAH workshop in Munich. The abstract and the presentation of the paper are given below.

Abstract

Biomedical research requires distributed access, analysis, and sharing of data from various disparate sources on an Internet scale. Due to the volume and variety of Big Data, materialized data integration is often infeasible or too expensive, considering the costs of bandwidth, storage, maintenance, and management. Óbidos (On-demand Big Data Integration, Distribution, and Orchestration System) provides a novel on-demand integration approach for heterogeneous distributed data. Instead of integrating data from the data sources to build a complete data warehouse as the initial step, Óbidos employs a hybrid approach of virtual and materialized data integrations. By allocating unique identifiers as pointers to virtually integrated data sets, Óbidos supports efficient data sharing among data consumers. We designed Óbidos as a generic service-based data integration system and implemented and evaluated a prototype for multimodal medical data.

Please find the full text of the paper here and the presentation below:

I mostly worked on this paper while I was doing my internship at Emory University. This is also my first paper to get accepted from UCLouvain/Belgium.

New Mesosphere DC/OS 1.10: Production-proven reliability, security & scalability for fast-data, modern apps. Register now for a live demo.

Topics:
big data ,big data integration ,distributed systems ,distributed data

Published at DZone with permission of Pradeeban Kathiravelu, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}