
Presentation: Scalability Challenges in Big Data Science



Yesterday I gave a talk on scalability and machine learning at the Berlin Buzzwords conference. In it, I gave an overview of different ways to scale data analysis and machine learning methods. I covered MapReduce (of course) and large-scale training of SVMs via stochastic gradient descent, but also stream mining and real-time analysis (as you know, “you don’t just scale into real-time”).
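To make the idea of training SVMs via stochastic gradient descent a bit more concrete, here is a minimal sketch. It is not taken from the talk: it assumes a Pegasos-style update on the hinge loss, a linear model without a bias term, and a hypothetical toy data set. The function names and the parameters `lam` and `epochs` are made up for illustration.

```python
import random


def train_linear_svm_sgd(samples, labels, lam=0.01, epochs=20, seed=0):
    """Train a linear SVM (no bias term) with Pegasos-style SGD.

    samples: list of feature lists; labels: list of +1 / -1.
    lam is the L2 regularization strength (an assumed default).
    Returns the weight vector as a list of floats.
    """
    rng = random.Random(seed)
    w = [0.0] * len(samples[0])
    order = list(range(len(samples)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)  # visit the examples in random order
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            x, y = samples[i], labels[i]
            margin = y * sum(wj * xj for wj, xj in zip(w, x))
            # Gradient step on the L2 regularizer: shrink the weights.
            w = [(1.0 - eta * lam) * wj for wj in w]
            # Gradient step on the hinge loss, only if the margin is violated.
            if margin < 1.0:
                w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def predict(w, x):
    """Return +1 or -1 depending on the side of the hyperplane."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1


# Hypothetical toy data: two well-separated clusters.
X = [[2.0, 2.0], [3.0, 1.0], [1.0, 3.0], [-2.0, -2.0], [-3.0, -1.0], [-1.0, -3.0]]
y = [1, 1, 1, -1, -1, -1]
w = train_linear_svm_sgd(X, y)
print([predict(w, x) for x in X])
```

Each update touches a single example, so memory use is independent of the data set size, which is what makes this kind of training attractive at scale.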

The conference continues today; you can follow it on Twitter under the #bbuzz hashtag.

Update: On Scribd, the hyperlinks are somehow lost, so here is the list:

Scalable Databases

Multithreading and Messaging Frameworks

MapReduce

Large Scale Classifier Training

Other frameworks

Stream processing

TWIMPACT:

 


Published at DZone with permission of Mikio Braun, DZone MVB.

