Big Data/Analytics Zone is brought to you in partnership with:
  • submit to reddit
Alec Noller09/17/14
1488 views
0 replies

Dev of the Week: Chanwit Kaewkasi

This week we're talking to Chanwit Kaewkasi, Assistant Professor at the Suranaree University of Technology’s School of Computer Engineering in Thailand, co-developer of a series of low-cost Big Data clusters, and featured author in DZone's upcoming 2014 Guide to Big Data.

Chris Odell09/17/14
2 views
0 replies

Why I will Always Try And Find A Ready-Built Library

By the time you have developed something and fixed any issues with it, your version is simply not going to be as tested as a ready built component that is used by thousands of people.

Alec Noller09/15/14
1330 views
0 replies

Join Us For Our Big Data Twitter Q&A! #DZBigData

In anticipation of our 2014 Guide to Big Data, we have arranged for a panel of experts - Kirk Borne, Carla Gentry, Jonathan Ellis, and James G. Kobielus, to answer your Big Data questions on Twitter on Monday, September 22, 2014. To participate, simply ask a question using the hashtag #DZBigData.

Drew Harvey09/15/14
2782 views
0 replies

DB2 CONCAT (Concatenate) Function

The DB2 CONCAT function will combine two separate expressions to form a single string expression. It can leverage database fields, or explicitly defined strings as one or both expression when concatenating the values together.

Rob J Hyndman09/12/14
4658 views
0 replies

Generating quantile forecasts in R

A “quan­tile fore­cast” is a quan­tile of the fore­cast dis­tri­b­u­tion. Still assum­ing nor­mal­ity, we could gen­er­ate the fore­cast quan­tiles from 1% to 99% in R using...

Rick Delgado09/12/14
503 views
0 replies

How to Educate Employees on Keeping Data Safe

Computer Security breaches can end up costing even the average small business up to $200,000

Ajitesh Kumar09/11/14
1398 views
0 replies

How to Start a Big Data Practice

This article represents key aspects of starting up a Big Data practice in your organization. Currently, I have started working in the same area and this blog is the result of my research. Hope you find it useful.

Kai Wähner09/11/14
5354 views
0 replies

Comparison of Alternatives for Stream Processing and Streaming Analytics

The article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), when stream processing makes sense, and what technologies and products you can choose from. Comparison of open source and proprietary stream processing / streaming analytics alternatives: Apache Storm, Spark, IBM InfoSphere Streams, TIBCO StreamBase, Software AG's Apama, etc.

Alec Noller09/10/14
5340 views
0 replies

Dev of the Week: Adam Diaz

Every week at DZone, we feature a new developer/blogger to catch up and find out what he or she is working on now and what's coming next. This week we're talking to Adam Diaz, Hadoop Architect at the Teradata Big Data Center of Excellence and featured author in DZone's upcoming 2014 Guide to Big Data.

G. Ryan Spain09/05/14
3173 views
0 replies

Stinger.next: The Future of SQL in Hadoop

Hortonworks’ Stinger Initiative, which finished rolling out in April, expanded on the Hive engine to allow for interactive SQL queries at the Hadoop scale. Now Hortonworks has announced their next set of objectives for Hive, which they are calling Stinger.next.

G. Ryan Spain09/05/14
4707 views
0 replies

Changing Our Views on Using and Analyzing Big Data with Hadoop

In 2006, Hadoop became one predominant solution in the world of Big Data, and it remains a major player for processing Big Data today. But as needs for Big Data analysis expand and evolve, some analysts and developers consider Hadoop unable to perform to their standards.

Maarten Ectors09/05/14
3666 views
0 replies

Instant Big Data Stream Processing = Instant Storm

Every 6 months at Canonical, the company behind Ubuntu, I work on something technical to test our tools first hand and to show others new ideas. This time around I created an Instant Big Data solution, more concretely “Instant Storm”.

Mark Needham09/04/14
5719 views
0 replies

R: dplyr - group_by dynamic or programmatic field

In my last blog post I showed how to group timestamp based data by week, month and quarter. I wanted to pull this code out into a function. It turns out if we want to do this then we actually want the regroup function rather than group_by:

Trevor Parsons09/04/14
3256 views
0 replies

What is Syslog?

Syslog has been around for a number of decades and provides a protocol used for transporting event messages between computer systems and software applications. The protocol utilizes a layered architecture, which allows the use of any number of transport protocols for transmission of syslog messages.

G. Ryan Spain09/04/14
1294 views
0 replies

Big Data - Link Roundup - September 4, 2014

Links to Big Data Articles and Information, with recent articles on real-world applications of Big Data analysis, thoughts on new and different ways to look at Big Data, and tools for starting Big Data analysis.

Mark Needham09/04/14
2587 views
0 replies

R: ggplot - Cumulative frequency graphs

The first step was to transform the data so that I had a data frame where a row represented a day where a member joined the group. To turn that into a chart we can plug it into ggplot and use the cumsum function to generate a line showing the cumulative total:

Alec Noller09/03/14
7919 views
0 replies

The Best of DZone: August 27 - September 3

If you missed anything on DZone this week, now's your chance to catch up! This week's best include the anatomy of Hibernate dirty checking, the similarities of Swift and Scala, the Agile version of Superman vs. Batman, and more.

Mark Needham09/03/14
4478 views
0 replies

R: Grouping by week, month, quarter

In my continued playing around with R and meetup data I wanted to have a look at when people joined the London Neo4j group based on week, month or quarter of the year to see when they were most likely to do so.

Kin Lane09/03/14
2838 views
0 replies

6,482 Datasets Available Across 22 Federal Agencies In Data.json Files

A list of 22 federal agencies who have published data.json files.

Jennifer Wright09/03/14
990 views
0 replies

Big Data, Big Value

How valuable is big data? It’s an important question for developers, who need to be able to respond to ever-shifting markets quickly so they are not left behind.

Anders Abel09/02/14
3448 views
0 replies

A Geek's Nightmare

Last night I woke up after a night mare. A nightmare containing a future, “improved” version of powershell a competing blogger and Entity Framework Migrations. Slightly off topic, but I’ll share it anyway.

Rob J Hyndman08/29/14
996 views
0 replies

Forecasting with R in WA

On 23–25 Sep­tem­ber, I will be run­ning a 3-​​day work­shop in Perth on “Fore­cast­ing: prin­ci­ples and prac­tice” mostly based on my book of the same name.

Kai Wähner08/29/14
730 views
0 replies

Intelligent Business Process Management Suites (iBPMS) - The Next-Generation BPM for a Big Data World

I had a talk at ECSA 2014 in Vienna: The Next-Generation BPM for a Big Data World: Intelligent Business Process Management Suites (iBPMS), sometimes also abbreviated iBPM. I want to share the slides with you.

Mark Needham08/27/14
4098 views
0 replies

R: Rook - Hello world example - 'Cannot find a suitable app in file'

I’ve been playing around with the Rook library and struggled a bit getting a basic Hello World application up and running so I thought I should document it.

Mikio Braun08/27/14
4156 views
0 replies

Big Data & Machine Learning Convergence

As these two fields converge, work has to be done to provide the right set of mechanisms and abstractions. Right now I still think there is a considerable gap which we need to close over the next few years.