Big Data/BI Zone is brought to you in partnership with:
  • submit to reddit
Eric Genesky06/19/13
107 views
0 replies

Big Data in Practice with Cassandra

Big Data is a fast growing trend in enterprise applications that comes with a novel promise compare to past technological revolutions . . .

Kay Cichini06/19/13
109 views
0 replies

Use R to Bulk-Download Digital Elevation Data with 1" Resolution

Here's a little r-script to convenientely download high quality digital elevation data, i.e. for the Alps, from HERE . . .

Trevor Parsons06/19/13
50 views
0 replies

Musings from an AWS Meetup

After opening up our new Boston office earlier this year (for any of you locals we’re down in the innovation district on Summer St) we finally got the chance to attend out first AWS Boston meetup.

Christopher Taylor06/18/13
354 views
0 replies

Analytics need for speed can cause you to crash and burn

Technology is allowing us to harness big data and understand it in milliseconds but will this quest for speed be your ultimate undoing?

Maarten Ectors06/18/13
581 views
0 replies

Presto – Facebook's Exabyte-Scale Query Engine

Presto is an ANSI-SQL compatible real-time data warehouse query engine so existing data tools should be working with it unlike Hive which needed special integration.

Pushpalanka Jay...06/18/13
480 views
0 replies

Useful Commands to Deal with SVN

The commands I came across with, while working with svn in Linux.

Eric Genesky06/18/13
49 views
0 replies

DevOps and Security

Helen Bravo, of the Open Web Application Security Project, presents a 35-minute discussion at Snowfroc 2013.

Justin Bozonier06/17/13
2367 views
0 replies

Fuzzy Puzzles: Having My Baby

A friend at work, Drew Fustin, proposed this puzzle in our group chat one day as I was meandering on about Bayesian shiny things.

Eric Gregory06/17/13
1025 views
0 replies

Building a Data Science Platform in Scala

John A. De Goes, CTO of Precog, discusses PrecogDB -- a data science platform in Scala.

Ravi Kalakota06/17/13
1443 views
0 replies

NSA PRISM – The Mother of all Big Data Projects

As a data engineer and scientist, I have been following the NSA PRISM raw intelligence mining program with great interest. The engineering complexity, breadth and scale is simply amazing compared to say credit card analytics (Fair Issac) or marketing analytics firms like Acxiom.

Arthur Charpentier06/17/13
1247 views
0 replies

Visualizing Densities of Spatial Processes

We recently uploaded a revised version of our work, with Ewen Gallic on Visualizing spatial processes using Ripley’s correction: an application to bodily-injury car accident location.

Eric Gregory06/16/13
1410 views
0 replies

Data Science and Predictive Modeling at LinkedIn

Monica Rogati, Senior Data Scientist at LinkedIn, discusses data science and predictive modeling.

Anand Epl06/16/13
1380 views
0 replies

OCAJP 7 Object Lifecycle in Java

In the real-world, we can find so many objects around us, for example Cars, Birds, Humans etc. All these objects have a state and behavior. If we consider a Car then it have some data speed, lights on, direction, etc. and have some actions turn right, accelerate, turn lights on, etc.

Kai Wähner06/15/13
1986 views
0 replies

How to Create intelligent Business Processes Thanks to Big Data

BPM is established, tools are stable, many companies use it successfully. However, today’s business processes are based on data from relational databases or web services.

Pieter Humphey06/15/13
1712 views
0 replies

Targeting Big Data: Spring XD 1.0 Milestone 1 Released

Spring XD makes it easy to solve common big data problems such as data ingestion and export, real-time analytics, and batch workflow orchestration.

Todd Homa06/14/13
1728 views
0 replies

Cassandra Bulk CDC Extract

The development team moved their persistence layer from Oracle to Cassandra. How does the Data Warehouse team extract data for reporting?

John Cook06/14/13
2041 views
0 replies

How Many Lights Can You Turn On?

Suppose you have a large n × n grid of lights, some turned on and some turned off. Along the side of each row is a switch that can toggle the lights in that row, turning on lights that were originally off and vice versa. There are similar switches along the top that can toggle the lights in each column. How many lights can you turn on?

Eric Gregory06/14/13
1360 views
0 replies

How Data Scientists Solve Problems

This fifteen minute video from Troy Sadkowsky explores how data scientists approach problem-solving -- starting with recognizing your problem for what it is.

Nitin Aggarwal06/14/13
193 views
0 replies

Running Mediator Instances Issue - Oracle SOA 11g

We encountered an issue with one of our clients when the SOA Purge wasn’t being very effective due to the running mediator instances even though the rest of the flow trace had completed, This wasn’t an issue for business as such however in most cases caused them to fall out of the criteria for Purge due to the state in which these mediator instances were in.

John Cook06/13/13
1532 views
0 replies

Computing Skewness and Kurtosis in One Pass

If you compute the standard deviation of a data set by directly implementing the definition, you’ll need to pass through the data twice: once to find the mean, then a second time to accumulate the squared differences from the mean.

Nishant Chandra06/13/13
1741 views
0 replies

Graph Analytics: Discovering the Undiscovered!

Graph analysis and big data are overlapping areas and then I came across this piece of text which beautifully summarizes the difficulty of discovering the unknown.

color zhang06/13/13
306 views
0 replies

Getting Started with Oracle Event Processing 11g

You maybe have read CQL paper of Stanford STREAM research project. You may be inspired by the ideas of the latest ACM DEBS conference. It’s time to build and implement EDA applications to leverage the magic of event processing.

Nitin Aggarwal06/13/13
185 views
0 replies

SOA 11g SOA Infra DB states for SOA Composites and Components

I found this extremely useful list of states for SOA 11g composites and various components in the SOA INFRA DB which we have benefited greatly in various engagements, so I thought it was worth sharing with you all.

Christopher Taylor06/12/13
1163 views
0 replies

Streaming Data is the New Technology Frontier

What I think this merger represents is the creation of really a one-stop shop for everything that you need to deal with events and become an event-driven, real-time enterprise.

Mihai Dinca - P...06/12/13
1850 views
0 replies

NextReports 6.1 Released

NextReports Suite has reached version 6.1. Users can now create a CSV data source to create queries, reports and charts on data inside text files.