List/Grid Tag Archives: MapReduce

MapReduce for dummies
Continuing the coverage on Hadoop component, we will go through the MapReduce component. MapReduce is a concept that has been programming model of LISP. But before we jump into MapReduce, ...

Joins with Map Reduce
I have been reading on Join implementations available for Hadoop for past few days. In this post I recap some techniques I learnt during the process. The joins can be done at both Map ...

Word Count MapReduce with Akka
In my ongoing workings with Akka, i recently wrote an Word count map reduce example. This example implements the Map Reduce model, which is very good fit for a scale out design approach. ...

A SMALL cross-section of BIG Data
Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Big ...

Big Data analytics with Hive and iReport
Each J.J. Abrams’ TV series Person of Interest episode starts with the following narration from Mr. Finch one of the leading characters: “You are being watched. The government ...

Hadoop: A Soft Introduction
What is Hadoop:Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File ...

MapReduce: A Soft Introduction
MapReduce is a parallel programming technique made popular by Google. It is used for processing very large amounts of data. Such processing can be completed in a reasonable amount ...


