List/Grid Tag Archives: Big Data

Hadoop setup on single node and multi node
We will describe Hadoop setup on single node and multi node. The Hadoop environment setup and configuration will be described in details. First you need to download the following ...

Monitoring S3 uploads for a real time data
If you are working on Big Data and its bleeding edge technologies like Hadoop etc., the primary thing you need is a “dataset” to work on. So, this data can be reviews, blogs, ...

Hope vs. Motivation: Why Big Data needs empathy and emotion
Because, – says Om Malik, one of the most extraordinary thinkers on Silicon Valley and the founder of GIGAOM - The problem with data is that the way it is used today, it lacks ...

Big Data 2013 Predictions
If you just invested a lot of money in a Big Data solution from any of the traditional BI vendors (Teradata, IBM, Oracle, SAS, EMC, HP, etc.) then you are likely to see a sub-optimal ...

Hadoop Hangover: Launch a hadoop cluster CDH4 using Apache Whirr
This post is about how-to launch a CDH4 MRv1 or CDH4 Yarn cluster on EC2 instances. It’s said that you can launch a cluster with the help of Whirr and in a matter of 5 minutes! ...

MapReduce Algorithms – Secondary Sorting
We continue with our series on implementing MapReduce algorithms found in Data-Intensive Text Processing with MapReduce book. Other posts in this series:Working Through Data-Intensive ...

Calculating A Co-Occurrence Matrix with Hadoop
This post continues with our series of implementing the MapReduce algorithms found in the Data-Intensive Text Processing with MapReduce book. This time we will be creating a word co-occurrence ...

How to be Big Data-native?
Big data has spawned a set of tools that deliver results beyond the buzz. It has started delivering real insights for companies, which result in more effective decisions. When middleware ...

Hadoop + Amazon EC2 – An updated tutorial
There is an old tutorial placed at Hadoop’s wiki page: http://wiki.apache.org/hadoop/AmazonEC2, but recently I had to follow this tutorial and I noticed that it doesn’t ...


