Home » Tag Archives: Apache HBase

Tag Archives: Apache HBase

Seven Databases in Seven Weeks – Hbase Day 2

apache-hbase-logo

This post is a recap of the second day of Hbase from the Seven Databases in Seven Weeks book. Most of the commands and scripts can be found at GitHub: https://github.com/eyalgo/seven-dbs-in-seven-weeks/tree/master/hbase/day_2 Streaming Script The first thing in day 2 was to download lots of data (big data) and stream it into Hbase. There’s a JRuby script, which I had to ...

Read More »

Apache Karaf meets Apache HBase

apache-hbase-logo

Introduction Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google’s Bigtable. If you are a regular reader most probably you already know what Apache Karaf is, but for those who are not: Apache Karaf is an OSGi runtime that runs on top of any OSGi framework and provides you a set of services, a powerful provisioning concept, an extensible ...

Read More »

What is Big Data – Theory to Implementation

jcg-logo

What is Big Data? You may ask; and more importantly why it is the latest trend in nearly every business domain? Is it just a hype or its here to stay? As a matter of fact “Big Data” is a pretty straightforward term – its just what its says – a very large data-set. How large? The exact answer is ...

Read More »

Apache Bigtop – Installing Hive, HBase and Pig

apache-bigtop-logo

In the previous post we learnt how easy it was to install Hadoop with Apache Bigtop! We know its not just Hadoop and there are sub-projects around the table! So, lets have a look at how to install Hive, Hbase and Pig in this post. Before rowing your boat… Please follow the previous post and get ready with Hadoop installed! ...

Read More »

A SMALL cross-section of BIG Data

apache-cassandra-logo

Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Big data sizes are a constantly moving target currently ranging from a few dozen terabytes to many petabytes of data in a single data set. IDC estimated the ...

Read More »

Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison

apache-cassandra-logo

Kristóf Kovács performed a very interesting brief comparison among six of the most popular “NoSQL” database implementations. Additionally to Kristóf’s work, I would like to provide some links which I believe will help all of you who are interested in following the “NoSQL” community along : NoSQLDatabases – http://www.nosqldatabases.com/ myNoSQL – http://nosql.mypopescu.com/ NoSQL – http://nosql-database.org/ Best Regards Justin

Read More »
Want to take your Java Skills to the next level?
Grab our programming books for FREE!
  • Save time by leveraging our field-tested solutions to common problems.
  • The books cover a wide range of topics, from JPA and JUnit, to JMeter and Android.
  • Each book comes as a standalone guide (with source code provided), so that you use it as reference.
Last Step ...

Where should we send the free eBooks?

Good Work!
To download the books, please verify your email address by following the instructions found on the email we just sent you.