Swathi V

About Swathi V

Loves Art and Technology! Would like to blog and share.. Involved in Apache Hadoop and its ecosystem. Eager to be a part of Big Data Revolution.

Apache Bigtop – Installing Hive, HBase and Pig

In the previous post we learnt how easy it was to install Hadoop with Apache Bigtop!
We know its not just Hadoop and there are sub-projects around the table! So, lets have a look at how to install Hive, Hbase and Pig in this post.

Before rowing your boat

Please follow the previous post and get ready with Hadoop installed!
Follow the link for previous post:
http://femgeekz.blogspot.in/2012/06/hadoop-hangover-introduction-to-apache.html
also, the same can be found at DZone, developer site: http://www.dzone.com/links/hadoop_hangover_introduction_to_apache_bigtop_and.html

All Set?? Great! Head On..
Make sure all the services of Hadoop are running. Namely, JobTracker, SecondaryNameNode, TaskTracker, DataNode and NameNode. [standalone mode]

Hive with Bigtop:

The steps here are almost the same as Installing Hive as a separate project.
However, few steps are reduced.
The Hadoop installed in the previous post is Release 1.0.1

We had installed Hadoop with the following command

sudo apt-get install hadoop\*
                    

Step 1: Installing Hive

We have installed Bigtop 0.3.0, and so issuing the following command installs all the hive components.
ie. hive, hive-metastore, hive-server. The daemons names are different in Bigtop 0.3.0.

sudo apt-get install hive\*

This installs all the hive components. After installing, the scripts must be able to create /tmp and /usr/hive/warehouse and HDFS doesn’t allow these to be created while installing as it is unaware of the path to Java. So, create the directories if not created and grant the execute permissions.
In the hadoop directory, ie. /usr/lib/hadoop/

bin/hadoop fs -mkdir /tmp
bin/hadoop fs -mkdir /user/hive/warehouse
bin/hadoop -chmod g+x /tmp
bin/hadoop -chmod g+x /user/hive/warehouse

Step 2: The alternative directories could be /var/run/hive and /var/lock/subsys

sudo mkdir /var/run/hive
sudo mkdir /var/lock/subsys

Step 3: Start the hive server, a daemon

sudo /etc/init.d/hive-server start

Image:

Step 4: Running Hive
Go-to the directory /usr/lib/hive.
See the Image below:
bin/hive

Step 5: Operations on Hive

Image:

HBase with Bigtop:

Installing Hbase is similar to Hive.

Step 1: Installing HBase

sudo apt-get install hbase\*

Image:

Step 2: Starting HMaster

sudo service hbase-master start

Image:

Image:

Step 3: Starting HBase shell

hbase shell          

Image:

Step 4: HBase Operations
Image:

Image:

Pig with Bigtop:

Installing Pig is similar too.

Step 1: Installing Pig

sudo apt-get install pig

Image:

Step 2: Moving a file to HDFS
Image:

Step 3: Installed Pig-0.9.2
Image:

Step 4: Starting the grunt shell

pig

Image:

Step 5: Pig Basic Operations
Image:

Image:

We saw that is it possible to install the subprojects and work with Hadoop, with no issues.
Apache Bigtop has its own spark! :)

There is a release coming BIGTOP-0.4.0 which is supposedly to fix the following issues:

https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12318889&styleName=Html&projectId=12311420

Source and binary files:

http://people.apache.org/~rvs/bigtop-0.4.0-incubating-RC0

Maven staging repo:

https://repository.apache.org/content/repositories/orgapachebigtop-279

Bigtop’s KEYS file containing PGP keys we use to sign the release:

http://svn.apache.org/repos/asf/incubator/bigtop/dist/KEYS

Let us see how to install other sub-projects in the coming posts!
Until then, Happy Learning!

Reference: Hadoop Hangover: Introduction To Apache Bigtop and Installing Hive, HBase and Pig from our JCG partner Swathi V at the * Techie(S)pArK * blog.

Do you want to know how to develop your skillset to become a Java Rockstar?

Subscribe to our newsletter to start Rocking right now!

To get you started we give you two of our best selling eBooks for FREE!

JPA Mini Book

Learn how to leverage the power of JPA in order to create robust and flexible Java applications. With this Mini Book, you will get introduced to JPA and smoothly transition to more advanced concepts.

JVM Troubleshooting Guide

The Java virtual machine is really the foundation of any Java EE platform. Learn how to master it with this advanced guide!

Given email address is already subscribed, thank you!
Oops. Something went wrong. Please try again later.
Please provide a valid email address.
Thank you, your sign-up request was successful! Please check your e-mail inbox.
Please complete the CAPTCHA.
Please fill in the required fields.

Leave a Reply


3 + = eight



Java Code Geeks and all content copyright © 2010-2014, Exelixis Media Ltd | Terms of Use | Privacy Policy
All trademarks and registered trademarks appearing on Java Code Geeks are the property of their respective owners.
Java is a trademark or registered trademark of Oracle Corporation in the United States and other countries.
Java Code Geeks is not connected to Oracle Corporation and is not sponsored by Oracle Corporation.
Do you want to know how to develop your skillset and become a ...
Java Rockstar?

Subscribe to our newsletter to start Rocking right now!

To get you started we give you two of our best selling eBooks for FREE!

Get ready to Rock!
You can download the complementary eBooks using the links below:
Close