Theodora Fragkouli

About Theodora Fragkouli

Theodora has graduated from Computer Engineering and Informatics Department in the University of Patras. She also holds a Master degree in Economics from the National and Technical University of Athens. During her studies she has been involved with a large number of projects ranging from programming and software engineering to telecommunications, hardware design and analysis.

Spring for Apache Hadoop 2.0 M5

Spring has happily announced the Spring for Apache Hadoop 2.0 M5 milestone releases, while they are also getting much closer to a release candidate. In the Spring blog there is a good comparison between the new version 2.0 and the 1.0 version. According to it:

1.0 version of Spring for Apache Hadoop uses HDFS and MapReduce with either MapReduce v1 or MapReduce v2 (YARN). The default distribution is Apache Hadoop 1.2.1 with additional features, like Hadoop 2.2.0, Pivotal HD 1.1, Cloudera CDH4 MR1 or MR2 YARN and Hortonworks HDP 1.3.

On the other hand, Spring for Apache Hadoop 2.0 focuses in adding YARN application development support in addition to continue to provide improvements in the HDFS and MapReduce support. The default distribution for the 2.0 releases is Apache Hadoop 2.2.0.

Below you can see the specific artifacts with their respective transitive dependencies in the Spring IO milestone repository:

  • 2.0.0.M5 (default – Apache Hadoop stable 2.2.0)
  • 2.0.0.M5-hadoop12 (Apache Hadoop stable 1.2.1)
  • 2.0.0.M5-phd1 (Pivotal HD 1.1)
  • 2.0.0.M5-cdh4 (Cloudera CDH4 MR1)
  • 2.0.0.M5-cdh5 (Cloudera CDH5 YARN beta)
  • 2.0.0.M5-hdp20 (Hortonworks HDP 2.0)

Spring for Apache Hadoop 2.0 version also offers:

  • The support for YARN features in the new spring-yarn sub-project. With the spring-yarn framework Spring based applications can be developed and they can be deployed to run on Hadoop v2 using YARN.
  • YARN support with Spring Boot. Now Spring Boot applications can be deployed on YARN.
  • Annotation based programming model and annotation based configuration for Hadoop YARN features.

All new YARN features are in the YARN samples, so you can check them out there.

A spring-data-hadoop-store sub-project is also here to provide better support for writing data to HDFS using DataWriter and DataReader implementation supporting formats like text files and SequenceFiles with or without compression. The new sub-project also integrates with the Dataset support from Kite SDK.

For more project specific information please see the project page.

Related Whitepaper:

Functional Programming in Java: Harnessing the Power of Java 8 Lambda Expressions

Get ready to program in a whole new way!

Functional Programming in Java will help you quickly get on top of the new, essential Java 8 language features and the functional style that will change and improve your code. This short, targeted book will help you make the paradigm shift from the old imperative way to a less error-prone, more elegant, and concise coding style that’s also a breeze to parallelize. You’ll explore the syntax and semantics of lambda expressions, method and constructor references, and functional interfaces. You’ll design and write applications better using the new standards in Java 8 and the JDK.

Get it Now!  

Leave a Reply

+ six = 14

Java Code Geeks and all content copyright © 2010-2014, Exelixis Media Ltd | Terms of Use | Privacy Policy
All trademarks and registered trademarks appearing on Java Code Geeks are the property of their respective owners.
Java is a trademark or registered trademark of Oracle Corporation in the United States and other countries.
Java Code Geeks is not connected to Oracle Corporation and is not sponsored by Oracle Corporation.
Do you want to know how to develop your skillset and become a ...
Java Rockstar?

Subscribe to our newsletter to start Rocking right now!

To get you started we give you two of our best selling eBooks for FREE!

Get ready to Rock!
You can download the complementary eBooks using the links below: