Home » Tag Archives: Apache Cassandra

Tag Archives: Apache Cassandra

Tuning Hadoop & Cassandra : Beware of vNodes, Splits and Pages

apache-cassandra-logo

When running Hadoop jobs against Cassandra, you will want to be careful about a few parameters. Specifically, pay special attention to vNodes, Splits and Page Sizes. vNodes were introduced in Cassandra 1.2. vNodes allow a host to have multiple portions of the token range.  This allows for more evenly distributed data, which means nodes can share the burden of a ...

Read More »

High-Performance Computing Clusters (HPCC) and Cassandra on OS X

apache-cassandra-logo

Our new parent company, LexisNexis, has one of the world’s largest public records database: “…our comprehensive collection of more than 46 billion records from more than 10,000 diverse sources—including public, private, regulated, and derived data. You get comprehensive information on approximately 269 million individuals and 277 million unique businesses.” http://www.lexisnexis.com/en-us/products/public-records.page And they’ve been managing, analyzing and searching this database for ...

Read More »

Apache Cassandra and Low-Latency Applications

apache-cassandra-logo

Introduction Over the years, Grid Dynamics has had many projects related to NoSQL, particularly Apache Cassandra. In this post, we want to discuss a project which brought exciting challenges to us, and questions we tried to answer in that project remain relevant today, as well. Digital marketing and online ads were popular in 2012, and a demand for them has ...

Read More »

How to install 3 popular NoSQL databases for development in MacOSX for newbies Part 2 (Cassandra)

apache-cassandra-logo

Part 1(Mongodb) Part 2 (Cassandra) Part 3 (CouchBase) In this post we will install on our local MacOSX (Yosemite) environment, another famous NoSQL database, the Apache Cassandra. As a developer, I found Datastax, community packaging of Cassandra, very handy, seems they offer ready made MacOSX specific packages. You can of course download and install the the tarballs from Apache or ...

Read More »

Getting Started with Apache Cassandra and Java (Part II)

apache-cassandra-logo

Requirements To follow this tutorial, you should already have a running instance of Cassandra (a small cluster would be good, but not necessary), the Datastax Java driver installed (refer to Part I), and have gone through the 10 minute walkthrough here: http://planetcassandra.org/create-a-keyspace-and-table/.           Try it out For this demo, we’re going to be creating a simple ...

Read More »

Getting Started with Apache Cassandra and Java (Part I)

apache-cassandra-logo

On this page, you’ll learn just enough to get started with NoSQL Apache Cassandra and Java, including how to install, try out some basic commands and what to do next. Requirements To follow this tutorial, you should already have a running Cassandra instance, and have gone through the 10 minute walkthrough here: http://planetcassandra.org/create-a-keyspace-and-table/. If you’ve already reviewed part I, be ...

Read More »

Custom Cassandra Data Types

apache-cassandra-logo

In the blog post Connecting to Cassandra from Java, I mentioned that one advantage for Java developers of Cassandra being implemented in Java is the ability to create custom Cassandra data types. In this post, I outline how to do this in greater detail. Cassandra has numerous built-in data types, but there are situations in which one may want to ...

Read More »

Connecting to Cassandra from Java

apache-cassandra-logo

In my post Hello Cassandra, I looked at downloading the Cassandra NoSQL database and using cqlsh to connect to a Cassandra database. In this post, I look at the basics of connecting to a Cassandra database from a Java client. Although there are several frameworks available for accessing the Cassandra database from Java, I will use the DataStax Java Client ...

Read More »

Reactive Cassandra

apache-cassandra-logo

Or an adventure on reading data reactively from Cassandra. Overview Let’s first try to define what reactive means from programming point of view. Functional reactive programming is programming paradigm for reactive programming using the building blocks of functional programming. Functional programming is a programming paradigm, a style of building the structure and the elements of computer programs, that treats computation, ...

Read More »

Crawling the Web with Cassandra and Nutch

apache-cassandra-logo

So, you want to harvest a massive amount of data from the internet?  What better storage mechanism than Cassandra?  This is easy to do with Nutch. Often people use Hbase behind Nutch.  This works, but it may not be an ideal solution if you are (or want to be) a Cassandra shop.   Fortunately, Nutch 2+ uses the Gora abstraction layer ...

Read More »
Want to take your Java Skills to the next level?
Grab our programming books for FREE!
  • Save time by leveraging our field-tested solutions to common problems.
  • The books cover a wide range of topics, from JPA and JUnit, to JMeter and Android.
  • Each book comes as a standalone guide (with source code provided), so that you use it as reference.
Last Step ...

Where should we send the free eBooks?

Good Work!
To download the books, please verify your email address by following the instructions found on the email we just sent you.