Home » Tag Archives: Apache Cassandra (page 4)

Tag Archives: Apache Cassandra

Connecting to Cassandra from Java

In my post Hello Cassandra, I looked at downloading the Cassandra NoSQL database and using cqlsh to connect to a Cassandra database. In this post, I look at the basics of connecting to a Cassandra database from a Java client. Although there are several frameworks available for accessing the Cassandra database from Java, I will use the DataStax Java Client ...

Read More »

Reactive Cassandra

Or an adventure on reading data reactively from Cassandra. Overview Let’s first try to define what reactive means from programming point of view. Functional reactive programming is programming paradigm for reactive programming using the building blocks of functional programming. Functional programming is a programming paradigm, a style of building the structure and the elements of computer programs, that treats computation, ...

Read More »

Crawling the Web with Cassandra and Nutch

So, you want to harvest a massive amount of data from the internet?  What better storage mechanism than Cassandra?  This is easy to do with Nutch. Often people use Hbase behind Nutch.  This works, but it may not be an ideal solution if you are (or want to be) a Cassandra shop.   Fortunately, Nutch 2+ uses the Gora abstraction layer ...

Read More »

Practical NoSQL experiences with Apache Cassandra

Most of the backend systems I’ve worked with over the years have employed relational database storage in some role. Despite many application developers complaining about RDBMS performance, I’ve found that with good design and implementation a relational database can actually scale a lot further than developers think. Often software developers who don’t really understand relational databases tend to blame the ...

Read More »

What is Big Data – Theory to Implementation

What is Big Data? You may ask; and more importantly why it is the latest trend in nearly every business domain? Is it just a hype or its here to stay? As a matter of fact “Big Data” is a pretty straightforward term – its just what its says – a very large data-set. How large? The exact answer is ...

Read More »

Running Cassandra in a Multi-node Cluster

This post gathers the steps I followed in setting up an Apache Cassandra cluster in multi-node. I have referred Cassandra wiki and Datastax documentation in setting up my cluster. The following procedure is expressed in details, sharing my experience in setting up the cluster. Setting up first node Adding other nodes Monitoring the cluster – nodetool, jConsole, Cassandra GUI I ...

Read More »

A SMALL cross-section of BIG Data

Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Big data sizes are a constantly moving target currently ranging from a few dozen terabytes to many petabytes of data in a single data set. IDC estimated the ...

Read More »

Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison

Kristóf Kovács performed a very interesting brief comparison among six of the most popular “NoSQL” database implementations. Additionally to Kristóf’s work, I would like to provide some links which I believe will help all of you who are interested in following the “NoSQL” community along : NoSQLDatabases – http://www.nosqldatabases.com/ myNoSQL – http://nosql.mypopescu.com/ NoSQL – http://nosql-database.org/ Best Regards Justin

Read More »