Home » Tag Archives: Big Data

Tag Archives: Big Data

The Essential Guide to Streaming-first Processing with Apache Flink

software-development-2-logo

Editor’s note: This is a post by Apache Flink PMC members Fabian Hueske and Kostas Tzoumas. Fabian and Kostas are also co-founders of data Artisans.  A very large part of today’s data processing is done on data that is continuously produced, e.g., data from user activity logs, web logs, machines, sensors, and database transactions. Until now, data streaming technology was lacking in several ...

Read More »

Bet Super Bowl 50 Like A Boss with Apache Spark

apache-spark-logo

This time, it’s personal. Super Bowl 50 is being played at Levi’s Stadium in Santa Clara – within sight of many of the world’s most innovative technology companies, including MapR. It’s the Silicon Valley Super Bowl* so it only makes sense that this will be the most over-analyzed event in history (at least until the next big game). Big events ...

Read More »

Streaming in the Extreme

software-development-2-logo

Are you ready to start streaming all the events in your business? What happens to your streaming solution when you outgrow your single data center? What happens when you are at a company that is already running multiple data centers and you need to implement streaming across data centers? What about when you need to scale to a trillion events ...

Read More »

Qualitative Data: The Context that Gives Meaning to Your Big Data

software-development-2-logo

Someone once said “if you can’t measure something, you can’t understand it.” Another version of this belief says: “If you can’t measure it, it doesn’t exist.” This is a false way of thinking – a fallacy – in fact it is sometimes called the McNamara fallacy. This mindset can have dire consequences in national affairs as well as in personal ...

Read More »

Top 10 Big Data Trends in 2016 for Financial Services

software-development-2-logo

2015 was a groundbreaking year for banking and financial markets firms, as they continue to learn how big data can help transform their processes and organizations. Now, with an eye towards what lies ahead for 2016, we see that financial services organizations are still at various stages of their activity with big data in terms of how they’re changing their ...

Read More »

MapReduce Design Patterns Implemented in Apache Spark

apache-spark-logo

This blog is a first in a series that discusses some design patterns from the book MapReduce design patterns and shows how these patterns can be implemented in Apache Spark(R). When writing MapReduce or Spark programs, it is useful to think about the data flows to perform a job. Even if Pig, Hive, Apache Drill and Spark Dataframes make it ...

Read More »

Introduction to Apache Spark with Examples and Use Cases

apache-spark-logo

I first heard of Spark in late 2013 when I became interested in Scala, the language in which Spark is written. Some time later, I did a fun data science project trying to predict survival on the Titanic. This turned out to be a great way to get further introduced to Spark concepts and programming. I highly recommend it for ...

Read More »

Changing the Game When it Comes to Auditing in Big Data – Part 2

software-development-2-logo

In my previous blog post we enabled auditing at the various levels of your MapR cluster. In this follow up post we will analyse the audit logs using Apache Drill to start answering questions like: Unauthorized cluster changes and data access Complying with regulatory frameworks and legislation Data usage heatmaps on cold, warm and hot data Data access analytics and ...

Read More »

Want to take your Java skills to the next level?

Grab our programming books for FREE!

Here are some of the eBooks you will get:

  • Advanced Java Guide
  • Java Design Patterns
  • JMeter Tutorial
  • Java 8 Features Tutorial
  • JUnit Tutorial
  • JSF Programming Cookbook
  • Java Concurrency Essentials