Home » Apache Kafka » Page 2

Tag Archives: Apache Kafka

Machine Learning Trends of 2018 combined with the Apache Kafka Ecosystem

java-interview-questions-answers

At OOP 2018 conference in Munich, I presented an updated version of my talk about building scalable, mission-critical microservices with the Apache Kafka ecosystem and Deep Learning frameworks like TensorFlow, DeepLearning4J or H2O. I want to share the updated slide deck and discuss a few updates about newest trends, which I incorporated into the talk. The main story is the ...

Read More »

Introduction to Apache Kafka

java-interview-questions-answers

What is Apache Kafka? Apache Kafka is a distributed streaming system with publish and subscribe the stream of records. In another aspect it is an enterprise messaging system. It is highly fast, horizontally scalable and fault tolerant system. Kafka has four core APIs called, Producer API: This API allows the clients to connect to Kafka servers running in cluster and ...

Read More »

Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

java-interview-questions-answers

I started at Confluent in May 2017 to work as Technology Evangelist focusing on topics around the open source framework Apache Kafka. I think Machine Learning is one of the hottest buzzwords these days as it can add huge business value in any industry. Therefore, you will see various other posts from me around Apache Kafka (messaging), Kafka Connect (integration), ...

Read More »

Why I Move (Back) to Open Source for Messaging, Integration and Stream Processing

After three great years at TIBCO Software, I move back to open source and join Confluent, a company focusing on the open source project Apache Kafka to build mission-critical, scalable infrastructures for messaging, integration and streaming analytics. Confluent is a Silicon Valley startup, still in the beginning of its journey, with a 700% growing business in 2016, and is expected ...

Read More »

Important Production bugs and fixes for Storm and Kafka integration

java-interview-questions-answers

I will describe here a few details for Storm and Kafka integration modules, a few important bugs that you should be aware and how to overcome some of them (especially for production installations). I am heavily using Apache Storm in production installations with Kafka as my main input source (Spout). Storm integration modules with Kafka and versions: Storm 0.x supports ...

Read More »

Perfecting Lambda Architecture with Oracle Data Integrator (and Kafka / MapR Streams)

java-interview-questions-answers

“Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch– and stream-processing methods. This approach to architecture attempts to balance latency, throughput, and fault-tolerance by using batch processing to provide comprehensive and accurate views of batch data, while simultaneously using real-time stream processing to provide views of online data. The two view outputs may be joined before presentation. The rise ...

Read More »

Getting Started with Kafka REST Proxy for MapR Streams

java-interview-questions-answers

MapR Ecosystem Package 2.0 (MEP) is coming with some new features related to MapR Streams: Kafka REST Proxy for MapR Streams provides a RESTful interface to MapR Streams and Kafka clusters, making it easy to consume and produce messages as well as perform administrative operations. Kafka Connect for MapR Streams is a utility for streaming data between MapR Streams and Apache Kafka ...

Read More »

Getting Started With Kafka REST Proxy for MapR Streams

java-interview-questions-answers

Introduction MapR Ecosystem Package 2.0 (MEP) is coming with some new features related to MapR Streams: Kafka REST Proxy for MapR Streams provides a RESTful interface to MapR Streams and Kafka clusters to consume and product messages and to perform administrative operations. Kafka Connect for MapR Streams is a utility for streaming data between MapR Streams and Apache Kafka and ...

Read More »

Performance Tuning of an Apache Kafka/Spark Streaming System

Real-world case study in the telecom industry Debugging a real-life distributed application can be a pretty daunting task. Most common Google searches don’t turn out to be very useful, at least at first. In this blog post, I will give a fairly detailed account of how we managed to accelerate by almost 10x an Apache Kafka/Spark Streaming/Apache Ignite application and ...

Read More »