Big Data
-
Software Development
Efficiency Unleashed: A Comprehensive Guide to Streamlining Operations through Automated ETL Workflows
In the era of data-driven decision-making, the power to transform raw data into actionable insights lies at the heart of…
-
Enterprise Java
BigQuery Storage API: Avro
Previously we had an introduction to the BigQuery Storage API and proceeded to read data using the Arrow format. In…
-
Enterprise Java
BigQuery Storage API: Arrow
Previously we had an introduction to the BigQuery Storage API. As explained, the Storage API of BigQuery supports two formats.…
-
Enterprise Java
BigQuery Storage API: Get Started and Comparisons
BigQuery provides us with the Storage API for fast access using an RPC-based protocol. With this option you can receive…
-
Core Java
Apache Arrow on the JVM: Streaming Writes
Previously we created some schemas on Arrow. In this blog post we will have a look at writing through…
-
Core Java
Apache Arrow on the JVM: Get Started and Schemas
Arrow is a memory format for flat and hierarchical data. It is a popular format used by various big data tools,…
-
Software Development
Where is Apache Spark heading?
I watched (the COVID-19-era version of “attended”) the latest Spark Summit, and in one of the keynotes Reynold Xin from Databricks,…
-
Enterprise Java
Processing real-time data with Storm, Kafka and ElasticSearch – Part 1
This is an article about processing real-time data with Storm, Kafka and ElasticSearch. 1. Introduction How would you process a…
-
Enterprise Java
Popular frameworks for big data processing in Java
The big data challenge The concept of big data is understood differently in the variety of domains where companies face…