Search Results for: spark

Python

Yatin BatraApril 3rd, 2024
0 69

PySpark – Create Empty Dataframe and RDD

DataFrames and RDDs (Resilient Distributed Datasets) are fundamental abstractions in Apache Spark, a powerful distributed computing framework. Let us delve…
Read More »
Software Development

Yatin BatraJanuary 22nd, 2024
0 196

Apache Spark: Unleashing Big Data Power

1. Introduction Apache Spark is a powerful open-source, distributed computing system that has become a cornerstone in the world of…
Read More »
Java Code GeeksAugust 15th, 2023
29

Apache Spark Cheatsheet

This cheatsheet is designed to provide quick access to the most commonly used Spark components, methods, and practices. Whether you’re…
Read More »
Software Development

Odysseas MourtzoukosAugust 11th, 2023
0 929

Apache Spark Cheatsheet

1. Introduction to Apache Spark 1.1 What is Apache Spark? Apache Spark is an open-source, distributed computing system designed for…
Read More »
Core Java

Venkatesh NukalaJune 13th, 2021
0 923

Java Spark RDD reduce() Examples – sum, min and max operations

A quick guide to explore the Spark RDD reduce() method in java programming to find sum, min and max values…
Read More »
Software Development

Arnon Rotem Gal OzDecember 15th, 2020
0 50

Where is Apache Spark heading?

I watched (COVID19-era version of “attended”) the latest spark Summit and in one of the keynotes Reynold Xin from Databricks,…
Read More »
Enterprise Java

Ederson CorbariOctober 8th, 2019
0 89

Recommendation System Using Spark ML Akka and Cassandra

Building a recommendation system with Spark is a simple task. Spark’s machine learning library already does all the hard work…
Read More »
Enterprise Java

Guglielmo IozziaMay 9th, 2019
0 188

The Kubernetes Spark operator in OpenShift Origin (Part 1)

This series is about the Kubernetes Spark operator by Radanalytics.io onOpenShift Origin. It is an Open Source operator to manageApache…
Read More »
Enterprise Java

Guglielmo IozziaFebruary 1st, 2019
1 377

Sparklens: a tool for Spark applications optimization

Sparklens is a profiling tool for Spark with a built-in Spark Scheduler simulator: it makes easier to understand the scalability…
Read More »

1
2
3
»
10
20
...
Last

Thank you!