Apache Spark
This gallery shows examples of how to use Apache Spark with SWAN.
See also the online, self-paced training course on Apache Spark.
Spark for data processing, analytics, and machine learning
Tutorial on Spark DataFrame API
![Click to open this example](/notebooks/SparkTraining/notebooks/Tutorial-DataFrame.png)
Tutorial on Spark SQL
![Click to open this example](/notebooks/SparkTraining/notebooks/Tutorial-SparkSQL.png)
Exercises on Spark SQL
![Click to open this example](/notebooks/SparkTraining/notebooks/HandsOn-SparkSQL_with_solutions.png)
JDBC Tutorial - Reading from Oracle using Spark
![Click to open this example](/notebooks/SparkTraining/notebooks/Spark_JDBC_Oracle.png)
Machine Learning with Apache Spark - Classifier
![Click to open this example](/notebooks/SparkTraining/notebooks/ML_Demo1_Classifier.png)
Machine Learning with Apache Spark - Regression
![Click to open this example](/notebooks/SparkTraining/notebooks/ML_Demo2_Regression.png)
Running Spark on CERN Hadoop clusters
Analytix cluster: Example of Spark on Hadoop
![Click to open this example](/notebooks/SparkTraining/notebooks/Demo_Spark_on_Hadoop.png)
Analytix cluster: How to run the TPC-DS benchmark at scale with Spark and Hadoop
![Click to open this example](/notebooks/SparkTraining/notebooks/TPCDS_PySpark_CERN_SWAN_getstarted.png)
NXCALS: Spark for analyzing LHC logging data
![Click to open this example](/notebooks/SparkTraining/notebooks/NXCals-example.png)
NXCALS: Spark vector data and timestamps
![Click to open this example](/notebooks/SparkTraining/notebooks/NXCals-example_bis.png)