2020-04-29 | Watch the video | This folder contains the presentation and sample notebooks
Join us for Part 4 of our online learning series: Introduction to Data Analysis for Aspiring Data Scientists. This is the final workshop in the series, which is open to anyone and everyone interested in learning about data analysis.
Part 4: Introduction to Apache Spark
Abstract: This workshop covers the fundamentals of Apache Spark, one of the most popular big data processing engines. You will learn how to ingest data with Spark, explore the Spark UI, and gain a better understanding of distributed computing. We will use data released by the New York Times. No prior knowledge of Spark is required, but Python experience is highly recommended.
Who should attend this workshop: Anyone and everyone. CS students and even non-technical folks are welcome to join.