Introduction to PySpark: Architecture Basics & Data Exploration

Published: 19 February 2021
Channel: Data Analysis Lab

In this class I cover the basics of Spark architecture and parallel processing: how data is partitioned, what makes up a Spark cluster, and how transformations differ from actions. I then show how to display and explore your data using grouped aggregations.
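
As a rough illustration of the transformation/action distinction and of partitioned, grouped aggregation, here is a minimal sketch in plain Python. It is a toy model, not Spark itself and not code from the class: the LazyDataset class, its methods, and the sample rows are all hypothetical names invented for this example. Transformations only record a step, and nothing runs until an action asks for a result.

```python
from collections import defaultdict


class LazyDataset:
    """Toy stand-in for a partitioned dataset (hypothetical, not the Spark API)."""

    def __init__(self, partitions, steps=()):
        self.partitions = partitions  # list of partitions, each a list of rows
        self.steps = tuple(steps)     # recorded transformations, not yet executed

    # Transformations: record the step and return a new dataset. No rows are touched.
    def filter(self, predicate):
        return LazyDataset(self.partitions, self.steps + (("filter", predicate),))

    def map(self, func):
        return LazyDataset(self.partitions, self.steps + (("map", func),))

    def _run_partition(self, rows):
        # Apply every recorded step to the rows of a single partition.
        for kind, func in self.steps:
            if kind == "filter":
                rows = [r for r in rows if func(r)]
            else:
                rows = [func(r) for r in rows]
        return rows

    # Actions: trigger the recorded work and return a concrete result.
    def collect(self):
        out = []
        for part in self.partitions:
            out.extend(self._run_partition(part))
        return out

    def count_by_key(self, key):
        # Grouped aggregation: count within each partition, then merge the partials.
        totals = defaultdict(int)
        for part in self.partitions:
            partial = defaultdict(int)
            for row in self._run_partition(part):
                partial[key(row)] += 1
            for k, v in partial.items():
                totals[k] += v
        return dict(totals)


calls = []  # records every row the predicate actually sees


def is_adult(row):
    calls.append(row["name"])
    return row["age"] >= 18


# Made-up sample rows, split across two partitions.
data = LazyDataset([
    [{"name": "Ann", "age": 34, "city": "Austin"},
     {"name": "Bo", "age": 15, "city": "Austin"}],
    [{"name": "Cy", "age": 52, "city": "Boston"},
     {"name": "Di", "age": 41, "city": "Austin"}],
])

adults = data.filter(is_adult)       # transformation: only recorded
calls_before_action = len(calls)     # the predicate has not run yet

counts = adults.count_by_key(lambda r: r["city"])  # action: work happens here
print(calls_before_action, counts)
```

In PySpark itself the same split applies: DataFrame methods such as filter and groupBy are transformations that build up a plan, while methods such as show, count, and collect are actions that cause the cluster to execute it across the partitions.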

To access the code, data, and course materials, visit https://kelseyemnett.com/2021/02/20/s....

