Introduction to PySpark: Data Manipulation Basics

Published: 25 March 2021
on channel: Data Analysis Lab

In this session we cover PySpark basics: reading in data, filtering rows, joining tables, selecting and dropping columns, and creating new columns. We also cover Koalas, which lets you write pandas-style Python code in Databricks while the work runs in parallel on Spark.

To access the code, data, and course materials, visit https://kelseyemnett.com/2021/03/26/s....

