Databricks Platform Features - Deep Dive into Delta Lake using PySpark | Data Engineering

Published: 01 January 1970
on channel: itversity
5,311
82

Delta Lake is a key feature of the Databricks Platform, extensively used to build Enterprise Data Lakehouses on cloud platforms like AWS, Azure, and GCP. In this video, you’ll learn how to use Delta Lake with Spark DataFrames and Python to perform operations like data updates, deletes, merges, and snapshot recovery.

What You’ll Learn:
• Creating DataFrames for Delta Lake
• Writing and reading data in Delta format
• Performing updates, deletes, and merges
• Implementing point-in-time recovery with Delta logs
• Cleaning up Delta files using Vacuum and compaction

Explore More with Our Udemy Courses:

🟢 Data Engineering using Databricks on AWS and Azure (BESTSELLER)
Learn key Databricks features like Delta Lake, Databricks Jobs, and Clusters with real-world demos.
👉 https://www.udemy.com/course/data-eng...

🟢 Databricks Certified Associate Developer - Apache Spark 2022 (NEW & HOT)
Prepare for the Databricks Certified Developer Exam with hands-on PySpark training and exam tips.
👉 https://www.udemy.com/course/databric...

0:00:00 — Introduction to Delta Lake using Data Frames on Databricks Platform
0:02:59 — Creating Data Frames for Delta Lake on Databricks Platform
0:09:16 — Writing Data Frame using Delta Format on Databricks Platform
0:13:56 — Updating Existing Data using Delta Format on Databricks Platform
0:19:55 — Delete Existing Data using Delta Format on Databricks Platform
0:24:45 — Merge or Upsert Data using Delta Format on Databricks Platform
0:34:34 — Deleting using Merge in Delta Lake on Databricks Platform
0:42:26 — Point in Snapshot Recovery using Delta Logs on Databricks Platform
0:50:46 — Deleting unnecessary Delta Files using Vacuum on Databricks Platform
1:00:09 — Compaction of Delta Lake Files on Databricks Platform

Helpful Resources:

Are you interested in learning Delta Lake using Spark SQL? Watch this video:
👉    • Databricks Platform Features -  Deep ...  
Complete playlist on Databricks Features:
👉    • Data Engineering using Databricks - D...  

For material and support, sign up for our Udemy course:
👉 https://www.udemy.com/course/data-eng...

Stay connected with ITVersity for updates:
• Newsletter: http://notifyme.itversity.com
• LinkedIn:   / itversity  
• Facebook:   / itversity  
• Twitter:   / itversity  
• Instagram:   / itversity  
• YouTube:    / itversityin  

🔔 Subscribe to ITVersity for more tutorials:
https://www.youtube.com/channel/itver...

#dataengineering #bigdata #analytics #databricks #spark #cloudcomputing


Watch video Databricks Platform Features - Deep Dive into Delta Lake using PySpark | Data Engineering online without registration, duration hours minute second in high quality. This video was added by user itversity 01 January 1970, don't forget to share it with your friends and acquaintances, it has been viewed on our site 5,311 once and liked it 82 people.