In this video I explain how you can scale python pandas to handle millions of records using libraries like Dask and Modin. I also show that if your dataset can fit into main memory then pandas is much faster than Dask and Modin. Dask and Modin are better suited to distributed computing scenarios.
If you like such content please subscribe to the channel here: https://www.youtube.com/c/RitheshSree...
If you like to support me financially, It is totally optional and voluntary. Buy me a coffee here: https://www.buymeacoffee.com/rithesh
Relevant Links:
https://www.datarevenue.com/en-blog/p...
https://modin.readthedocs.io/en/stable/
https://docs.dask.org/en/stable/dataf...
Watch video Scaling Python Pandas for handling millions of records: Dask , Modin online without registration, duration hours minute second in high quality. This video was added by user Rithesh Sreenivasan 09 December 2021, don't forget to share it with your friends and acquaintances, it has been viewed on our site 2,506 once and liked it 46 people.