In this video, we'll see how to scrape complex webpages in Python with the Playwright framework, including pages that use hydration (i.e., pages that load their data in the browser using JavaScript).
This lets you scrape websites that cannot be scraped using simpler tools such as the requests library.
Using Playwright, you can also take screenshots, scrape websites that require authentication, and much more. Playwright is a modern alternative to Selenium. It is comparable to Puppeteer, except that Playwright is not JavaScript-only: it also provides official Python bindings.
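To give a rough idea of what this looks like in practice, here is a minimal sketch (not the code from the video). It assumes Playwright's synchronous API and a Chromium browser installed via `playwright install`. The names `contains_tag` and `fetch_rendered_html`, the default `"table"` selector, and the timeout are hypothetical choices made for illustration.

```python
from html.parser import HTMLParser


class _TagFinder(HTMLParser):
    """Records whether a given tag appears anywhere in the HTML."""

    def __init__(self, tag: str) -> None:
        super().__init__()
        self.tag = tag
        self.found = False

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.found = True


def contains_tag(html: str, tag: str = "table") -> bool:
    """Return True if `tag` is present in the HTML as delivered.

    On a hydrated page, the raw HTML fetched by a plain HTTP client
    often lacks the data elements, because JavaScript adds them later.
    """
    finder = _TagFinder(tag)
    finder.feed(html)
    return finder.found


def fetch_rendered_html(url: str, selector: str = "table",
                        timeout_ms: int = 10_000) -> str:
    """Load `url` in headless Chromium and return the HTML after
    `selector` has appeared (hypothetical helper, for illustration)."""
    # Imported lazily so contains_tag() works without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url)
            # Wait until the JavaScript-rendered element exists in the DOM.
            page.wait_for_selector(selector, timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()
```

A possible workflow: run `contains_tag` on the HTML returned by a simple HTTP request first, and fall back to `fetch_rendered_html` only when the element you need is missing.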
👍 Please like if you found this video helpful, and subscribe to stay updated with my latest tutorials. 🔔
The code is available here: https://github.com/Vincent-Codes-Fina...
❤️ You can support this channel by buying me a ☕: https://buymeacoffee.com/codesfinance
🔖 Chapters:
00:00 Intro
02:17 Scraping Using Requests
05:16 Scraping Using Playwright
09:13 Converting Table to DataFrame
10:46 Screenshot
12:12 Other Features
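For the "Converting Table to DataFrame" and "Screenshot" chapters, the following is a small sketch of one possible approach, not necessarily the one shown in the video. It assumes the rendered HTML contains a single simple table whose first row is the header, and that pandas and Playwright are installed. The function names `table_to_rows`, `rows_to_dataframe`, and `save_screenshot` are hypothetical.

```python
from __future__ import annotations

from html.parser import HTMLParser


class _TableRows(HTMLParser):
    """Collects the text of every <th>/<td> cell, grouped by <tr>."""

    def __init__(self) -> None:
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def table_to_rows(html: str) -> list[list[str]]:
    """Extract table rows as lists of cell strings (no nested tables)."""
    parser = _TableRows()
    parser.feed(html)
    return parser.rows


def rows_to_dataframe(rows: list[list[str]]):
    """Build a DataFrame, assuming the first row holds the column names."""
    import pandas as pd  # lazy import: only needed for this step

    return pd.DataFrame(rows[1:], columns=rows[0])


def save_screenshot(url: str, path: str = "page.png") -> None:
    """Save a full-page screenshot of `url` using headless Chromium."""
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url)
            page.screenshot(path=path, full_page=True)
        finally:
            browser.close()
```

All cell values come back as strings, so numeric columns would still need converting (for example with `pandas.to_numeric`) before analysis.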
🔗 Video links:
Playwright: https://playwright.dev/
🐍 More Vincent Codes Finance:
✍🏻 Blog: https://vincent.codes.finance
🐦 X: / codesfinance
🧵 Threads: https://www.threads.net/@codesfinance
😺 GitHub: https://github.com/Vincent-Codes-Finance
📘 Facebook: / 61559283113665
👨💼 LinkedIn: / vincent-codes-finance
🎓 Academic website: https://www.vincentgregoire.com/
#scraping #python #playwright #programming #code #nlp #opensource #pandas #puppeteer #selenium #bigdata #research #researchtips #vscode #professor #datascience #dataanalytics #dataanalysis #webscraping
The video "Playwright: Advanced Web Scraping in Python" was added by Vincent Codes Finance on 16 June 2024. On this site it has been viewed 1,109 times and liked by 50 people.