Here we look at how to correct our scraped data when we run into issues such as duplicate data and missing data.
We also look at how to use Scrapy Item Loaders and Scrapy Item Pipelines to process our data before we save it.
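As a rough idea of what the Item Loader part looks like, here is a minimal sketch. It assumes a hypothetical ProductItem with name and price fields and a hypothetical strip_pound helper; the names used in the video's own code may differ. The Scrapy import is guarded so the helper can still be tried without Scrapy installed.

```python
def strip_pound(value: str) -> str:
    """Remove the pound symbol and surrounding whitespace from a price string."""
    return value.replace("£", "").strip()


try:
    import scrapy
    from itemloaders.processors import MapCompose, TakeFirst
    from scrapy.loader import ItemLoader
except ImportError:
    # Scrapy is not installed; only the plain helper above is available.
    ItemLoader = None

if ItemLoader is not None:

    class ProductItem(scrapy.Item):
        # Hypothetical item; the field names are assumptions for illustration.
        name = scrapy.Field()
        price = scrapy.Field()

    class ProductLoader(ItemLoader):
        default_item_class = ProductItem
        # Keep the first extracted value instead of a list of values.
        default_output_processor = TakeFirst()
        # Input processor: runs strip_pound on every value added to "price".
        price_in = MapCompose(strip_pound)

    # Inside a spider callback this could be used roughly like so
    # (the CSS selectors are placeholders, not taken from the video):
    #   loader = ProductLoader(selector=product)
    #   loader.add_css("name", "a.product-title::text")
    #   loader.add_css("price", "span.price::text")
    #   yield loader.load_item()
```

Putting the cleanup in the loader keeps the spider itself focused on finding the data, while the formatting rules live in one place.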
Useful links:
The GitHub repo where you can download the code:
https://github.com/ScrapeOps/python-s...
The article which corresponds to this video:
https://thepythonscrapyplaybook.com/s...
00:00 - Intro
01:17 - Using Scrapy Items to better control our data
05:00 - Using Scrapy Item Loaders to remove a symbol from our scraped prices
14:15 - Using Scrapy Item Pipelines, converting pounds to dollars and removing duplicates
24:35 - What's coming in part 3
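For the Item Pipelines chapter, the two steps (converting pounds to dollars and removing duplicates) could be sketched roughly as below. The exchange rate, the class names, and the choice to treat the item name as the duplicate key are all assumptions for illustration, not necessarily what the video uses. The DropItem fallback is only there so the sketch runs without Scrapy installed.

```python
try:
    from scrapy.exceptions import DropItem
except ImportError:
    class DropItem(Exception):
        """Stand-in so the sketch runs without Scrapy installed."""


GBP_TO_USD = 1.3  # assumed, illustrative exchange rate


class PriceToUSDPipeline:
    """Convert a price in pounds to dollars; drop items that have no price."""

    def process_item(self, item, spider):
        if item.get("price"):
            item["price"] = round(float(item["price"]) * GBP_TO_USD, 2)
            return item
        raise DropItem(f"Missing price in {item!r}")


class DuplicatesPipeline:
    """Drop any item whose name has already been seen during this crawl."""

    def __init__(self):
        self.names_seen = set()

    def process_item(self, item, spider):
        name = item.get("name")
        if name in self.names_seen:
            raise DropItem(f"Duplicate item found: {name!r}")
        self.names_seen.add(name)
        return item
```

In a real project, pipelines like these are switched on through the ITEM_PIPELINES setting in settings.py, where the number given to each pipeline decides the order in which they run.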
Video: Cleaning & Post-Processing Scrapy Data - Python Scrapy Beginners Series (Part 2), uploaded by ScrapeOps on 24 August 2022.