DPC13: Growing spiders to crawl the web - Juozas Kaziukénas

Published: 13 December 2013
on channel: Ibuildings Dutch PHP Conference

In most cases, the best place to get some data is right there on some website. The data owner might not have an API and may not be interested in providing the data in any other form. So you are stuck trying to figure out how to scrape that data and make sense of it...

This is where web scraping comes in: building small applications which understand the semantics of particular websites and can figure out how to extract and categorize information, eventually even with some machine learning. This talk goes through the basics of web scraping and teaches you how to build scrapers without getting IP-banned by every site out there.

