Often the best source for the data you need is a website. The data owner might not have an API and may not be interested in providing the data in any other form. So you are stuck trying to figure out how to scrape that data and make sense of it...
This is where web scraping comes in: building small applications that understand the structure and semantics of particular websites and can extract and categorize information from them, possibly even with some machine learning. This talk goes through the basics of web scraping and teaches you how to build scrapers without getting IP-banned by every site out there.
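To give a flavour of what such a small application looks like, here is a minimal sketch in Python using only the standard library. It is not taken from the talk: the `price` class name, the sample HTML, and the helper names `extract_prices` and `crawl` are all hypothetical, and the fixed pause between requests is just one simple way to avoid hammering a site.

```python
import time
from html.parser import HTMLParser


class PriceParser(HTMLParser):
    """Collects the text of every <span class="price"> element.

    The "price" class is an assumption about the target page's markup.
    """

    def __init__(self):
        super().__init__()
        self._in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) tuples
        if tag == "span" and ("class", "price") in attrs:
            self._in_price = True

    def handle_endtag(self, tag):
        if tag == "span":
            self._in_price = False

    def handle_data(self, data):
        if self._in_price and data.strip():
            self.prices.append(data.strip())


def extract_prices(html):
    """Return the price strings found in one HTML document."""
    parser = PriceParser()
    parser.feed(html)
    parser.close()
    return parser.prices


def crawl(urls, fetch, delay_seconds=1.0):
    """Fetch each URL with the supplied fetch(url) -> str callable,
    pausing between requests so the site is not flooded."""
    results = {}
    for i, url in enumerate(urls):
        if i > 0:
            time.sleep(delay_seconds)
        results[url] = extract_prices(fetch(url))
    return results


# Hypothetical page content standing in for a real HTTP response.
SAMPLE_HTML = """
<ul>
  <li>Widget <span class="price">9.99</span></li>
  <li>Gadget <span class="price">24.50</span></li>
</ul>
"""

print(extract_prices(SAMPLE_HTML))
```

In a real scraper, `fetch` would wrap an HTTP client, and you would likely also check the site's robots.txt and back off on errors; passing `fetch` in as a parameter keeps the parsing logic testable without touching the network.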
Video: DPC13: Growing spiders to crawl the web - Juozas Kaziukénas. Uploaded by Ibuildings Dutch PHP Conference on 13 December 2013.