In this video, we look at how to scrape product data from Amazon using Python Scrapy. We cover how to search for a specific item, crawl through the search results, and then parse the associated product data.
The article that goes along with this video:
https://scrapeops.io/python-scrapy-playbook/python-scrapy-amazon-product-scraper/
Basic Scrapy project used in this tutorial:
https://github.com/python-scrapy-playbook/basic-scrapy-project
Completed Scrapy project, which you can just download and run (after adding your account details!!): https://github.com/python-scrapy-playbook/amazon-python-scrapy-scraper
Link to the ScrapeOps proxy: https://scrapeops.io/proxy-aggregator/
00:00 - Intro
00:46 - How the Amazon pages are structured
02:39 - Installing the basic spider
04:52 - Getting the proxy set up
08:22 - Creating the Amazon product spider
13:51 - Identifying the items on the page with CSS selectors
16:47 - Running the spider
21:14 - Recap & Outro
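
As a rough companion to the "search for an item, then visit each product" flow described above, here is a minimal sketch of how the two kinds of Amazon URLs could be built in Python. It is not taken from the video's project: the helper names `build_search_url` and `build_product_url` are hypothetical, and it assumes the amazon.com storefront with the `k` and `page` search parameters and `/dp/<ASIN>` product paths.

```python
from urllib.parse import urlencode

# Assumption: the .com storefront; other regions use a different domain.
AMAZON_BASE = "https://www.amazon.com"


def build_search_url(keyword: str, page: int = 1) -> str:
    """Build a search results URL for a keyword (hypothetical helper)."""
    # urlencode escapes spaces and special characters in the keyword.
    query = urlencode({"k": keyword, "page": page})
    return f"{AMAZON_BASE}/s?{query}"


def build_product_url(asin: str) -> str:
    """Build a product page URL from an ASIN (hypothetical helper)."""
    return f"{AMAZON_BASE}/dp/{asin}"


if __name__ == "__main__":
    print(build_search_url("ipad pro", page=2))
    print(build_product_url("B000000000"))  # placeholder ASIN, not a real product
```

In a Scrapy spider, URLs like these would typically be passed to scrapy.Request, with one callback handling the search results pages and another handling the product pages.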