Simple and Fast Web Scraping using Python | Requests_HTML | IMDb movies Scraping | Data Analytics
Python is a powerful programming language that supports libraries for a variety of tasks such as working with APIs, data analysis, and machine learning. [1] Anaconda is an open-source distribution of the Python and R programming languages for data science that aims to simplify package management and deployment. The Jupyter Notebook is the original web application for creating and sharing computational documents. It offers a simple, streamlined, document-centric experience. [2] The Requests-HTML library intends to make parsing HTML (e.g. scraping the web) as simple and intuitive as possible. [3]

In this video, we will be learning:
• Different web scraping libraries in Python
• How to import libraries and run Python scripts in Jupyter Notebook
• How to scrape multiple data points from one website using the requests_html library
• How to scrape multiple websites using the requests_html library

Timeline:
• Web scraping libraries - 0:27
• IMDb website scraping scope - 1:28
• Jupyter Notebook and libraries - 2:30
• Web scraping Python script - 4:34
• Extracting the XPath of different data points from the website - 6:30
• Extracting the XPath of IMDb movie links from a Python set - 13:15
• How to work with scraped data, tuples and DataFrames - 18:05
• Web scraping Python for loop - 20:00

How can web scraping help you?
• E-commerce - product details extraction, product price scraping, pricing analysis
• Business intelligence - extract data to support business decisions
• Personal use - scrape any website using this method to get the data into Excel, then filter it by different fields to cut down the time spent scrolling and checking

* The information in the videos and the resources available for download/viewing through this YouTube channel are for educational and informational purposes only.

Sources:
1. https://www.python.org/doc/essays/blurb/
2. https://jupyter.org/
3. https://requests.readthedocs.io/projects/requests-html/en/latest/
4. https://pypi.org/project/requests-html/
5. https://pypi.org/project/beautifulsoup4/
6. https://selenium-python.readthedocs.io/
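
The flow outlined in the timeline (fetch a page, pull several data points by XPath, loop over multiple pages, collect one tuple per page) could look roughly like the sketch below. It is not the script from the video. The XPath expressions, the field names, and the helper names `fetch`, `extract_row`, and `scrape_many` are hypothetical placeholders, and the real expressions depend on the markup of the page being scraped.

```python
# A minimal sketch, assuming the requests_html library is installed.
# The XPath expressions below are hypothetical placeholders, not IMDb's
# actual markup; inspect the target page to find the real ones.
FIELDS = {
    "title": "//h1/text()",
    "year": "//span[@class='year']/text()",
}


def fetch(url):
    """Download one page and return its parsed HTML (needs network access)."""
    # Imported here so the helpers below can be tried without the library.
    from requests_html import HTMLSession

    session = HTMLSession()
    response = session.get(url)
    return response.html


def extract_row(html, fields=FIELDS):
    """Return one tuple holding the first XPath match per field (or None)."""
    row = []
    for xpath in fields.values():
        value = html.xpath(xpath, first=True)
        # Text nodes come back as strings; strip stray whitespace.
        row.append(value.strip() if isinstance(value, str) else value)
    return tuple(row)


def scrape_many(urls, fetch_page=fetch, fields=FIELDS):
    """Loop over several pages and collect one tuple per page."""
    rows = []
    for url in urls:
        rows.append(extract_row(fetch_page(url), fields))
    return rows
```

Calling `scrape_many` with a list of page URLs returns a list of tuples. Passing that list to `pandas.DataFrame(rows, columns=list(FIELDS))` gives a table that can be filtered or exported to Excel. The `fetch_page` parameter is there only so the loop can be tried with a stand-in fetcher before touching a live site.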