Web Scraping Tutorials


Step by step tutorials for web scraping, web crawling, data extraction, headless browsers, etc. Our web scraping tutorials are usually written in Python using libraries such as LXML or Beautiful Soup and occasionally in Node.js. The full source code is available to download or clone using Git.

All Tutorials

How to take screenshots using Puppeteer

How to take screenshots using Puppeteer

Learn how to take screenshots of entire web page, a specific area or different view ports in Google Chrome, Chrome Headless or Chromium using Puppeteer and Node JS, for debugging tests or for web scraping

Web Scraping with Puppeteer and NodeJS

Web Scraping with Puppeteer and NodeJS

Puppeteer is a node.js library which provides a powerful but simple API that allows you to control Google’s Chrome browser. In this tutorial post, we will show you how to build a web scraper and control chrome using puppeteer and node.js to the scrape details of hotel listings from booking.com

Web Scraping Tutorial for Beginners – Part 3 – Navigating and Extracting Data

Web Scraping Tutorial for Beginners – Part 3 – Navigating and Extracting Data

Part 3 of our Web Scraping for Beginners Series. Learn how to navigate and extract data from Reddit using Python 3 and BeautifulSoup.

How to Scrape Coupon Details from a Walmart Store using Python and LXML

How to Scrape Coupon Details from a Walmart Store using Python and LXML

Tutorial to build a web scraper to extract coupon details from Walmart.com, a leading retail store in the U.S, based on a store ID. We will extract details such as store name, address, contact details and more using Python 3, Python Requests and LXML.

How to Scrape Store Locations from Walmart.com using Python 3

How to Scrape Store Locations from Walmart.com using Python 3

Tutorial to build a web scraper to extract store locations and its details from Walmart.com, a leading retailer in the U.S. We will extract details such as store name, address, contact details and more using Python 3 and Python Requests.

How To Rotate Proxies and IP Addresses using Python 3

How To Rotate Proxies and IP Addresses using Python 3

When scraping many pages from a website, using the same IP addresses will lead to getting blocked. A way to avoid this is by rotating IP addresses that can prevent your scrapers from being disrupted. In this tutorial, we will show you how to rotate IP addresses to prevent getting blocked while scraping.

Turn the Internet into meaningful, structured and usable data