Developers


How To Optimize Playwright Web Scrapers Using Code Profiling Tools

This article explains how code profiling works and shows how to use code profiling tools to optimize Playwright web scrapers.

How Web Scraping With Excel Works: A Tutorial

This tutorial explains the Web Query feature in Excel in detail and shows how to use it to scrape web data into Excel.

How To Block Specific Resources in Playwright

This article explains how to block specific resources in Playwright. A later section also explains how to block requests in Chrome.

What Are The Essential HTTP Headers for Web Scraping?

This article covers the HTTP headers that matter for web scraping, along with some tips for using them.

How to fake and rotate User Agents using Python 3

When you scrape many pages from a website, sending the same user agent with every request makes the scraper easy to detect. One way around that detection is to fake your user agent and change it with every request you make to the website. In this tutorial, we will show you how to fake user agents and randomize them to avoid getting blocked while scraping websites.
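
As a rough illustration of the idea, here is a minimal sketch using only the Python 3 standard library. The `USER_AGENTS` pool and the `build_request` helper are hypothetical names chosen for this example, and the user agent strings are placeholders rather than a vetted list; the linked tutorial may take a different approach.

```python
import random
import urllib.request

# Hypothetical pool of user agent strings (placeholders for illustration only).
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
    "(KHTML, like Gecko) Version/17.0 Safari/605.1.15",
    "Mozilla/5.0 (X11; Linux x86_64; rv:121.0) Gecko/20100101 Firefox/121.0",
]


def build_request(url):
    """Return a Request whose User-Agent is picked at random from the pool."""
    headers = {"User-Agent": random.choice(USER_AGENTS)}
    return urllib.request.Request(url, headers=headers)
```

Calling `build_request(url)` once per page and passing the result to `urllib.request.urlopen` would send a freshly chosen user agent with each request.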

How To Rotate Proxies and change IP Addresses using Python 3

When you scrape many pages from a website, using the same IP address for every request will get you blocked. One way to avoid this is to rotate IP addresses so that your scrapers are not disrupted. In this tutorial, we will show you how to rotate IP addresses to avoid getting blocked while scraping.
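
One possible shape for this, sketched with the Python 3 standard library only: cycle through a list of proxies in round-robin order and build a separate opener for each one. The `PROXIES` list, `next_proxy`, and `build_opener_for` are hypothetical names for this example, and the addresses come from a reserved documentation range, so they would need to be replaced with working proxies.

```python
import itertools
import urllib.request

# Placeholder proxy URLs (reserved documentation addresses, not real proxies).
PROXIES = [
    "http://203.0.113.10:8080",
    "http://203.0.113.11:8080",
    "http://203.0.113.12:8080",
]

# Endless round-robin iterator over the proxy list.
_proxy_cycle = itertools.cycle(PROXIES)


def next_proxy():
    """Return the next proxy URL in round-robin order."""
    return next(_proxy_cycle)


def build_opener_for(proxy_url):
    """Return an opener that routes HTTP and HTTPS traffic through proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)
```

A scraper could then call `build_opener_for(next_proxy()).open(url, timeout=10)` for each page, so that consecutive requests leave from different IP addresses.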

How To Scrape Amazon Product Data and Prices using Python 3

Learn to scrape Amazon using Python. Extract Amazon product details such as name, price, ASIN, and more.

How do websites detect and block bots using Bot Mitigation Tools

An in-depth analysis of how most bot mitigation tools work and how they distinguish bots from humans on both the server side and the client side, starting from the fundamentals of the web.

How to Scrape Booking.com for Hotel Data

Scrape hotel data from Booking.com, such as name, location, room type, price, rating, and number of reviews.

Building an Amazon Product Reviews API using Python Flask

Build and host your own free Amazon Reviews API using Python and Selectorlib, a free web scraper tool.

Scalable Large Scale Web Scraping – How to build, maintain and run scrapers

This article goes through each of the high-level steps in this process in detail: building scrapers, running web scrapers at scale, getting past anti-scraping techniques, data validation and quality control, and ongoing maintenance.

How to disable images and CSS in Puppeteer to speed up web scraping

Learn how to disable images and CSS for an entire web page in headless Google Chrome or Chromium using Puppeteer and Node.js, whether for debugging tests or for web scraping.
