Developers


How to disable images and CSS in Puppeteer to speed up web scraping

Learn how to disable images and CSS for an entire web page in headless Google Chrome or Chromium using Puppeteer and Node.js, whether for debugging tests or for web scraping.

How To Make Anonymous Requests using TorRequests and Python

Tor is quite useful when you have to make requests without revealing your IP address, especially when you are web scraping. This tutorial uses TorRequests, a Python wrapper that helps you do exactly that.

How to take screenshots using Puppeteer

Learn how to take screenshots of an entire web page, a specific area, or different viewports in Google Chrome, headless Chrome, or Chromium using Puppeteer and Node.js, whether for debugging tests or for web scraping.

Web Scraping with Puppeteer and NodeJS

Puppeteer is a Node.js library that provides a powerful but simple API for controlling Google’s Chrome browser. In this tutorial, we will show you how to build a web scraper and control Chrome using Puppeteer and Node.js to scrape the details of hotel listings from booking.com.

How to Solve Simple Captchas using Python Tesseract

CAPTCHA stands for Completely Automated Public Turing test to tell Computers and Humans Apart. As the acronym suggests, it is a test used to determine whether the user is human or not. A typical captcha consists of distorted text, which a computer program cannot interpret but a human can (hopefully) still read. This tutorial will […]

How to Parse Addresses using Python and Google GeoCoding API

Web scraping often leaves you with unstructured address data. If you have come across a large number of freeform addresses, each given as a single string, for example “9 Downing St Westminster London SW1A, UK”, you know how hard it is to validate, compare and deduplicate them. To start […]
