Scrape job posting data from Indeed using Google Chrome

This tutorial will show you how to extract job information from Indeed using Web Scraper Chrome Extension. It helps to gather basic data regarding jobs posted on indeed. You can use this to monitor jobs that fit your profile, location, salary, company and job title.

What data are we extracting?

  1. Employer Name
  2. Job Title
  3. Job Location
  4. Job Description
  5. Rating
  6. Reviews

Below is an annotated screenshot of the data fields we will be extracting:

indeed-details-to-extract

Prerequisites

  • Google Chrome Browser – You will need to download the Chrome browser. The extension requires Chrome 49+.
  • Web Scraper Chrome Extension – The Web Scraper extension can be downloaded from the Chrome Web Store.  After downloading the extension you will see a spider icon in your browser toolbar.

If you don't like or want to code, the ScrapeHero Cloud is just right for you!

Skip the hassle of installing software, programming and maintaining the code. Run this scraper in the ScrapeHero Cloud within seconds

Run this in the Cloud for FREE
Deploy to ScrapeHero Cloud

Import the Indeed Scraper

After installation, right-click anywhere on a page, go to ‘Inspect’ and the developer tools console will pop up. Click on the tab Web Scraper and go on to the ‘Create new sitemap’ button and click on the ‘Import sitemap’ option. Now paste the JSON given below in the Sitemap JSON box. 

You can also copy it from Github – https://gist.github.com/scrapehero/c595899305db78de11ecf7a9c11d4a77

import-indeed-scraper

Obtaining the URL from Indeed

Indeed allows you to search jobs that you can filter based on parameters like distance, salary, location, company and experience level. We have filtered the jobs for full-time accountants in Los Angeles, California. You can edit the metadata by clicking on the sitemap drop-down and enter URL based on your choice of job filter. – In the Web Scraper toolbar, click on the Sitemap button and select the “Edit metadata’ option and paste the new URL (based on your filter) as the Start URL.

url-indeed-scraper

Run the Scraper

Go to the Sitemap and click ‘Scrape’ from the drop down. A new instance of Chrome will launch, enabling the extension to scroll and grab the data. Once the scrape is complete, the browser will close automatically and send a notification.

run-scraper-indeed

Download the Data

To download the scraped data as a CSV file that you can open in Microsoft Excel or Google Sheets, go to the Sitemap drop down > Export as CSV > Download Now.

We can help with your data or automation needs

Turn the Internet into meaningful, structured and usable data


Please DO NOT contact us for any help with our Tutorials and Code using this form or by calling us, instead please add a comment to the bottom of the tutorial page for help

Disclaimer: Any code provided in our tutorials is for illustration and learning purposes only. We are not responsible for how it is used and assume no liability for any detrimental usage of the source code. The mere presence of this code on our site does not imply that we encourage scraping or scrape the websites referenced in the code and accompanying tutorial. The tutorials only help illustrate the technique of programming web scrapers for popular internet websites. We are not obligated to provide any support for the code, however, if you add your questions in the comments section, we may periodically address them.

Posted in:   Job Postings, Web Scraping Tutorials

Responses

H June 21, 2019

When I paste this into the importer box it says it’s invalid. Any help?

Reply

Syed June 23, 2019

When I copy/paste the JSON from Github, it says invalid JSON. Please help

Reply

Comments or Questions?

Turn the Internet into meaningful, structured and usable data   

Enjoying our Tutorials?

Subscribe to our weekly updates on the latest tutorials in Web Scraping and Data Extraction