Scrape popular apps from Apple App Store (iTunes Store) using Google Chrome

Using the Web Scraper Chrome extension, you can extract data from the Apple App Store (iTunes Store) and collect information about popular apps. This article shows you how. Apple lists the top (popular) apps for each category in its App Store. This web scraper starts at the URL https://itunes.apple.com/us/genre/ios/id36?mt=8 and extracts basic data for the popular apps within each category.

What data are we extracting?

Here is a list of data fields we will be extracting from each app:

  1. Name
  2. Age
  3. Developer
  4. Tagline
  5. Rating
  6. Pricing
  7. Seller
  8. Size

Below is an annotated screenshot:

[Screenshots: itunes-data-fields, itunes-data-fields-extract]

Prerequisites

  • Google Chrome Browser – You will need to download the Chrome browser. The extension requires Chrome 49+.
  • Web Scraper Chrome Extension – The Web Scraper extension can be downloaded from the Chrome Web Store. After installing the extension, you will see a spider icon in your browser toolbar.

For detailed steps, continue reading.

Import the iTunes Scraper

Using the Web Scraper extension, you can create a sitemap that describes how the website should be traversed and what data should be extracted. The setup process is fairly simple. If you need to know more, you can follow some of our other Web Scraper Extension tutorials or the Documentation.
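To make the idea of "how the website should be traversed" concrete, here is a minimal Python sketch that prints a sitemap as a tree. It assumes only what a sitemap export contains: each selector has an `id` and a `parentSelectors` list, and `_root` stands for the start URL. The trimmed sitemap, the selector id `genre`, and the function names are illustrative and not part of the extension. The sketch also assumes the selectors do not form a cycle.

```python
from collections import defaultdict

# Trimmed, illustrative sitemap with the same shape as a Web Scraper export.
# The selector ids here are hypothetical placeholders.
SITEMAP = {
    "_id": "example",
    "startUrl": ["https://itunes.apple.com/us/genre/ios/id36?mt=8"],
    "selectors": [
        {"id": "app", "type": "SelectorLink", "parentSelectors": ["genre"]},
        {"id": "name", "type": "SelectorText", "parentSelectors": ["app"]},
        {"id": "size", "type": "SelectorText", "parentSelectors": ["app"]},
        {"id": "genre", "type": "SelectorLink", "parentSelectors": ["_root"]},
    ],
}


def children_by_parent(sitemap):
    """Map each parent id to the ids of the selectors that run under it."""
    children = defaultdict(list)
    for selector in sitemap["selectors"]:
        for parent in selector["parentSelectors"]:
            children[parent].append(selector["id"])
    return children


def tree_lines(sitemap, node="_root", depth=0):
    """Return the traversal hierarchy as indented lines, starting at node."""
    children = children_by_parent(sitemap)
    lines = ["  " * depth + node]
    for child in children.get(node, []):
        lines.extend(tree_lines(sitemap, child, depth + 1))
    return lines


if __name__ == "__main__":
    print("\n".join(tree_lines(SITEMAP)))
```

The order of entries in `selectors` does not matter here: the hierarchy comes entirely from the `parentSelectors` references, so a link selector listed last can still be the first one followed from the start URL.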

We have already built a scraper that you can copy and paste into the extension.

Import Scraper

After you have installed the extension, right-click anywhere on a page and choose ‘Inspect’ to open the Developer Tools panel. Click the ‘Web Scraper’ tab, then the ‘Create new sitemap’ button, and choose the ‘Import sitemap’ option. Now paste the JSON below into the ‘Sitemap JSON’ box.

{  
    "_id":"itunes_store",
    "startUrl":[  
       "https://itunes.apple.com/us/genre/ios/id36?mt=8"
    ],
    "selectors":[  
       {  
          "id":"app",
          "type":"SelectorLink",
          "parentSelectors":[  
             "category"
          ],
          "selector":"div.column a",
          "multiple":true,
          "delay":0
       },
       {
          "id":"name",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"h1.product-header__title",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"age",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"span.badge",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"developer",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"h2.product-header__identity",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"tagline",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"h2.product-header__subtitle",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"rating",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"figcaption.we-rating-count",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"pricing",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"ul.product-header__list:nth-of-type(2) li.product-header__list__item",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"seller",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"div.information-list__item:nth-of-type(1) dd.information-list__item__definition",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"size",
          "type":"SelectorText",
          "parentSelectors":[
             "app"
          ],
          "selector":"div.information-list__item:nth-of-type(2) dd.information-list__item__definition",
          "multiple":false,
          "regex":"",
          "delay":0
       },
       {
          "id":"category",
          "type":"SelectorLink",
          "parentSelectors":[  
             "_root"
          ],
          "selector":"a.top-level-genre",
          "multiple":true,
          "delay":0
       }
    ]
 }

Copy the JSON above and import it into the Web Scraper extension. You can also copy it from GitHub: https://gist.github.com/scrapehero/f2739d90c7956ab02be7530942b3d1fa
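Because selectors refer to each other only by id strings, a copy-paste slip can be hard to spot by eye. The following is a rough Python sketch of a pre-import check. It assumes the sitemap layout shown in this article (`startUrl`, `selectors`, `id`, `parentSelectors`, with `_root` as the top-level parent). The function name `find_sitemap_problems` and the two sample strings are hypothetical, and the checks are not the extension's own validation.

```python
import json


def find_sitemap_problems(sitemap_text):
    """Return a list of human-readable problems; an empty list means none found."""
    try:
        sitemap = json.loads(sitemap_text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg} (line {exc.lineno})"]
    if not isinstance(sitemap, dict):
        return ["top level must be a JSON object"]

    problems = []
    selectors = sitemap.get("selectors", [])
    ids = [sel.get("id") for sel in selectors if sel.get("id") is not None]
    known = set(ids) | {"_root"}  # "_root" is the start page, not a selector

    # Each selector id should appear only once.
    for dup in sorted({i for i in ids if ids.count(i) > 1}):
        problems.append(f"duplicate selector id: {dup}")

    # Every parent reference should point at "_root" or an existing selector.
    for sel in selectors:
        for parent in sel.get("parentSelectors", []):
            if parent not in known:
                problems.append(f"{sel.get('id')}: unknown parent {parent!r}")

    if not sitemap.get("startUrl"):
        problems.append("startUrl is missing or empty")
    return problems


# Hypothetical samples: GOOD is consistent, BAD misspells a parent id.
GOOD = """{"_id": "x", "startUrl": ["https://example.com"], "selectors": [
  {"id": "genre", "parentSelectors": ["_root"]},
  {"id": "app", "parentSelectors": ["genre"]}]}"""
BAD = GOOD.replace('["genre"]', '["genres"]')

if __name__ == "__main__":
    print(find_sitemap_problems(GOOD))
    print(find_sitemap_problems(BAD))
```

An id only needs to be spelled the same way everywhere it is used, so the check compares references against the ids actually present rather than against any fixed list of names.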

Run the Scraper

To start scraping, open the Sitemap drop-down and click ‘Scrape’. A new Chrome window will launch so that the extension can scroll through the pages and grab the data. Once the scrape is complete, the window closes automatically and you receive a notification.

Download the Data

To download the scraped data as a CSV file that you can open in Microsoft Excel or Google Sheets, go to the Sitemap drop-down > Export as CSV > Download Now.
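If you would rather process the export in code than in a spreadsheet, the sketch below loads the CSV with Python's standard `csv` module. It assumes the CSV column names match the selector ids listed at the top of this article, and that the size field looks like "245.8 MB". Both are assumptions to check against your own export. The sample rows are made up for illustration, and `load_apps` and `size_in_mb` are hypothetical helper names.

```python
import csv
import io
import re

# Assumed column names: the data fields this tutorial extracts.
FIELDS = ["name", "age", "developer", "tagline", "rating", "pricing", "seller", "size"]

# Conversion factors to megabytes for the assumed size units.
_UNITS = {"KB": 1 / 1024, "MB": 1.0, "GB": 1024.0}


def size_in_mb(text):
    """Convert a string such as '245.8 MB' to megabytes, or None if unparseable."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*(KB|MB|GB)\s*", text or "")
    if not match:
        return None
    return float(match.group(1)) * _UNITS[match.group(2)]


def load_apps(csv_file):
    """Read rows from an open CSV file object into a list of dicts."""
    apps = []
    for row in csv.DictReader(csv_file):
        app = {field: (row.get(field) or "").strip() for field in FIELDS}
        app["size_mb"] = size_in_mb(app["size"])
        apps.append(app)
    return apps


# Made-up sample data standing in for a real export.
SAMPLE = """name,age,developer,tagline,rating,pricing,seller,size
Example App,4+,Example Dev,An example tagline,4.5,Free,Example Inc.,245.8 MB
Other App,12+,Other Dev,,,$0.99,Other LLC,
"""

if __name__ == "__main__":
    for app in load_apps(io.StringIO(SAMPLE)):
        print(app["name"], app["pricing"], app["size_mb"])
```

For a downloaded file, pass an open file object instead of the in-memory sample, for example one opened with `newline=""` as the `csv` module documentation recommends. Missing or empty fields come back as empty strings, and an unparseable size becomes `None` rather than raising an error.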

Disclaimer: Any code provided in our tutorials is for illustration and learning purposes only. We are not responsible for how it is used and assume no liability for any detrimental usage of the source code. The mere presence of this code on our site does not imply that we encourage scraping or scrape the websites referenced in the code and accompanying tutorial. The tutorials only help illustrate the technique of programming web scrapers for popular internet websites. We are not obligated to provide any support for the code, however, if you add your questions in the comments section, we may periodically address them.
