All Articles

Sports Data – The Rise of Big Data and Analytics

Sports Data – The Rise of Big Data and Analytics

The global sports market is huge with its total revenue projected to be around 90 billion dollars in 2017. In sports big data and web scraping is one of the sectors in which data analytics has demonstrated great value and has a great potential with major professional sports teams putting them to use. It is […]

Amazon vs Walmart – Products Sold in April 2017

Amazon vs Walmart – Products Sold in April 2017

As of April 2017, Walmart has a total of 23.5 million products on sale while Amazon has 332 million products. Walmart has only 7% of what Amazon has to offer. Walmart is the second-largest online mass merchandiser behind Amazon, but it’s only a distant second. With $136 billion in total sales last year, Amazon has […]

How to Solve Simple Captchas using Python Tesseract

How to Solve Simple Captchas using Python Tesseract

CAPTCHA stands for Completely Automated Public Turing test to tell Computers and Humans Apart. As the acronym suggests, it is a test used to determine whether the user is human or not. A typical captcha consists of a distorted test, which a computer program cannot interpret but a human can (hopefully) still read. This tutorial will […]

Number of Products Sold on Amazon.com – April 2017

Number of Products Sold on Amazon.com – April 2017

Amazon.com has a total of 335,765,099 million products as on April 4th, 2017. That is only 1% less than March 2017. Amazon had 337 million products on March 7th, 2017. Top 10 Categories Over the past 3 months, Digital Music has been dominating in the top 10 categories with a total of 67.1 million products, a slight growth […]

How to Parse Addresses using Python and Google GeoCoding API

How to Parse Addresses using Python and Google GeoCoding API

Web scraping can often lead to you having scraped address data which are unstructured. If you have come across a large number of freeform address as a single string, for example – “9 Downing St Westminster London SW1A, UK”,  you know how hard it would be to validate, compare and deduplicate these addresses. To start […]

How to Scrape Expedia using Python and LXML

How to Scrape Expedia using Python and LXML

Learn how to scrape flight details from Expedia.com, a leading travel and hotel site, using Python 3 and LXML in this web scraping tutorial. You’ll learn how to extract flight details such as flight timings, plane names, flight duration and more for a given source and destination.

Amazon vs Walmart- Products Sold in March 2017

Amazon vs Walmart- Products Sold in March 2017

As of March 2017, Walmart has a total of 22 million products on sale and Amazon has 375 million products. Walmart has only 5.8% of what Amazon has to offer. Competition with Amazon Walmart has seen a quantum leap with e-commerce sales picking up. With Walmart’s earnings in the fourth quarter seeming to be a definite blow to Amazon, […]

Number of Products sold on Amazon.com- March 2017

Number of Products sold on Amazon.com- March 2017

Amazon has a total of 337,173,768 products as on March 7th, 2017 That’s 7% less than February 2017. Amazon had 362 million products on February 4th, 2017. Top 10 Categories   Just like the preceding month, Digital Music category leads with 66.3 million products. The Home & Kitchen category has overtaken Electronics by 2.1 million products […]

Amazon vs Walmart – Products sold in February 2017

Amazon vs Walmart – Products sold in February 2017

As of February 2017, Walmart has a total of about 20 Million products on sale and Amazon has 371 million products. Walmart still has only 5.4% of the products that Amazon has to offer. Walmart enjoyed its best quarter in 4 years with its 2016 Q4 earnings hitting $133.6 billion. In e-commerce, they had a 29% increase boosted by the acquisition of Jet.com, […]

How to scrape Yelp Business Details using Python and LXML

How to scrape Yelp Business Details using Python and LXML

This tutorial is a follow-up of How to scrape Yelp.com for Business Listings using Python. In this tutorial, we will help you in scraping Yelp.com data from the detail page of a business. You can use URLs of businesses you are interested in OR the ones you got from part one of this tutorial. Let’s […]

Number of Products sold on Amazon.com- February 2017

Number of Products sold on Amazon.com- February 2017

Amazon has a total of 362,160,574 products as on February 4th, 2017 That’s 5% less than January 2017. Amazon had 398 Million products on January 4, 2017. Top 10 Categories The Digital Music category is the largest, with nearly 65.6 Million products. The Digital Music category has surpassed Electronics by 14 million products compared to last […]

Data Extraction Services – an essential guide and checklist

Data Extraction Services – an essential guide and checklist

Data is everywhere but most of it is unusable because it is not in a format that can be used. Data extraction services help tap the vast data resources available online or within internal sources and extract the data so that it can be used to benefit the business. This post is a data extraction services […]

Turn the Internet into meaningful, structured and usable data