Shopify scrape
1. Intro
In this project we will gather competitor pricing for the Vivo x200 and all of its variations.
The online stores whose price data we collect are AverageDad, PDAplaza, and WondaMobile.
Without going into the code and the complications I faced, here is a quick EDA of the data.
Price distribution of all Vivo x200's variations across the three websites.
WondaMobile has the narrowest price distribution.
PDAPlaza, with the widest distribution, has the lowest entry prices.
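One way to put numbers behind the observations above is to summarize each store's prices by minimum, median, maximum, and range. The sketch below is only an illustration of that idea: the function name `price_spread`, the `(store, price)` row shape, and the example rows are all assumptions, and the example numbers are placeholders rather than scraped values.

```python
from collections import defaultdict
from statistics import median


def price_spread(rows):
    """Group (store, price) pairs by store and summarize each store's prices."""
    by_store = defaultdict(list)
    for store, price in rows:
        by_store[store].append(float(price))
    return {
        store: {
            "min": min(prices),            # lowest entry price
            "median": median(prices),
            "max": max(prices),
            "range": max(prices) - min(prices),  # width of the distribution
        }
        for store, prices in by_store.items()
    }


# Placeholder rows, NOT scraped values: they only show the expected input shape.
example_rows = [
    ("store_a", 100.0),
    ("store_a", 180.0),
    ("store_b", 120.0),
    ("store_b", 130.0),
]
for store, stats in price_spread(example_rows).items():
    print(store, stats)
```

A store with a small `range` has a narrow distribution, and the store with the smallest `min` has the lowest entry price.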
Launch dates and prices of all Vivo x200 variations across the three websites.
PDAPlaza published/created all of its products within a one-day window, unlike AverageDad and WondaMobile.
PDAPlaza was late to the initial Vivo x200 launch.
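The "one-day window" observation can be checked by measuring the gap between a store's earliest and latest product creation timestamps. This is a minimal sketch, assuming the timestamps are ISO 8601 strings with a numeric UTC offset (for example `2025-01-01T10:00:00+08:00`) and that all of them carry an offset; the function names and the sample timestamps are hypothetical.

```python
from datetime import datetime, timedelta


def creation_window(timestamps):
    """Return the time span between the earliest and latest timestamp."""
    parsed = [datetime.fromisoformat(ts) for ts in timestamps]
    return max(parsed) - min(parsed)


def within_one_day(timestamps):
    """True if every product was created within 24 hours of the others."""
    return creation_window(timestamps) <= timedelta(days=1)


# Placeholder timestamps, NOT real product data.
sample = ["2025-01-01T10:00:00+08:00", "2025-01-01T18:30:00+08:00"]
print(creation_window(sample), within_one_day(sample))
```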
2. PS
As of July 1, PDAplaza has switched from Shopify to WordPress/WooCommerce.
- Using Webarchive we can take a look at the website's HTML
- This is their new website's HTML
The "product link + .json" method (for example, https://averagedadofficial.com/products/vivo-x200-ultra.json) no longer works for PDAplaza.
The site now needs a tailored scraping script.
This shows how a scraping process has to adapt when a site changes frameworks. Thankfully, the change is not in the security/anti-bot area.
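For reference, the "product link + .json" method mentioned above can be sketched as follows. This is not the project's actual script. It assumes the usual Shopify payload shape, with a top-level `product` object that has `handle`, `created_at`, `published_at`, and a `variants` list whose items carry `title` and `price`. The function names and the `User-Agent` header are illustrative choices.

```python
import json
from urllib.parse import urlsplit, urlunsplit
from urllib.request import Request, urlopen


def product_json_url(product_url):
    """Turn a product page URL into its .json endpoint, dropping query and fragment."""
    parts = urlsplit(product_url)
    path = parts.path.rstrip("/")
    return urlunsplit((parts.scheme, parts.netloc, path + ".json", "", ""))


def extract_variants(payload):
    """Flatten a product payload into one row per variant (assumed field names)."""
    product = payload["product"]
    return [
        {
            "handle": product.get("handle"),
            "variant": variant.get("title"),
            "price": float(variant["price"]),  # assumed to arrive as a string
            "created_at": product.get("created_at"),
            "published_at": product.get("published_at"),
        }
        for variant in product.get("variants", [])
    ]


def fetch_product(product_url, timeout=10.0):
    """Download the .json endpoint and return its variant rows."""
    request = Request(
        product_json_url(product_url),
        headers={"User-Agent": "Mozilla/5.0"},  # assumption: some stores reject the default agent
    )
    with urlopen(request, timeout=timeout) as response:
        return extract_variants(json.load(response))
```

Calling `fetch_product("https://averagedadofficial.com/products/vivo-x200-ultra")` would request the `.json` URL shown earlier. On a store that has moved off Shopify, the request would be expected to fail or return non-JSON content, which is why a site-specific scraper becomes necessary.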