Scraping Amazon Reviews
Project Title: Scraping Amazon Reviews
I came across your website and wondered if you might be able to help with a mini project I am working on. I want to calculate the average ratings of products on Amazon
If I go to the amazon site and search for apple I get a page with a bunch of products here http://www.amazon.com/s/ref=nb_sb_noss?url=search-alias%3Daps&field-keywords=apple which I can refine by selecting “Apple” from the list of brands in the left menu bar to narrow the search to only products made by Apple ie http://www.amazon.com/s/ref=sr_nr_p_89_0?fst=as%3Aoff&rh=i%3Aaps%2Ck%3Aapple%2Cp_89%3AApple&keywords=apple&ie=UTF8&qid=1451732876&rnid=2528832011.
From this, there are two options of things I would like to calculate:
Basic version: Calculate the average review (1-5) for all the Apple branded products (ie to go through the pages, extract the number of reviews for each product and the rating of each product. Add together all the versions of “number reviews x ave rating” and divide by the total number of ratings.
Ideal version: Calculate the average rating in each year by going through the individual product pages, extracting all the review ratings and dates of those reviews (for all products sold by a specific brand) and then working out the mean value of reviews made in 2015, 2014 etc.
We could cap the number of pages/products examined in each case to limit the size of the project, if this is easier.
Is this a project you believe can be done using google sheets and if so, would you be able to let me know if you would be interested in working on this as a small project?
For similar work requirement feel free to email us on firstname.lastname@example.org.