Scraping Glassdoor Companies Data
Project Title: Scraping Glassdoor Companies Data
For every company for which there is an “overview” on Glassdoor, I would like the following data points collected. As a final product, I would like a csv file where each row is a company and there is a column for each of the variables listed below. These variables should be named using the name in parentheses below. There should be a row for every company that has an overview page on Glassdoor. There will be missing data points for some companies. Glassdoor reports that they have around 1 million companies on their website.
Number of employees (size)
Type Private, public, NGO, etc.
Geographic location of headquarters (hq)
Country of headquarters based on hq
State of headquarters if in the US based on hq
City of headquarters based on hq
Year of founding (founded)
Average Overall employee rating out of five stars (rating_oa)
Total number of overall employee rating reviews (rating_oa_num)
Diversity and Inclusion rating out of five stars (rating_di)
Total number of Diversity and Inclusion Reviews (rating_di_num)
In the attached file I provide a screenshot of a company profile to illustrate where precisely these data points are located on a given company overview page.
For similar work requirements feel free to email us on firstname.lastname@example.org.