17
Jan

Web Scraping Real Estate Site

Posted By admin
data collection

Project Title: Web Scraping Real Estate Site

Project Description:
A script, preferably written in Perl (Python will also be considered), to extract house information from the given website into a JSON file or a SQLite database.

The site is https://caribbeanhomesbonaire.com/for-sale/
The script must traverse all result pages.
Information must be collected from both the listing pages and the individual property pages.
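
The page traversal described above could be sketched roughly as follows, in Python since the brief allows it, using only the standard library. This is an illustration under assumptions, not a tested scraper for the site: the `rel="next"` pagination marker and the `property-link` class are hypothetical and would have to be replaced with whatever the real markup uses, and the names `LinkCollector` and `crawl` are made up for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collects (href, rel, class) for every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            a = dict(attrs)
            if a.get("href"):
                self.links.append((a["href"], a.get("rel") or "", a.get("class") or ""))


def fetch(url):
    """Default fetcher; relies on the standard library only."""
    with urlopen(url, timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch_page=fetch, max_pages=500):
    """Follow 'next' links from start_url and return detail-page URLs in order.

    ASSUMPTION: next-page links carry rel="next" and property links carry
    class="property-link". Both markers are hypothetical.
    """
    seen_pages, detail_urls = set(), []
    url = start_url
    while url and url not in seen_pages and len(seen_pages) < max_pages:
        seen_pages.add(url)
        parser = LinkCollector()
        parser.feed(fetch_page(url))
        next_url = None
        for href, rel, cls in parser.links:
            absolute = urljoin(url, href)
            if "property-link" in cls.split() and absolute not in detail_urls:
                detail_urls.append(absolute)
            if "next" in rel.split() and next_url is None:
                next_url = absolute
        url = next_url  # None on the last page, which ends the loop
    return detail_urls
```

Passing the fetcher in as a parameter keeps the loop testable against saved HTML, and the `seen_pages` set together with `max_pages` guards against pagination links that loop back on themselves.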

Info scraped:
status (new listing, sold)
street address
neighbourhood
property type (house, apartment, plot)
lot size (m²)
object size (m²)
price (USD and EUR)
URL to details page (as unique identifier?)
year built
kind of property
bedrooms
bathrooms
description (html)
photo thumbnail in listing
additional photos in a separate table
creation_date
modification_date

Scraped data is written to a SQLite database.
Ideally, all photos are also downloaded and stored as binary BLOBs in a SQLite table with a 1:N relation to property_object (one property, many photos).
On subsequent scrapes, only new or altered data (for example, a price reduction) needs to be saved in the database.
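
One possible shape for the storage requirements above, sketched in Python with the standard `sqlite3` module. Only `property_object`, `creation_date`, and `modification_date` come from the brief. The `property_photo` table, the other column names, the `content_hash` column, and the helper functions are assumptions made for illustration. The sketch also assumes the two dates are set by the scraper rather than read from the site, and that a change is detected by hashing the scraped fields.

```python
import hashlib
import json
import sqlite3
from datetime import datetime, timezone

# Hypothetical column names derived from the "Info scraped" list.
FIELDS = ("status", "street_address", "neighbourhood", "property_type",
          "kind_of_property", "lot_size_m2", "object_size_m2", "price_usd",
          "price_eur", "year_built", "bedrooms", "bathrooms", "description_html")

SCHEMA = """
CREATE TABLE IF NOT EXISTS property_object (
    id INTEGER PRIMARY KEY,
    url TEXT NOT NULL UNIQUE,          -- details-page URL as unique identifier
    status TEXT, street_address TEXT, neighbourhood TEXT,
    property_type TEXT, kind_of_property TEXT,
    lot_size_m2 REAL, object_size_m2 REAL, price_usd REAL, price_eur REAL,
    year_built INTEGER, bedrooms INTEGER, bathrooms INTEGER,
    description_html TEXT,
    content_hash TEXT NOT NULL,        -- assumption: used for change detection
    creation_date TEXT NOT NULL,
    modification_date TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS property_photo (
    id INTEGER PRIMARY KEY,
    property_id INTEGER NOT NULL REFERENCES property_object(id) ON DELETE CASCADE,
    source_url TEXT NOT NULL,
    is_thumbnail INTEGER NOT NULL DEFAULT 0,
    image BLOB,
    UNIQUE (property_id, source_url)   -- 1:N, each photo URL stored once
);
"""


def open_db(path):
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def save_property(conn, url, data):
    """Insert or update one property; return 'new', 'changed' or 'unchanged'."""
    now = datetime.now(timezone.utc).isoformat(timespec="seconds")
    values = [data.get(f) for f in FIELDS]
    digest = hashlib.sha256(json.dumps(values).encode("utf-8")).hexdigest()
    row = conn.execute(
        "SELECT id, content_hash FROM property_object WHERE url = ?", (url,)
    ).fetchone()
    if row is None:
        cols = ", ".join(FIELDS)
        marks = ", ".join("?" for _ in FIELDS)
        conn.execute(
            f"INSERT INTO property_object (url, {cols}, content_hash, "
            f"creation_date, modification_date) VALUES (?, {marks}, ?, ?, ?)",
            [url, *values, digest, now, now])
        result = "new"
    elif row[1] != digest:
        assignments = ", ".join(f"{f} = ?" for f in FIELDS)
        conn.execute(
            f"UPDATE property_object SET {assignments}, content_hash = ?, "
            "modification_date = ? WHERE id = ?",
            [*values, digest, now, row[0]])
        result = "changed"
    else:
        result = "unchanged"  # nothing is written on an identical re-scrape
    conn.commit()
    return result


def save_photo(conn, property_url, source_url, image_bytes, is_thumbnail=False):
    """Store one photo as a BLOB; a known (property, URL) pair is skipped."""
    (pid,) = conn.execute(
        "SELECT id FROM property_object WHERE url = ?", (property_url,)
    ).fetchone()
    conn.execute(
        "INSERT OR IGNORE INTO property_photo "
        "(property_id, source_url, is_thumbnail, image) VALUES (?, ?, ?, ?)",
        (pid, source_url, int(is_thumbnail), image_bytes))
    conn.commit()
```

Comparing a hash of the scraped fields means a repeat run touches a row only when something such as the price has actually changed. If a price history is wanted, a separate history table would be needed, since this sketch overwrites the old values.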

The scripts must use only free and open-source tools and must not depend on any third-party service.

The objective is to run the script via cron roughly every 6 hours to update the database.

Upon successful completion, about 3 or 4 more sites will need to be scraped in additional projects.

I look forward to hearing back from you.

For similar work requirements, feel free to email us at info@webscrapingexpert.com.
