Scraping Hundreds of State Court Websites
Project Title: Scraping Hundreds of State Court Websites
We actually have a much more complex problem. We’re looking to scrape hundreds of state court websites.
We currently are managing about 100 ourselves. But it’s not our core business so we want to outsource some or all of that function.
Many of the websites require a configuration file in order for the scraper to know which cases to download. In addition, some have CAPTCHA (we currently use Amazon’s mechanical turk).
Here’s a sample court website: http://www.lacourt.org/casesummary/ui/index.aspx?casetype=civil
As you can see, you need a case number to search.
Is this something you could tackle or outside the scope of what you might handle?
For similar work requirements feel free to email us on firstname.lastname@example.org.