We require a script to be built that will:
Scrape required website
Clean the results (a 3 step repetitive process on every scrape.
Then scrape more data from the remaining URLs on the same site.
Ideally this would then sync to a third scrape of a second website with the results of the second scrape.
Full Architecture map including scraping tags will be provided as we already have this working locally, would need this to be run in a cloud based environment using multiple IPs under master/slave system. Though would like input on best practises for this.
Full details will be provided on application.
Skills & Expertise RequiredData Mining
Offer to work on this project closes in 288 days!Submit A Proposal
Share this project with your friends