Remote Data Mining And Management Job In Data Science And Analytics

Web scraping, data extraction/cleaning with Python

Find more Data Mining And Management remote jobs posted recently Worldwide

We are a startup looking to obtain relatively clean data from more or less clean sources online.

Your job would be to scrape different websites, that include more structured data (tables) as well as more unstructured data (text). Some of the data can be obtained with simple URL requests (wget, requests, urllib), while other websites you will need to do searches including selecting filters and clicking buttons that require javascript (for example using selenium).

We would like the code to collect the data several times a day using a cron job, ideally set up on AWS EC2. Your code should be written in Python.

We would start with a one-off project for a few of the sites we are interested in and if we are happy with the person, potentially extend to an ongoing contractor arrangement.

Skills needed:
python, web scraping, requests, urllib, selenium, mongoDB, SQL, data extraction, data acquisition, data cleaning, databases, automation, scripting, cron jobs

We are flexible with payments. We can work hourly or on a fixed price basis, depending on the experience and time and cost estimate of the freelancer. We are hoping to spend less than 1000 dollars for the first 2-3 websites, with the scraper, cron job and database setup included.
About the recuiter
Member since Mar 14, 2020
Mr.uma Shankar
from Amazonas, Brazil

Open for hiringApply before - Sep 11, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$958.31

Cost

Offer to work on this project closes in 122 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Google App Script Project

We are looking for a Google App Script wizard, who helps us to automatize Google Slide presentations based on product specs stored in a Google Spreadsheet inkl. pulling images from Google drive.

EXTRACT DATA FROM WEBSITE INTO EXCEL FILES

Extract data from website geologimarche in this website you have to extract using option by Provincia di residenza . We have extract to different type of Provinces ; Ancona, Ascoli Piceno, Fermo, Macerata, Pesaro Urbino.

Wordpress Plugin Developer

1. wordpress plugin development. We have some customization needs for our Wordpress-based web app, we will need you to realize such customization needs.
2. wordpress -AWS EC2 integration. Some of our core tech modules are hosted on AWS EC2, we w...read more

Web Scrapers required for Vietnam

I have a financial aggregator website and we need a resource that can help us in data extraction/ web scraping for the website in Vietnamese.

Python Scraping Pro Needed for Sportsbook scraping site

Want to create a website similar to (Removed by Toogit Admin) with odds for NBA, NFL, MLB, NHL and maybe more.

Would need to pull data from same sportbooks listed in link.

End goal is being able to display the data via wodpress site...read more