Remote Data Mining And Management Job In Data Science And Analytics

Scraping CMS data

Find more Data Mining And Management remote jobs posted recently Worldwide

Need to scrape CMS data, and deliver scraping script (in Python), as well as as a database (SQLite) with the data

Looking for a person to scrape (using python, ideally requests library) to do the following steps:

1) Scrape CPSC data: a file downloaded per month for all available years (full version, not abridged)

2) Scrape plan cross walk: a file downloaded for every year available

(removed by Toogit admin)

3) Scrape STARS measures: I want the fall Report_Card_Master_Table excel for each year of data (in the zip), then only extract the Measure_data tab from that excel


3) Create SQLite (or open source) database
SQLite database should have the following:
a) One table that contains all the rows from CPSC_enrollment_info files (but add 2 columns, 1 for year and 1 for month depending on entry from file)
b) One table that contains all the rows from CPSC_contract_info files (but add 2 columns, 1 for year and 1 for month depending on entry from file)
c) One table that contains all the rows from plan crosswalk (add 1 column depending on year of the file)
d) One table with the measure_data from the STARS file (add 1 column depending on the year file)

Deliverable:

1) Python script
2) SQLite (or other open source alternative) database with all the tables mentioned above
About the recuiter
Member since Aug 31, 2017
Natalie
from Hunan, China

Skills & Expertise Required

Data Scraping MySQL Programming Python 

Candidate shortlisted and hired
Hiring open till - May 15, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$95.79

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Mariadb optimization for 200 million records

I need someone that optimize my mariadb servers settings to improve the performance for these needs:

A table with 200 million records and 20 gigas size.
The table has 21 partitions, and is replicated in master-master mode with 2 servers<...read more

Unreal Engine 4 specialist for Residential Apartment immersive walkthrough

From a 3d max or Revit model develop a stream line process to convert .3ds or .rvt into immersive walkthroughs using unreal engine 4. The idea will be to create a photo realistic walkthrough of different architectural 3d spaces with the unreal engine...read more

Data Enrichment of social media post data

We have a list of 50.000 mentions from posts of Instagram, and we need to identify the ones that are connected to a company

For each Company we will need to find:
- Company Name
- Linkedin Page (if not possible URL)
- Linkedin I...read more

Writer for Java/Spring Tutorial Blog

Were looking for programmers to write high-quality articles on Java/Spring topics. Wed generally like to keep it focused on back-end and back-end infrastructure:

- Java (Java SE, Java EE)
- The Spring Framework
- Spring Boot
-...read more

I need a SQL query written for my database and a way to view and segment data.

Im looking for a programmer that can write a SQL query based on the data we need and then import it to Access database so we use the data.