Remote Data Mining And Management Job In Data Science And Analytics

Iterative webscraping and text extraction into Pandas format

Find more Data Mining And Management remote jobs posted recently Worldwide

I would like to download 7 years of data from a courts website

I want all the data that is initially offered on a case (name, court, judgment, etc); as well as some data that may be in a document within each case. Only 200 case results can be returned at a time, so there will be substantial iterative searching. The deliverable should be in Pandas.

There is a recaptcha I am not a robot check box on the initial search.

Search bar does allow wild cards. Format for searching a given year is DC-year-_ _ _ _ _ (five digit case number). Only searching for closed cases. Iterative searching for DC-year-_ _ _ _ _ in 200 case increments seems to make the most sense. Only need certain types of cases, but each case needs to be opened and scraped for data.

Following results would be needed: cause number (i.e. DC-year-_ _ _ _ _), case type, location (court), filing date, end date, judgment amount, type of case.
About the recuiter
Member since May 20, 2018
Dharmawan Dharm
from Wiltshire, United Kingdom

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Candidate shortlisted and hiredHiring open till - Jan 30, 2021

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$19.47

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Milesplit Athlete Stats Webscrape

I need certain track meet results posted on this website:
http://(Removed by Toogit Admin)/results
I would like the results in a .csv or .xls spreadsheet. I need results for all available months, for years from 2005-2017, and for levels midd...read more

working with Data statistics

Python Analyst working with financial data and statistics
Part time - Flexible hours working
one on one data

Python Big Data Developer

- Analyze existing system using Python Linux version (data sources, structures, dependencies)
- Timeline Design and documentation
- Design and create DB architecture (choosing partitioning principals, creating Impala DB structures)...read more