Remote Data Mining And Management Job In Data Science And Analytics

Web scraping and PDF scraping for real estate data

Find more Data Mining And Management remote jobs posted recently Worldwide

The goal is to extract data first from a series of PDF tax rolls (document published by a county tax assessor for each city in the county with a list of all properties in each city). I would like to extract the data for a certain type of property within each PDF document.

The next phase of the project would be to take the data extracted from the PDFs and web scrape one or more web sources for additional data on those same properties from the PDFs.

If possible I would like to discuss strategy and approach for this type of project before hiring because I am not highly experienced in this area and you may have a better idea.
About the recuiter
Member since Mar 14, 2020
Rahul Naidu
from Echternach, Luxembourg

Skills & Expertise Required

Data Scraping Web Scraping 

Candidate shortlisted and hiredHiring open till - Apr 29, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$478.93

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Data Enrichment of social media post data

We have a list of 50.000 mentions from posts of Instagram, and we need to identify the ones that are connected to a company

For each Company we will need to find:
- Company Name
- Linkedin Page (if not possible URL)
- Linkedin I...read more

data scrape of real estate agents

just need real estate agents cell phone numbers, time in business, and state

Python Selenium Chrome webdriver

Looking for someone who can quickly (within 24h) solve an issue we are having with the Python Selenium chrome webdriver.

Web scraper to collect data from multiple sources

I am researching real estate in many counties around the country. Each county provides a list of property identifiers matching certain criteria in an XLS or CSV format. For a given county, I would like to read the file into a database, and then suppl...read more