Data Analytics freelancer required: write a program to merge a list of leads by finding similar company and contact names

Posted at - May 13, 2021


We have scraped data on buildings in New York City.

Each building can have up to 3 owners and 1 management company (or 4 owners and no management company).

(NYC buildings are expensive, and owners often partner together to buy them.)

Each owner can own any number of buildings (depending on how wealthy they are).

Given that each building is a partnership of several owners, a new business entity (company) is usually created when a building is purchased.

That means that each building has owners (people) as well as a company that 'owns' the building.

That also means that each owner can be associated with numerous different companies.

There are a few many-to-many relationships here (however, there is always only one building).
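The relationships described above could be modeled roughly as follows. This is only a sketch: the class and field names (`Building`, `owner_ids`, `management_company_id`, and so on) are hypothetical and not taken from the scraped data.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Contact:
    contact_id: int
    name: str


@dataclass
class Company:
    company_id: int
    name: str


@dataclass
class Building:
    address: str
    company_id: int                      # the entity that 'owns' the building
    owner_ids: List[int] = field(default_factory=list)
    management_company_id: Optional[int] = None

    def validate(self) -> None:
        # Up to 3 owners with a management company, or up to 4 without one.
        limit = 3 if self.management_company_id is not None else 4
        if len(self.owner_ids) > limit:
            raise ValueError(
                f"{self.address}: {len(self.owner_ids)} owners exceeds limit of {limit}"
            )
```

Because a contact can appear in many buildings and companies, storing IDs rather than names on each building keeps the later merge step to a simple ID rewrite.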

In addition, an owner sometimes uses the same company to buy two buildings. But because we're dealing with scraped data, which is only as good as the person who entered it on the city's platform, there are very often slight differences in spelling between the two company names, or even between the owner names of a building, making a straight comparison impossible.

(For example, there could be one building owned by 'The Carlton Group' (company name), which is owned by 'John Marks' and 'Greg Smith', and another building owned by 'Carlton Group', which is owned by 'Jonathan Marks' and 'Gregory Smith'.)
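One possible way to compare names like these is to normalize them and then score their similarity. The sketch below uses Python's standard `difflib.SequenceMatcher`; the helper names `normalize` and `similarity` are hypothetical, and dropping a leading "The" is an assumption based on the example above.

```python
import re
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, strip punctuation, and drop a leading 'the'."""
    words = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    if words and words[0] == "the":
        words = words[1:]
    return " ".join(words)


def similarity(a: str, b: str) -> float:
    """Return a score between 0.0 and 1.0; 1.0 means identical after normalizing."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


print(similarity("The Carlton Group", "Carlton Group"))  # 1.0
print(similarity("John Marks", "Jonathan Marks"))
print(similarity("Greg Smith", "Gregory Smith"))
```

A character-level score like this will not know that "Greg" is short for "Gregory"; a nickname table or a dedicated fuzzy-matching library might do better, and any threshold would need tuning against manually reviewed data.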

So far, we've been manually comparing the data to look for duplicates.

The goal is to write a program that will merge and then divide all the data into 3 master lists of:

companies
contacts
buildings

so that all similar companies are merged into one company, and

all similar contacts are merged into one contact.
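Merging "all similar" names could be approached by grouping every pair that scores above a threshold. The sketch below uses a simple union-find over all pairs; the function name `cluster_similar` and the 0.8 threshold are assumptions for illustration, not requirements from this posting.

```python
from difflib import SequenceMatcher
from itertools import combinations
from typing import Dict, List


def cluster_similar(names: List[str], threshold: float = 0.8) -> List[List[str]]:
    """Group names whose pairwise similarity meets the threshold."""
    parent = list(range(len(names)))

    def find(i: int) -> int:
        # Follow parent links to the root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(names)), 2):
        score = SequenceMatcher(None, names[i].lower(), names[j].lower()).ratio()
        if score >= threshold:
            ri, rj = find(i), find(j)
            if ri != rj:
                parent[max(ri, rj)] = min(ri, rj)  # keep the earliest index as root

    groups: Dict[int, List[str]] = {}
    for i, name in enumerate(names):
        groups.setdefault(find(i), []).append(name)
    return list(groups.values())


print(cluster_similar(["The Carlton Group", "Carlton Group", "Acme Holdings"]))
# [['The Carlton Group', 'Carlton Group'], ['Acme Holdings']]
```

Comparing every pair is quadratic in the number of names, and grouping is transitive (A matches B and B matches C puts all three together), so a large list would probably need blocking by some key and a manual review of big groups.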

We want the program to include an audit log that shows what the old data was and what the new data is. That will make the manual part easier, since we'll only be manually reviewing what the program changed.
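An audit log of this kind might record one row per changed value. The sketch below is an assumption about its shape: the column names, the `AuditEntry` class, and the rule of keeping the longest spelling as the merged value are all hypothetical choices.

```python
import csv
import io
from dataclasses import dataclass
from typing import Dict, List, TextIO


@dataclass
class AuditEntry:
    entity_type: str   # 'company' or 'contact'
    record_id: str     # ID of the source row that was changed
    old_value: str
    new_value: str


def apply_merge(entity_type: str, group: Dict[str, str],
                audit_log: List[AuditEntry]) -> str:
    """Pick one spelling for a group of matched names and log each change."""
    canonical = max(group.values(), key=len)  # assumption: keep the longest spelling
    for record_id, old in group.items():
        if old != canonical:
            audit_log.append(AuditEntry(entity_type, record_id, old, canonical))
    return canonical


def write_audit_csv(entries: List[AuditEntry], stream: TextIO) -> None:
    writer = csv.writer(stream)
    writer.writerow(["entity_type", "record_id", "old_value", "new_value"])
    for e in entries:
        writer.writerow([e.entity_type, e.record_id, e.old_value, e.new_value])


log: List[AuditEntry] = []
apply_merge("contact", {"row-1": "John Marks", "row-2": "Jonathan Marks"}, log)
out = io.StringIO()
write_audit_csv(log, out)
print(out.getvalue())
```

Writing the log as CSV keeps it easy to open in a spreadsheet for the manual review step.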

The program will allow us to enter different leads at a later time and run them through the same process.
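Running later leads through the same process could mean matching each new name against the existing master list before adding it. The sketch below uses the standard `difflib.get_close_matches`; the `add_lead` function, the dictionary layout, and the 0.8 cutoff are hypothetical.

```python
from difflib import get_close_matches
from typing import Dict, List


def add_lead(name: str, master: Dict[str, List[str]], cutoff: float = 0.8) -> str:
    """Attach a new name to the closest master entry, or create a new entry.

    `master` maps a lowercase key to every spelling seen so far.
    Returns the key the name ended up under.
    """
    key = name.lower()
    matches = get_close_matches(key, list(master), n=1, cutoff=cutoff)
    if matches:
        master[matches[0]].append(name)
        return matches[0]
    master[key] = [name]
    return key


master: Dict[str, List[str]] = {}
add_lead("Carlton Group", master)
add_lead("The Carlton Group", master)   # joins the existing entry
add_lead("Acme Holdings", master)       # creates a new entry
print(master)
```

In this sketch the first spelling seen becomes the key, so the order in which leads arrive affects the result; a real implementation would likely persist the master lists between runs and log each match to the audit trail.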

About the recruiter
Soumendra Saha, member since May 20, 2018
from Zaghwan, Tunisia

Skills & Expertise Required

Data Analytics 

Candidate shortlisted and hired
Hiring open till - Jun 12, 2021

Work from Anywhere
40 hrs / week
Fixed Type
Remote Job
Cost: $347.22
