Remote Data Mining And Management Job In Data Science And Analytics

Regular Expressions (RegEx) for extracting data from unstructured text documents

Find more Data Mining And Management remote jobs posted recently Worldwide

I have a repository of text files produced by different authors. In each of these documents, there are a set of discrete data-points that I wish to extract. While the different authors use different templates and formatting to produce their respective documents, each document is attempting to provide values for the same master set of data-points that I wish to extract.

My team has developed a framework that utilizes regular expressions to automatically process our different document templates. The goal of this contract is to engage developers to review sample instances of the assigned document template, and then produce the regex statements to accurately and consistently extract the necessary values.
* Applicants will be provided with samples of the document template being assigned
* Applicants will be asked to submit at least 3 extraction rules to demonstrate an understanding of the parser framework, and also to demonstrate skill in producing robust and accurate matching rules.

Our team is committed to providing timely feedback and support in order to ensure a successful contract completion.
About the recuiter
Member since Sep 4, 2017
Ashok Kumar
from Bihar, India

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Candidate shortlisted and hiredHiring open till - Aug 10, 2021

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$25.04

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Data Enricher/Miner for LinkedIn prospects

I have a large volume of prospect data / b2b data. I have their names, company names, job titles, location, but I dont have their email addresses.

Im looking for a data miner/enricher that can provide me with email addresses for the prosp...read more

Informatica for SAP

We are looking for a resource / company who can install and implement Informatica. The requirement is to use Informatica as a middleware to extract data from SAP ECC6 (these could include tables, function modules, reports etc) into a SQL Server dataw...read more

RDLC report development needed

We need to add an RDLC report to our website. We have already worked with a designer to create the design of the report layout. The design is in Adobe InDesign format, and we need someone to translate that into an actual RDLC report.

A few...read more