Remote Data Mining And Management Job In Data Science And Analytics

Regular Expressions (RegEx) for extracting data from unstructured text documents

Find more Data Mining And Management remote jobs posted recently Worldwide

I have a repository of text files produced by different authors. In each of these documents, there are a set of discrete data-points that I wish to extract. While the different authors use different templates and formatting to produce their respective documents, each document is attempting to provide values for the same master set of data-points that I wish to extract.

My team has developed a framework that utilizes regular expressions to automatically process our different document templates. The goal of this contract is to engage developers to review sample instances of the assigned document template, and then produce the regex statements to accurately and consistently extract the necessary values.
* Applicants will be provided with samples of the document template being assigned
* Applicants will be asked to submit at least 3 extraction rules to demonstrate an understanding of the parser framework, and also to demonstrate skill in producing robust and accurate matching rules.

Our team is committed to providing timely feedback and support in order to ensure a successful contract completion.
About the recuiter
Member since May 20, 2018
Ricky Goodall
from Saskatchewan, Canada

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Open for hiringApply before - Jun 15, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$34.49

Cost

Offer to work on this project closes in 38 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Website Data Management

We are looking to hire a website data management team to help us input and organize our ever-growing data.

Our Goal:
Streamline the daily data entry task.
Spot insights in the data that will help our team be more efficient.
read more

Data collection

Collection 950 phone numbers and paste them into spread sheet

looking for Automata expert

Exercise 1.
Assessment Indicators:
LO1 The finite state machines produced
The construction of combined machines
LO2 The simulation and test cases
The definition of equivalence
You are given the following vending machine speci...read more