Remote Web Development Job In IT And Programming

Bulk Document Identification and Data Extraction


I would like to set up a process to extract data from PDFs or image files for import into a database. About 5 structured document types (usually PDFs, but they could be image files) are the primary focus. Maybe 100 more need to be identified and tagged/indexed, but do not necessarily need data extracted from them. The process could look something like this:

1. Import a PDF file with 500-1000 pages.
2. Run an OCR/indexing process.
3. Export a searchable PDF with individual documents indexed as bookmarks or as separate files.
4. Export data from the structured documents.
5. Export a document inventory.

I hear ABBYY is decent software for this, but I am open to other ideas.
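
To make the steps above more concrete, here is a rough Python sketch of steps 2 to 5. It is only an illustration and assumes the OCR text for each page is already available as a string (from ABBYY or any other OCR engine). The document types, keywords, field patterns and function names are all hypothetical placeholders, not part of any real tool.

```python
import csv
import io
import re

# Hypothetical rules: document type -> phrase expected on its first page.
DOC_TYPES = {
    "invoice": "invoice number",
    "purchase_order": "purchase order",
}

# Hypothetical field patterns, defined only for the structured document types.
FIELD_PATTERNS = {
    "invoice": {
        "invoice_number": re.compile(r"invoice number[:\s]+(\S+)", re.I),
        "total": re.compile(r"total[:\s]+\$?([\d,]+\.\d{2})", re.I),
    },
}


def identify(page_text):
    """Return the first document type whose keyword appears on the page, else None."""
    lowered = page_text.lower()
    for doc_type, keyword in DOC_TYPES.items():
        if keyword in lowered:
            return doc_type
    return None


def split_documents(pages):
    """Group consecutive pages into documents.

    A page that matches a known type starts a new document; any other page
    is treated as a continuation of the previous document.
    """
    documents = []
    for number, text in enumerate(pages, start=1):
        doc_type = identify(text)
        if doc_type is not None or not documents:
            documents.append({
                "type": doc_type or "unknown",
                "first_page": number,
                "last_page": number,
                "text": text,
            })
        else:
            documents[-1]["last_page"] = number
            documents[-1]["text"] += "\n" + text
    return documents


def extract_fields(document):
    """Apply the field patterns for the document's type, if any are defined."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.get(document["type"], {}).items():
        match = pattern.search(document["text"])
        if match:
            fields[name] = match.group(1)
    return fields


def inventory_csv(documents):
    """Return the document inventory (type and page range) as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["type", "first_page", "last_page"])
    for doc in documents:
        writer.writerow([doc["type"], doc["first_page"], doc["last_page"]])
    return buffer.getvalue()
```

The page ranges returned by split_documents could then be used to add bookmarks or to split the searchable PDF into separate files, and the output of extract_fields could be loaded into the database. Simple keyword matching like this is only a starting point; real scans would likely need more robust classification.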


About the recruiter
Member since May 20, 2018
Iain R.
from Scotland, United Kingdom

Candidate shortlisted and hired

Hiring open till Jun 12, 2020

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$12.53

Cost
