Our website deals with 10s of THOUSANDS of Organisations, each of them will have their own native PDF content. From that organisation we may require about 6 types of documents data.
We will have a non coder configure the data they need to extract in a nice UX experience (with suggestions from our code) - SAVE that structure to that organisation ID. I do like coders that read spex so the key WORD here each communication with me will be 'JSON'. Then every PDF that is dragged on the the HTML5 page will call the ORG structure and document type - extract the data required.
A very important part of the data extract is the LINES / GRID data that forms a commercial invoice.
We will not use image OCR to extract data as the accuracy of the data is critical.
About the recuiterMember since Jul 26, 2017 Apparel Wholesa
from Bergamo, Italy