Sie sind offline. Dies ist eine schreibgeschützte Version der Seite.
Toggle navigation
Foren
Knowledge Base
Status
Downloads
Supportanfragen
Alle
Alle
Webseiten
Foren
Knowledge Base Artikel
Suchfilter
Alle
Webseiten
Foren
Knowledge Base Artikel
Antworten finden
Deutsch
Deutsch
Español
Français
English
日本語
Anmelden
Home
Community Foren
Questions about Usage and Configuration
Conversion from PDF to JPEG for better results before send to tray
Conversion from PDF to JPEG for better results before send to tray
Veröffentlicht
Mon, 23 Nov 2020 07:29:29 GMT
von
Tiago Matos
CoreForm Business Technology -
Software Implementation Specialist
Hi all
We have an customer that is already using DocuWare but it is not happy with the manual work required once the tray OCR result don't bring supplier name and ABN(Australia Business Number) from document header.
I was checking the PDF on adobe reader and looks like the header of the PDF is a image and the rest of the document is a normal PDF structure.
After some tests we figure out that converting the PDF to JPEG before importing to tray, improved the OCR result bringing the supplier name and the ABN.
So we are thinking to provide an solution that is going to convert the PDFs to JPEGs before importing into the tray.
This issue is only happening for just some suppliers documents. But they receive a lot of documents from these suppliers.
The customer is receiving all documents via email so one solution could be:
Solution A:
- Create an app the is going to read the account payable email inbox folder
- For each email
- Get the attached PDFs and convert to JPEG.
- Create a new email with the same subject and email body
- Attached the JPEG images
- Move the original email to /pdf folder
- Send email to the final email that DocuWare is reading
Please feel free to give any advice or ask any question.
Quick note. (I imported the same document into Amazon Textract and the OCR result was much better.)
Thanks
Tiago
Screenshot 2020-11-23 182731.png (472,3 KB)
Veröffentlicht
Wed, 25 Nov 2020 03:43:35 GMT
von
Craig Heintz
SE
Try this
KBA-36285
I have found it overcomes the poor ocr on pdf image files. This is mainly needed for v7+ as it used to be set to "true" in version 6
Veröffentlicht
Wed, 25 Nov 2020 05:09:15 GMT
von
Tiago Matos
CoreForm Business Technology -
Software Implementation Specialist
Hi Craig,
Thanks for helping.
Sorry but I forgot to say that we are not using Desktop Apps to import documents into DocuWare.
We are getting the documents from customer's gmail inbox folder.
Also we are using DocuWare Cloud.
Thanks
Tiago
Sie müssen angemeldet sein um Beiträge in den Foren zu erstellen.
Hilfe erhalten
Suche in Knowledge Base
Community fragen
Neue Supportanfrage