Posted Thu, 29 Mar 2018 12:26:19 GMT by Casey Miller Director of Technical Services

Has anyone figured out a way to detect duplicate documents on a cloud system? With no access to the database, I can't figure a way out to do it.

Posted Thu, 29 Mar 2018 13:47:34 GMT by Josef Zayats

Find out your File Cabinet database table name (here is FCTABLE) and names of fields that determine your duplicity criteria  (here VENDOR_ID and INVOICE_NUMBER) 

1. Create a new field in the FC that will indicate non-duplicates (DSTATUS)

2.Create an Autoindex Workflow as following:

External Data Source: Database Connection

Iterated records are in External Data Source - THIS IS IMPORTANT!!!

your FC - no filter (unless fully understand the need of a filter)

Database Connection - Select GUID for your cloud database connection



Process First found Document - THIS IS IMPORTANT!!!

Assignment Index Replacement: DSTATUS=NOT DUPLICATE

3. Run the Workflow.

4.Run a Search, observe the result list - all duplicates should have field DSTATUS empty.

5.(Optional and to be used with care only if fully understand) Create a deletion Workflow


Posted Fri, 30 Mar 2018 12:05:14 GMT by Casey Miller Director of Technical Services

This is great, Josef. Thank you!

Posted Fri, 11 Jan 2019 15:09:40 GMT by Steve Shriver Application Specialist


This sounds interesting but clarify this for me a little. First, I am using an on premises system so I think what your saying is to set up a database connection using the same file cabinet as if it was an external source, and iterate this source using the SQL statement. Then configure the match codes and use the repcement option in the status field. Have I got it? You said that the 'Process first found document' is important, why is that? Could you send me a screen shot or two of how you have yours set up?

Many thanks, you have been very helpful. You can email me at if that is more convenient.

You must be signed in to post in this forum.