This article covers the usage of the Duplicate File Remediation tool in the Content Lifecycle section of Secure & Govern. This feature allows the user to create “duplicate file jobs” which can be used to identify and delete duplicate files within an individual Egnyte content source. Duplicate files can be deleted individually or in bulk based on how the user wants to address them. 

How to Use Duplicate File Remediation

To locate duplicate files, navigate to the Content Lifecycle tab.

Click on a folder location for which you would want to visualize the duplicate files. On the Duplicate Files widget, you can see the total number of files duplicated elsewhere and there will be two options:

  • Export list of duplicate files, which will generate a spreadsheet listing all of the duplicate files
  • Generate list of duplicate files, which will start the process of creating a duplicate file job through which you can delete duplicate files

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_1.jpeg

Once you select the Generate list option, you will be prompted to create a name for the list being generated.. Enter a name for your list and select Create. If you do not wish to create a job, select Close

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_2.jpeg

After creating a job, you can view all lists by clicking the View Duplicate Lists button in the top right corner of the Content Lifecycle View

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_3.jpeg

All current and previous lists will be shown in the list view, with any that are still searching for duplicates indicated with a spinning icon in the size column

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_4.jpeg

Each duplicate files list that has been created will be visible in the duplicate file lists view. The view will show the job name, the source specific to the job, the path the list was created from, the days left until the job expires, the total size of the duplicate files in that list and when the list was created. There will also be a menu available for each list with 3 options:

  • Show list - view and delete all of the duplicate files found
  • Refresh list - rescan the location to update the duplicate files list
  • Delete job - remove the duplicate files list

You can also Create a new list (which will be based off the last selected folder in the Content Lifecycle tree) or Close the file lists view

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_5.jpeg

If you select Show list, you will be presented with a view of all the duplicate files found. 

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_6.jpeg

Click any of the file names in the list for additional details regarding the duplicates for that specific file. Information for each file will include:

  • File name
  • Size
  • Location
  • Creation date

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_7.jpeg

In the file view, there will be several actions that can be taken:

  • Delete selected - delete all of the files selected from the list
  • Preview file - show the contents of the files
  • Show permissions - show the permissions of the file in the S&G permissions viewer
  • Export list - generate a spreadsheet listing all of the duplicate files in that view

When deleting files, you will be presented with a confirmation dialog and the option to enter comments. Those comments will be visible in the S&G audit logs.

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_8.jpeg

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_9.jpeg

While a file deletion is in progress, an icon will appear next to the files selected for deletion.

Secure_and_Govern_Content_Lifecycle_Duplicate_File_Remediation_10.jpeg

Files will remain in the “deleting” status until file deletion is confirmed and the page is refreshed.

Frequently Asked Questions

Q: What sources can I remediate duplicate files on?

A: Duplicate file remediation is available only for Egnyte sources

Q: How long is a duplicate file list available for?

A: Duplicate file lists are available for 30 days. Expired lists will still be visible in the list view until they are deleted.

Q: How many active duplicate file lists can I have?

A: There is a limit of 10 active duplicate file lists. To create a new list if 10 already exist, you will need to delete an existing list or wait for one of the active lists to expire.