After setting up the content classification policies in the Settings page, Egnyte Secure and Govern starts matching content in the sources against the selected policies. The user does not have to wait for the content classification process to complete to view the results; matched files and folders are surfaced in the Sensitive Content tab as soon as they are found.
This tab shows a sortable list of locations in the connected sources which contain files with sensitive content. For each location, the user can see the content source, the number of sensitive files immediately within the location, the level of risk associated with the sensitive content and the last updated timestamp.
The sortable list of locations is limited to displaying 250,000 results. The user can fine tune the results in order to get under the limit by using the filter options located at the top.
Sensitive Content Risk
Users can find the locations with the most sensitive content using their Risk scores. Egnyte Secure and Govern calculates a Risk score for each file and each location on a scale of 1 to 9 based on the type of contained sensitive content. Locations with more files with more sensitive content will have higher Risk scores. For example, a folder containing files with highly sensitive information such as employee Social Security numbers will have Risk scores closer to 9, while a folder containing just files with telephone numbers will have Risk scores closer to 1.
Location Details
When the user clicks on the row for a location, they can see more details about the sensitive content found inside the location.
These details include:
- The content classification policies that were matched with the files inside the location. For example, if a file inside a folder matched the PCI-DSS policy, the PCI-DSS tag would be visible in the Details pane when you click on the folder’s row.
- The complete list of sensitive files in the folder and the actual sensitive information found in them.
- A link to open the location outside Egnyte Secure and Govern in another browser tab in the case of Egnyte Connect and SharePoint Online sources.
The User can also use a tool to permit any sensitive content detected in the location, as long as it matches certain classification policies. For example, the Finance folder may be permitted to have sensitive content matching the GLBA built-in policy but it is not permitted to contain internal R&D files that are matched by a custom policy.
Content Classification Progress
The process of classifying content in all the sources is quite intensive, especially when the user turns on content classification for the first time. Egnyte Secure and Govern can take anywhere from a few hours to a week, depending on the size of the sources, to perform an initial classification of all the content. Users can see the progress of the content classification process in the Sensitive Content tab.
Egnyte Secure and Govern only classifies certain files in the content sources.
Skipped Files
Even among the classifiable files in the content sources, Egnyte Secure and Govern may not be able to classify certain files. These include files that have been password-protected, corrupt files, or files that could not be processed for some other reason. The users can view the count of such skipped files in the progress bar. They can also break down the count by the reason why the file was not classified or export the list of these skipped files as a CSV.
Limitations
The export limits currently applicable are as listed below:
- Skipped files - 2.5M rows (50 files with 50k rows per file)
- Filtered locations - 2.5M rows (50 files with 50k rows per file)
- Files in filtered locations - 2.5M rows (50 files with 50k rows per file)
- All locations - no limit
- File labeling error report - 500k rows in 1 file