What files does Egnyte Secure & Govern classify?
How do compressed archives get classified?
Why is the classified content size in the Sensitive Content tab smaller than the source size?
Which types of files undergo OCR for text extraction during advanced classification?
How many custom policies with custom keywords can I create?
Why did the progress bar in the Sensitive Content tab get reset?
What files does Egnyte Secure & Govern classify?
Within any paths that you have not excluded, Egnyte Secure & Govern classifies files under 65MB in size that has any of the file types or file extensions listed below. Files such as videos and email archives are not classified as of now.
File type |
Extension |
Microsoft Office files since Office 2007 |
.docx .docm .dotx .dotm .xlsx .xlsm .xltx .xltm .xlsb .pptx .pptm .potx .potm .ppam .ppsm .ppsx .sldx .sldm |
Legacy Microsoft Office formats | .doc .docb .doct .dot .xls .xlt .xlm .ppt .pot .pps |
Postscript and PDF formats | .ps .pdf |
Other spreadsheet formats | .csv .tsv |
Email storage formats | .msg .mbox |
OpenDocument formats | .odt .ods .odp |
Rich text and simple text formats | .rtf .txt |
Compressed archives (non-encrypted) | .zip .7z .bz2 .cpio .jar .rar .tar .gz .xz .tgz |
Image files | .jpg .jpeg .png .tif .tiff .bmp |
Google Suite files | .gsheet .gdoc .gslides |
Other file types | .epub .html .xml .xhtml .ooxml .odf |
How do compressed archives get classified?
In the case of compressed archives that are not password-protected, Egnyte Secure & Govern will automatically unpack these archives. Any files inside that are under 65MB in size and have any of the above file extensions will be classified. The entire archive is treated as a single sensitive file and the classification results for the contained files will be displayed together under the archive's file name in the Sensitive File Viewer.
Why is the classified content size in the Sensitive Content tab smaller than the source size?
The progress bar in the Sensitive Content tab may indicate a smaller size of classifiable content than the sizes of the added sources. This is because Egnyte Secure & Govern will only classify files inside paths that have not been excluded, which belong to the list of valid file types and file extensions and are under 65MB in size. The size indicated in the progress bar is the amount of content in the added sources that matched these criteria.
Which types of files undergo OCR for text extraction during advanced classification?
Within any paths that you have not excluded, Egnyte Secure & Govern will perform OCR to extract text from images, postscript files, and pages of PDF files in which photos or scans of documents are detected.
How many custom policies with custom keywords can I create?
You may create up to 15 custom policies, each with their own list of custom keywords.
Why did the progress bar in the Sensitive Content tab get reset?
The classification progress bar in the Sensitive Content tab will regress to zero if you either create a new custom policy containing custom keywords or modify the custom keywords in an existing policy. The new or modified keywords need to be matched against all the files in the added sources, which can take some time.