Content Classification FAQs

What Does the Sensitive Content Icon Next to My Files in Collaborate Mean?

What Files Does Egnyte Secure & Govern Classify?

How Do Compressed Archives Get Classified?

Why is the Classified Content Size in the Sensitive Content Tab Smaller than the Source Size?

Which Types of Files Undergo OCR for Text Extraction During Advanced Classification?

How Many Custom Policies with Custom Keywords Can I Create?

Why Did the Progress Bar in the Sensitive Content Tab Get Reset?

 

What Does the Sensitive Content Icon Next to My Files in Collaborate Mean?

This icon indicates that the files contain basic sensitive information, including items such as Social Security Numbers, Credit Card Numbers, Banking Information and Passport Numbers. Users should carefully review the contents of these files before sharing them.

What Files Does Egnyte Secure & Govern Classify?

Within any paths that you have not excluded, Egnyte Secure & Govern classifies files that are under 65MB in size and have one of the file types or file extensions listed below. Files such as videos and email archives are not currently classified.

File type: Extension

Microsoft Office files since Office 2007: .docx .docm .dotx .dotm .xlsx .xlsm .xltx .xltm .xlsb .pptx .pptm .potx .potm .ppam .ppsm .ppsx .sldx .sldm
Legacy Microsoft Office formats: .doc .docb .doct .dot .xls .xlt .xlm .ppt .pot .pps
PostScript and PDF formats: .ps .pdf
Other spreadsheet formats: .csv .tsv
Email storage formats: .msg .mbox
OpenDocument formats: .odt .ods .odp
Rich text and simple text formats: .rtf .txt
Compressed archives (non-encrypted): .zip .7z .bz2 .cpio .jar .rar .tar .gz .xz .tgz
Image files: .jpg .jpeg .png .tif .tiff .bmp
Google Suite files: .gsheet .gdoc .gslides
Audio / Video files (Ultimate Plan only):
Video: .mp4 .mov .wmv .avi .mts .flv .f4v .mkv .mpg .mpeg .ogv .webm .qt
Audio: .mp3 .aac .ogg .flac .alac .wav .aiff .dsd .pcm .m4a .wma
Note: When first enabled, only audio/video files created in the last 90 days will be classified.
Other file types: .epub .html .xml .xhtml .ooxml .odf .dwg
Note: Classification of .dwg files requires the Specialized File Handler add-on.
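
The three criteria above (path not excluded, listed extension, size under 65MB) can be sketched as a small check. This is only an illustration, not Egnyte's implementation: the function names, the extension subset, and the reading of 65MB as 65 * 1024 * 1024 bytes are all assumptions.

```python
from pathlib import PurePosixPath
from typing import Iterable

# Assumption: "65MB" is read as 65 * 1024 * 1024 bytes; the article does not define the unit.
MAX_SIZE_BYTES = 65 * 1024 * 1024

# Illustrative subset of the extensions listed in the table above.
CLASSIFIABLE_EXTENSIONS = {".docx", ".xlsx", ".pptx", ".pdf", ".csv", ".txt", ".zip", ".png"}


def is_excluded(path: str, excluded_paths: Iterable[str]) -> bool:
    """Return True if path equals an excluded folder or lies beneath one."""
    for excluded in excluded_paths:
        prefix = excluded.rstrip("/")
        if path == prefix or path.startswith(prefix + "/"):
            return True
    return False


def is_classifiable(path: str, size_bytes: int, excluded_paths: Iterable[str] = ()) -> bool:
    """Hypothetical check combining the three criteria: path, extension, size."""
    if is_excluded(path, excluded_paths):
        return False
    extension = PurePosixPath(path).suffix.lower()
    if extension not in CLASSIFIABLE_EXTENSIONS:
        return False
    return size_bytes < MAX_SIZE_BYTES
```

For example, is_classifiable("/Shared/Finance/report.pdf", 10_000, ["/Shared/Archive"]) returns True, while the same file under /Shared/Archive returns False.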

 

How Do Compressed Archives Get Classified?

Egnyte Secure & Govern automatically unpacks compressed archives that are not password-protected. Any files inside that are under 65MB in size and have one of the above file extensions are classified. The entire archive is treated as a single sensitive file, and the classification results for the contained files are displayed together under the archive's file name in the Sensitive File Viewer.
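
As a rough illustration of this behavior, the sketch below walks a ZIP archive with Python's standard zipfile module and groups the eligible members under the archive's name. It is not Egnyte's implementation: summarize_archive, the extension subset, and the 65 * 1024 * 1024 byte limit are assumptions, and skipping individual encrypted entries is a simplification of the rule for password-protected archives.

```python
import io
import zipfile
from pathlib import PurePosixPath

MAX_SIZE_BYTES = 65 * 1024 * 1024  # assumption: binary megabytes
CLASSIFIABLE_EXTENSIONS = {".docx", ".pdf", ".csv", ".txt"}  # illustrative subset


def summarize_archive(archive_name: str, archive_file) -> dict:
    """Group eligible members under the archive's own name (hypothetical helper)."""
    eligible = []
    with zipfile.ZipFile(archive_file) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            if info.flag_bits & 0x1:
                # Bit 0 of the ZIP general-purpose flags marks an encrypted entry.
                continue
            extension = PurePosixPath(info.filename).suffix.lower()
            if extension in CLASSIFIABLE_EXTENSIONS and info.file_size < MAX_SIZE_BYTES:
                eligible.append(info.filename)
    return {"file": archive_name, "members": eligible}


# Build a small in-memory archive to try the helper.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("notes.txt", "SSN: 000-00-0000")
    archive.writestr("docs/report.pdf", "placeholder bytes")
    archive.writestr("tool.exe", "not a listed type")

result = summarize_archive("backup.zip", buffer)
```

Here result lists notes.txt and docs/report.pdf under backup.zip, and tool.exe is left out because its extension is not in the list.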

 

Why is the Classified Content Size in the Sensitive Content Tab Smaller than the Source Size?

The progress bar in the Sensitive Content tab may show a smaller amount of classifiable content than the total size of the added sources. This is because Egnyte Secure & Govern only classifies files that are inside paths that have not been excluded, have one of the valid file types or file extensions, and are under 65MB in size. The size shown in the progress bar is the amount of content in the added sources that meets these criteria.
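
A small worked example may make the difference concrete. The sketch below totals a made-up file listing twice: once for everything in the source, and once for only the files that meet the criteria. The function name, the sample files, the extension subset, and the byte value used for 65MB are assumptions for illustration.

```python
from pathlib import PurePosixPath

MAX_SIZE_BYTES = 65 * 1024 * 1024  # assumption: binary megabytes
CLASSIFIABLE_EXTENSIONS = {".pdf", ".csv", ".txt", ".docx"}  # illustrative subset


def content_sizes(files, excluded_paths=()):
    """Return (source_size, classifiable_size) in bytes for (path, size) pairs."""
    source_size = 0
    classifiable_size = 0
    for path, size in files:
        source_size += size
        excluded = any(
            path == e.rstrip("/") or path.startswith(e.rstrip("/") + "/")
            for e in excluded_paths
        )
        listed = PurePosixPath(path).suffix.lower() in CLASSIFIABLE_EXTENSIONS
        if not excluded and listed and size < MAX_SIZE_BYTES:
            classifiable_size += size
    return source_size, classifiable_size


# Hypothetical source contents: (path, size in bytes).
files = [
    ("/Shared/a.pdf", 2_000_000),             # counted
    ("/Shared/setup.exe", 5_000_000),         # extension not listed
    ("/Shared/Excluded/c.txt", 1_000),        # inside an excluded path
    ("/Shared/huge.csv", 70 * 1024 * 1024),   # over the size limit
]
source_size, classifiable_size = content_sizes(files, ["/Shared/Excluded"])
```

In this listing only a.pdf qualifies, so classifiable_size is 2,000,000 bytes while source_size is 80,401,320 bytes.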

 

Which Types of Files Undergo OCR for Text Extraction During Advanced Classification?

Within any paths that you have not excluded, Egnyte Secure & Govern performs OCR to extract text from images, PostScript files, and pages of PDF files in which photos or scans of documents are detected.

How Many Custom Policies with Custom Keywords Can I Create?

You may create up to 15 custom policies, each with its own list of custom keywords.

 

Why Did the Progress Bar in the Sensitive Content Tab Get Reset?

The classification progress bar in the Sensitive Content tab resets to zero if you either create a new custom policy containing custom keywords or modify the custom keywords in an existing policy. The new or modified keywords need to be matched against all the files in the added sources, which can take some time.

