Understanding Sanitization Options in Migration App

Audience

Any user of the Migration App.

Context

Sanitizing renames files and folders so that they conform to Egnyte naming conventions. After every scan of the source, the migration details page lists any unsupported or conflicting file names. During sanitizing, each unsupported character is replaced with an underscore, which allows the affected items to be migrated. Files with unsupported names that are not sanitized cannot be migrated to Egnyte.

For long-time users of Migration App: sanitize is no longer an atomic command. It is now offered as an option with every migration metacommand.

Options Dialog Box

When the Migrate Data or True-Up Data option is selected, a modal with additional options will appear:

Migration_App_Sanitizing_Understanding_1.jpg

  1. If you select Yes, sanitize on the source, any unsupported characters in filenames are changed to an underscore on the source before migrating.
  2. If you select Yes, sanitize on the fly, unsupported characters in filenames are changed during migration and nothing is changed on the source.
  3. If you select No, do not sanitize, the metacommand skips sanitizing altogether, and any files with unsupported names are not migrated.

Note that regardless of whether sanitize is selected, if there are no unsupported characters in files on the source, the metacommand will skip the step altogether.

After making the selections in the modal, click Migrate Data or True-Up Data to proceed with the migration job.
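The three options and the skip rule described in this section can be summarized as a small decision table. The sketch below is only a reading aid, not Migration App code: the `SanitizeOption` enum and the `describe_outcome` function are hypothetical names introduced here for illustration.

```python
from enum import Enum


class SanitizeOption(Enum):
    """The three choices shown in the options modal (labels from this article)."""
    ON_SOURCE = "Yes, sanitize on the source"
    ON_THE_FLY = "Yes, sanitize on the fly"
    NONE = "No, do not sanitize"


def describe_outcome(option: SanitizeOption, has_unsupported_names: bool) -> str:
    """Return a plain-language summary of what the article says will happen."""
    # The sanitize step is skipped whenever the source has nothing to sanitize,
    # regardless of the selected option.
    if not has_unsupported_names:
        return "sanitize step skipped"
    if option is SanitizeOption.ON_SOURCE:
        return "names changed on the source before migrating"
    if option is SanitizeOption.ON_THE_FLY:
        return "names changed during migration; source left unchanged"
    return "files with unsupported names are not migrated"


for choice in SanitizeOption:
    print(f"{choice.value}: {describe_outcome(choice, has_unsupported_names=True)}")
```

Modeling the choice as an enum keeps the three labels in one place, and the early return mirrors the rule that the step is skipped when the source needs no sanitizing.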

Sanitize Settings in Scheduler

  1. Navigate to Advanced Options > Scheduler and switch the toggle to On.
    Understanding santization options 1.png
  2. Select the desired sanitize option and click Save.
    Understanding santization options 2.png
    For other settings in the scheduler, refer to this article.  

Considerations

  • If no sanitization is required on the source, the Migration App will automatically skip the step, irrespective of the selected option.
  • If the original data source is on Mac/Linux and is uploaded using the Windows CMM Agent, the special characters (\ : . / " < > | * ?) will not be visible in Windows. This is a limitation of the Windows OS and cannot be fixed. The file will still be visible and can be migrated, but on Egnyte its name will appear with the unsupported character replaced by a random character.
  • The Unicode character '\u007F' cannot be recognized. As a result, a file with this character in its name will not be migrated and will not show up in the Migration Report, because it is not recognized by Windows. Egnyte will continue to seek a root cause for this.
  • If two files have identical names except for an unsupported Unicode character, sanitizing replaces that character with '_' in both names, so both files end up with the same name.
    • Example: two files on the source are named:

      X3vKPFq␋avU2ygQT2h5UMpr
      X3vKPFq⇥avU2ygQT2h5UMpr

      Both then become:

      X3vKPFq_avU2ygQT2h5UMpr

    • If the user selects Yes, sanitize on the source, the second file overwrites the first, and only one file is left to migrate.
    • If the user selects Yes, sanitize on the fly, the result is a single file with two versions on the destination.
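Name collisions of this kind can be spotted before a migration by grouping source names by their sanitized form. The following is a minimal sketch, assuming a simplified replacement rule that covers only the two characters from the example above (␋ is \u240b and ⇥ is \u21e5); `replace_example_chars` and `find_collisions` are hypothetical helpers, not part of Migration App.

```python
from collections import defaultdict

# Assumption: only the two characters used in the example above.
# The full list of unsupported characters is longer.
EXAMPLE_UNSUPPORTED = {"\u240b", "\u21e5"}


def replace_example_chars(name: str) -> str:
    """Replace each example unsupported character with an underscore."""
    return "".join("_" if ch in EXAMPLE_UNSUPPORTED else ch for ch in name)


def find_collisions(names):
    """Map each sanitized name to its source names, keeping only real clashes."""
    groups = defaultdict(list)
    for name in names:
        groups[replace_example_chars(name)].append(name)
    return {target: originals for target, originals in groups.items() if len(originals) > 1}


source_names = [
    "X3vKPFq\u240bavU2ygQT2h5UMpr",
    "X3vKPFq\u21e5avU2ygQT2h5UMpr",
    "notes.txt",
]
collisions = find_collisions(source_names)
for target, originals in collisions.items():
    print(f"{len(originals)} source files would become {target!r}")
```

Renaming one file of each reported pair on the source before migrating is one way to avoid the overwrite or the merged versions described above.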

Unsupported Characters

By default, sanitizing in Migration App replaces the following unsupported characters in filenames on the source with an underscore ("_"):

Unsupported Standard Characters

A space at the beginning or end of a file name
\ : . / " < > | * ?

Note: Space means an actual space (" ")

Unsupported Unicode Characters

\u2028 \u2029 \u21b9 \u21e4 \u21e5 \u2409 \u240b
\u0001 \u0002 \u0003 \u0004 \u0005 \u0006 \u0007 \u0008
\u0009 \u000A \u000B \u000C \u000D \u000E \u000F \u0010
\u0011 \u0012 \u0013 \u0014 \u0015 \u0017 \u0018 \u0019
\u001A \u001B \u001C \u001D \u001E \u001F \u007F
\u0080 \u0081 \u0082 \u0083 \u0084 \u0085 \u0086 \u0087
\u0088 \u0089 \u008A \u008B \u008C \u008D \u008E \u008F
\u0090 \u0091 \u0092 \u0093 \u0094 \u0095 \u0096 \u0097
\u0098 \u0099 \u009A \u009B \u009C \u009D \u009E \u009F

These limitations are not imposed by Migration App independently; they reflect Egnyte's overall character support and file type limitations. To find out more, see the Helpdesk article Unsupported Characters and File Types.
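The replacement rule described in this section can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Migration App's actual implementation: `sanitize_name` is a hypothetical helper, the character sets are copied from the lists in this article, and the period is assumed to be unsupported only at the end of a name (the article does not spell this out, and replacing every period would also alter file extensions).

```python
# Standard characters listed in this article (the period is handled separately below).
UNSUPPORTED_STANDARD = set('\\:/"<>|*?')

# Unicode characters listed in this article.
UNSUPPORTED_UNICODE = (
    {chr(c) for c in (0x2028, 0x2029, 0x21B9, 0x21E4, 0x21E5, 0x2409, 0x240B)}
    | {chr(c) for c in range(0x01, 0x16)}  # \u0001 to \u0015
    | {chr(c) for c in range(0x17, 0x20)}  # \u0017 to \u001F (\u0016 is not in the list)
    | {chr(0x7F)}
    | {chr(c) for c in range(0x80, 0xA0)}  # \u0080 to \u009F
)


def sanitize_name(name: str) -> str:
    """Return a copy of a single file or folder name with unsupported characters replaced by '_'."""
    chars = [
        "_" if ch in UNSUPPORTED_STANDARD or ch in UNSUPPORTED_UNICODE else ch
        for ch in name
    ]
    # A space is unsupported only at the beginning or end of the name.
    if chars and chars[0] == " ":
        chars[0] = "_"
    # Assumption: a period is treated as unsupported only in the final position.
    if chars and chars[-1] in (" ", "."):
        chars[-1] = "_"
    return "".join(chars)


print(sanitize_name("budget:2024?.xlsx"))
print(sanitize_name(" draft report."))
```

The function works on one name at a time rather than a full path, because the slash and backslash in the list would otherwise match path separators.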

Questions? Feature Requests? Other Feedback?

For any feature requests or suggestions, feel free to submit them on our Product Board, and be sure to mention Migration App in the text.

For more complex requests that would benefit from providing screenshots or other attachments, submit them to support@egnyte.com. Be sure to include Migration App in the email title. The Support team will push the comments to the Product team for consideration.

 

 
