
Understanding Sanitization Options in Migration App

Audience

Any user of the Migration App.

Context

Sanitizing renames files and folders so that they conform to Egnyte naming conventions. After every scan of the source, the Migration Details page lists any unsupported or conflicting file names. During sanitizing, every unsupported character is replaced with an underscore, which allows the affected files to be migrated. If you opt not to sanitize these files, they cannot be migrated to Egnyte.

For long-time users of Migration App, please be advised that sanitize is no longer a standalone (atomic) command. It is now offered as an option with every migration metacommand.

Options Dialog Box

When you select Migrate Data or True-Up Data, an options modal will appear:

Migration_App_Sanitizing_Understanding_1.jpg

  1. If you select "Yes, sanitize on the source", any unsupported characters in filenames will be changed to an underscore on the source before migrating.
  2. If you select "Yes, sanitize on the fly", unsupported characters in filenames will be changed during migration and nothing will be changed on the source.
  3. If you select "No, do not sanitize", this metacommand will skip sanitizing altogether. Any files with unsupported names will not be migrated.

Note that regardless of whether sanitize is selected, if there are no unsupported characters in files on the source, the metacommand will skip the step altogether.
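The effect of the three choices on a single file can be sketched in a few lines of Python. This is only an illustration of the behavior described above, not Migration App's implementation: the names `SanitizeChoice`, `Outcome`, `plan_file`, and `sanitize_name` are hypothetical, and the character set is a small subset used for demonstration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class SanitizeChoice(Enum):
    ON_SOURCE = "Yes, sanitize on the source"
    ON_THE_FLY = "Yes, sanitize on the fly"
    DO_NOT_SANITIZE = "No, do not sanitize"


# Illustrative subset of unsupported characters (assumption for this sketch).
UNSUPPORTED = set('\\:/"<>|*?')


def sanitize_name(name: str) -> str:
    """Replace each unsupported character with an underscore."""
    return "".join("_" if ch in UNSUPPORTED else ch for ch in name)


@dataclass
class Outcome:
    source_name: str                 # name on the source after the job
    destination_name: Optional[str]  # name on Egnyte; None if not migrated


def plan_file(name: str, choice: SanitizeChoice) -> Outcome:
    clean = sanitize_name(name)
    if clean == name:
        # Nothing to sanitize: the step is skipped whatever was selected.
        return Outcome(name, name)
    if choice is SanitizeChoice.ON_SOURCE:
        # The file is renamed on the source, then migrated.
        return Outcome(clean, clean)
    if choice is SanitizeChoice.ON_THE_FLY:
        # The source is untouched; the name changes during migration.
        return Outcome(name, clean)
    # No sanitizing: the unsupported file is not migrated.
    return Outcome(name, None)


for choice in SanitizeChoice:
    print(choice.value, "->", plan_file("budget?.xlsx", choice))
```

Modeling the outcome as a pair of names makes the difference between the first two options visible: both produce the same name on Egnyte, but only the first one changes the source.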

Once you have made your selections in the modal, click Migrate Data or True-Up Data and your migration job will proceed.

Sanitize Settings in Advanced Options

Alternatively, you can set your sanitize preference for a given migration job by using Advanced Options.

  1. Navigate to Advanced Options in the left column of the Migration Details page.
  2. Click on Sanitize to view the dialog box for this feature.

    Migration_App_Sanitizing_Understanding_2.jpg
  3. Choosing Always - On the Source means that any unsupported characters in filenames will be changed to an underscore on the source before migrating.
  4. Choosing Always - On-the-Fly means that unsupported characters in filenames will be changed during migration and nothing will be changed on the source.
  5. Choosing Never will skip sanitizing altogether.
  6. If you elect to keep the default setting of Ad Hoc, the UI will prompt you to make a choice every time you run a command.
  7. Once you have made your selections in the modal, click Save.
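One way to picture how the saved setting relates to the per-run dialog is the sketch below. It is an assumption-laden illustration, not the product's code: `SanitizeSetting`, `resolve_choice`, and the `prompt_user` callback are hypothetical names, and the returned strings simply reuse the option labels from the dialog box.

```python
from enum import Enum
from typing import Callable


class SanitizeSetting(Enum):
    ALWAYS_ON_SOURCE = "Always - On the Source"
    ALWAYS_ON_THE_FLY = "Always - On-the-Fly"
    NEVER = "Never"
    AD_HOC = "Ad Hoc"  # the default setting


# Map each fixed setting to the equivalent choice in the options dialog.
FIXED_CHOICES = {
    SanitizeSetting.ALWAYS_ON_SOURCE: "Yes, sanitize on the source",
    SanitizeSetting.ALWAYS_ON_THE_FLY: "Yes, sanitize on the fly",
    SanitizeSetting.NEVER: "No, do not sanitize",
}


def resolve_choice(setting: SanitizeSetting, prompt_user: Callable[[], str]) -> str:
    """Return the sanitize choice to apply for one command run."""
    if setting is SanitizeSetting.AD_HOC:
        # Ad Hoc: ask every time a command is run.
        return prompt_user()
    # Any other setting is applied without prompting.
    return FIXED_CHOICES[setting]


# With a fixed setting the prompt is never called.
print(resolve_choice(SanitizeSetting.NEVER, prompt_user=lambda: "unused"))
```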

Considerations

  • If there is nothing on the source that needs to be sanitized, Migration App will skip the step, regardless of what you have selected.
  • If the original data source is on Mac/Linux and is uploaded using the Windows CMM Agent, the special characters (\ , : , . , / , " , < , > , | , * , ?) will not be visible in Windows. This is a limitation of the Windows OS and cannot be fixed. The file itself will be visible and can be migrated, but on Egnyte its name will show a random character in place of the unsupported character.
  • The Unicode character '\u007F' is not recognized by Windows. As a result, a file with this character in its name will not be migrated and will not show up in the Migration Report. We are continuing to investigate the root cause.
  • If two files have names that are identical except for unsupported Unicode characters, sanitizing will replace each of those characters with '_', which means both files will end up with the same name.
    • Example: two files on the source are named:

      X3vKPFq␋avU2ygQT2h5UMpr
      X3vKPFq⇥avU2ygQT2h5UMpr

      Both then become:

      X3vKPFq_avU2ygQT2h5UMpr

    • If you choose to sanitize on the source, the second file will overwrite the first file, and only one file will be left to migrate.
    • If you choose to sanitize on the fly, the destination will instead contain one file with two versions.
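Because sanitizing on the source can overwrite a file in this situation, it may be worth checking a list of source names for collisions before running the job. The sketch below is a hypothetical pre-check written for this article, not a Migration App feature; `find_collisions` and `sanitize_name` are assumed names, and the character set is a small subset chosen to cover the example above.

```python
from collections import defaultdict

# Illustrative subset of unsupported characters (assumption for this sketch).
# \u240b and \u21e5 are the two characters used in the example above.
UNSUPPORTED = set('\\:/"<>|*?') | {"\u240b", "\u21e5", "\u2028", "\u2029"}


def sanitize_name(name):
    """Replace each unsupported character with an underscore."""
    return "".join("_" if ch in UNSUPPORTED else ch for ch in name)


def find_collisions(names):
    """Group source names by sanitized name; keep groups with more than one member."""
    groups = defaultdict(list)
    for name in names:
        groups[sanitize_name(name)].append(name)
    return {target: sources for target, sources in groups.items() if len(sources) > 1}


source_names = [
    "X3vKPFq\u240bavU2ygQT2h5UMpr",
    "X3vKPFq\u21e5avU2ygQT2h5UMpr",
    "notes.txt",
]
for target, sources in find_collisions(source_names).items():
    print(f"{len(sources)} source files would become {target!r}")
# prints: 2 source files would become 'X3vKPFq_avU2ygQT2h5UMpr'
```

Renaming one of the colliding files on the source before migrating avoids both the overwrite and the merged version history.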

Unsupported Characters

By default, sanitizing in Migration App replaces the following unsupported characters in filenames found on the source with an underscore ("_"):

Unsupported Standard Characters

a space at the beginning or end of a file name
\ : . / " < > | * ?

Note: Space means an actual space (" ")

Unsupported Unicode Characters

\u2028 \u2029 \u21b9 \u21e4 \u21e5 \u2409 \u240b
\u0001 \u0002 \u0003 \u0004 \u0005 \u0006 \u0007 \u0008
\u0009 \u000A \u000B \u000C \u000D \u000E \u000F
\u0010 \u0011 \u0012 \u0013 \u0014 \u0015 \u0017
\u0018 \u0019 \u001A \u001B \u001C \u001D \u001E \u001F
\u007F
\u0080 \u0081 \u0082 \u0083 \u0084 \u0085 \u0086 \u0087
\u0088 \u0089 \u008A \u008B \u008C \u008D \u008E \u008F
\u0090 \u0091 \u0092 \u0093 \u0094 \u0095 \u0096 \u0097
\u0098 \u0099 \u009A \u009B \u009C \u009D \u009E \u009F

These limitations are not imposed by Migration App independently; they reflect Egnyte's overall character support and file type limitations. To find out more, see the Help Desk article Unsupported Characters and File Types.
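For readers who want to preview which names sanitizing would change, the character lists above can be expressed as a small Python sketch. This is an approximation built from this article, not Egnyte's code: `sanitize_name` and the two set names are hypothetical. The sketch assumes only the first and last character are checked for a space, and it leaves out the period, because the article does not say whether every period or only a period in a particular position is affected.

```python
# Standard characters listed in this article, minus "." (see the note above).
UNSUPPORTED_STANDARD = set('\\:/"<>|*?')

# Unicode characters exactly as listed in this article (\u0016 is not listed).
UNSUPPORTED_UNICODE = (
    {chr(c) for c in range(0x01, 0x16)}    # \u0001 to \u0015
    | {chr(c) for c in range(0x17, 0x20)}  # \u0017 to \u001F
    | {chr(c) for c in range(0x7F, 0xA0)}  # \u007F to \u009F
    | {"\u2028", "\u2029", "\u21b9", "\u21e4", "\u21e5", "\u2409", "\u240b"}
)


def sanitize_name(name: str) -> str:
    """Return the name with each unsupported character replaced by '_'."""
    chars = [
        "_" if ch in UNSUPPORTED_STANDARD or ch in UNSUPPORTED_UNICODE else ch
        for ch in name
    ]
    # A space at the beginning or end of the name is also unsupported.
    if chars and chars[0] == " ":
        chars[0] = "_"
    if chars and chars[-1] == " ":
        chars[-1] = "_"
    return "".join(chars)


for original in ["Q3: plan?.docx", " draft ", "clean_name.txt"]:
    cleaned = sanitize_name(original)
    status = "unchanged" if cleaned == original else "would be renamed"
    print(f"{original!r} -> {cleaned!r} ({status})")
```

A name that comes back unchanged needs no sanitizing, which matches the case where Migration App skips the step.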

Questions? Feature Requests? Other Feedback?

If you have feature requests or suggestions, feel free to submit them on our Product Board, and be sure to mention Migration App in the text.

For more complex requests that would benefit from providing screenshots or other attachments, you may submit them to support@egnyte.com. Be sure to include Migration App in the email title. Our Support team will push your comments to the Product team for consideration.

 

 
