Unicode normalization (NFC/NFD)?

Hi,

macOS stores file names in a decomposed Unicode normalization form (NFD), which differs from the composed form (NFC) that other OSes like Windows use. Linux file systems, for example, can hold both forms. These differences may lead to problems like duplicate or invisible files when shared with Samba.
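To illustrate the difference, here is a minimal Python sketch using the standard `unicodedata` module. The file name `café.jpg` is only an illustrative example, not one taken from an actual sync. It shows that the same visible name has two different code point sequences, and that normalizing both to NFC makes them equal again.

```python
import unicodedata

# The same visible name, once composed (NFC) and once decomposed (NFD).
# "\u00e9" is the precomposed character "é".
name_nfc = unicodedata.normalize("NFC", "caf\u00e9.jpg")
name_nfd = unicodedata.normalize("NFD", "caf\u00e9.jpg")

print(name_nfc == name_nfd)          # False: the code points differ
print(len(name_nfc), len(name_nfd))  # 8 9: NFD splits "é" into "e" + combining accent
print(name_nfc.encode("utf-8"))      # b'caf\xc3\xa9.jpg'
print(name_nfd.encode("utf-8"))      # b'cafe\xcc\x81.jpg'

# Normalizing both sides to one form makes the names comparable again.
print(unicodedata.normalize("NFC", name_nfd) == name_nfc)  # True
```

A tool that compares names code point by code point, without normalizing first, would treat these as two different files.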

For compatibility reasons, it would be nice to have some sort of Unicode normalization during upload on macOS (to NFC), maybe as a configurable setting.

Rclone, for example, has two flags (--local-unicode-normalization and --no-unicode-normalization) to force Unicode normalization to NFC during upload or to force using the same form as the file system.

So my question is, how does Insync handle Unicode normalization? Will it just upload files with the same form as used by the file system, or is some normalising happening in the background? If not, might a setting like this be an option for the upcoming releases?

Thank you.

Hello! Let me check this with our engineer and I’ll update accordingly!

@Buhnie Apologies for not updating sooner! I will check once more with our team.

Thank you, @mia. I have the impression that this NFC/NFD dilemma is leading to a lot of duplicate “space 1” files (‘file.jpg’ uploaded as ‘file 1.jpg’).

@Buhnie Thank you for patiently waiting!

Could you let us know what your use case is for further investigation?

Is there any update on this?

It would be great to have some documentation on the expected behaviour when using Insync with e.g. Google Drive and a mix of macOS and Linux systems.

For example, I have run into this when trying to check consistency between Insync’s copies of my Google Drive on macOS and Linux using rsync (dry run!). The normalisation behaviour does not seem very consistent. For example, running something like convmv -r -f utf8 -t utf8 --nfc . on both the macOS and Linux copies maintained by Insync does not give consistent results.
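As a rough alternative to comparing the two copies with rsync, a check along these lines could be sketched in Python. The helper names (`nfc_relative_paths`, `non_nfc_paths`, `compare_trees`) are hypothetical and written for this illustration; this is not how Insync or rsync work internally. It compares only file names, not contents, and assumes both trees are reachable from one machine and use the same path separator.

```python
import os
import unicodedata


def relative_paths(root):
    """Yield every file path under root, relative to root, as stored on disk."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            yield os.path.relpath(os.path.join(dirpath, filename), root)


def nfc_relative_paths(root):
    """Return the set of relative file paths under root, normalized to NFC."""
    return {unicodedata.normalize("NFC", rel) for rel in relative_paths(root)}


def non_nfc_paths(root):
    """Return the relative paths under root whose stored form is not NFC."""
    return sorted(
        rel for rel in relative_paths(root)
        if unicodedata.normalize("NFC", rel) != rel
    )


def compare_trees(root_a, root_b):
    """Return (only_in_a, only_in_b) after normalizing all names to NFC."""
    names_a = nfc_relative_paths(root_a)
    names_b = nfc_relative_paths(root_b)
    return sorted(names_a - names_b), sorted(names_b - names_a)
```

Calling `compare_trees(mac_copy, linux_copy)` with the two sync folders would list files that are missing on either side even after normalization, while `non_nfc_paths(folder)` lists the names that are stored in a form other than NFC.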

Hi @olupton! We have not addressed this improvement yet. I have noted this for our engineer’s reference, but cannot guarantee when it’ll be included in our pipeline. Rest assured that posting on our Forums like you did helps with visibility so other users can chime in and therefore help bump this up in the priority list. Thank you for your understanding!